METHODS OF DIAGNOSIS OF LUNG CANCER, COMPOSITIONS AND METHODS
OF SCREENING FOR MODULATORS OF LUNG CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application is related to USSN 60/284,770, filed April 18, 2001; USSN 
60/290,492, filed May 10, 2001; USSN 60/334,370, filed November 29, 2001; USSN 
60/339,245, filed November 9, 2001; USSN 60/350,666, filed November 13, 2001; and 
USSN 60/xxx,xxx, filed April 12, 2002 (Docket OMNI-002P); each of which is incorporated
herein by reference in its entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein expression 
profiles and nucleic acids, products, and antibodies thereto that are involved in lung cancer;
and to the use of such expression profiles and compositions in diagnosis and therapy of lung 
cancer. The invention further relates to methods for identifying and using agents and/or 
targets that inhibit lung cancer or related conditions. 

BACKGROUND OF THE INVENTION

Lung cancer is the second most commonly occurring cancer in the United States and 
is the leading cause of cancer-related death. It is estimated that there are over 160,000 new 
cases of lung cancer in the United States every year. Of those who are diagnosed with lung 
cancer, 86 percent will die within five years. Lung cancer is the most common visceral
cancer in men and accounts for nearly one third of all cancer deaths in both men and women.
In fact, lung cancer accounts for 7% of all deaths, due to any cause, in both men and women. 

Smoking is the primary cause of lung cancer, with more than 80% of lung cancers 
resulting from smoking. About 400 to 500 separate gaseous substances are present in the 
smoke of a non-filter cigarette. The most noteworthy substances include nitrogen oxides,
hydrogen cyanide, formaldehyde, benzene, and toluene. The particles present in cigarette
smoke contain at least 3,500 individual compounds such as nicotine, tobacco alkaloids
(nornicotine, anatabine, anabasine), polycyclic aromatic hydrocarbons (e.g., benzo(a)pyrene,
B(a)P), naphthalenes, aromatic amines, phenols, and tobacco-specific nitrosamines.

Tobacco-specific nitrosamines are formed during tobacco curing and processing, and
are suspected of causing lung cancer in humans. In rodent studies, regardless of where or
how it is applied, the tobacco-specific nitrosamine known as NNK produces lung adenomas
and lung adenocarcinomas. The tobacco-specific nitrosamine known as NNAL also produces
lung adenocarcinomas in rodents.

Many of the chemicals found in cigarette smoke also affect the nonsmoker inhaling 
"secondhand" or sidestream smoke. Indeed, the smoke inhaled by non-smokers has a 
chemical composition similar to the smoke inhaled by smokers, but, importantly, the
carcinogenic tobacco-specific nitrosamines are present in higher
concentrations in secondhand smoke. For this and other reasons, "passive smoking" is an
important cause of lung cancer, causing as many as 3,000 lung cancer deaths in nonsmokers 
each year. 

In addition to smoking, other factors thought to be causes of lung cancer include
on-the-job exposure to carcinogens such as asbestos and uranium, exposure to chemical hazards
such as radon, polycyclic aromatic hydrocarbons, chromium, nickel, and inorganic arsenic,
genetic factors, and diet.

Histological classification of various lung cancers defines the types of cancer that
begin in the lung. See, e.g., Travis, et al. (1999) Histological Typing of Lung and Pleural
Tumours (International Histological Classification of Tumours, No. 1). Four major cell types
make up more than 88% of all primary lung neoplasms. These are: squamous or epidermoid
carcinoma, small cell (also called oat cell) carcinoma, adenocarcinoma, and large cell (also
called large cell anaplastic) carcinoma. The remainder include undifferentiated carcinomas,
carcinoids, bronchial gland tumors, and other rarer types. The various cell types have
different natural histories and responses to therapy, and, thus, a correct histologic diagnosis is
the first step of effective treatment.

Small cell lung cancer (SCLC) accounts for 18-25% of all lung cancers. It occurs
less frequently than non-small cell lung cancer and generally spreads to distant organs more
rapidly than non-small cell lung cancer. In general, at the time of presentation small cell lung
cancers have already spread beyond the bounds where surgery with curative intent
can be undertaken. However, if identified early enough, these cancers are often responsive to
chemotherapy and thoracic radiation treatment.

Non-small cell lung cancers (NSCLC) are the more frequently occurring form of lung
cancer. They comprise squamous cell carcinoma, adenocarcinoma, and large cell carcinoma
and account for more than 75% of all lung cancers. Non-small cell tumors that are localized
at the time of presentation can sometimes be cured with surgery and/or radiotherapy, but
usually are not identified until significant metastasis has occurred, at which point they are
typically not very responsive to surgery, chemotherapy, or radiation treatment.

The screening of asymptomatic persons at high risk for lung cancer has often proven
ineffective. In general, only 5 to 15 percent of lung cancer patients have their disease
detected while they are asymptomatic. Of course, early detection and treatment are critical
factors in the fight against lung cancer. The average survival rate is 49% for those whose
cancer is detected early, before the cancer has spread from the lung. Lung cancer often
spreads outside of the lung, and it may have spread to the bones or brain by the time it is
diagnosed. While the prognosis may be better for lung cancers that are detected early,
because of the lack of effective curative treatments, early detection does not necessarily alter
the total death rate from lung cancer.

Thus, methods for diagnosis and prognosis of lung cancer and effective treatment of
lung cancer would be desirable. Accordingly, provided herein are methods that can be used
in diagnosis and prognosis of lung cancer. Further provided are methods that can be used to 
screen candidate therapeutic agents for the ability to modulate, e.g., treat, lung cancer. 
Additionally, provided herein are molecular targets and compositions for therapeutic 
intervention in lung disease and other metastatic cancers. 

SUMMARY OF THE INVENTION 
The present invention provides nucleotide sequences of genes that are up- and down- 
regulated in lung cancer cells. Such genes are useful for diagnostic purposes, and also as 
targets for screening for therapeutic compounds that modulate lung cancer, such as
antibodies. The methods of detecting nucleic acids of the invention or their encoded proteins
can be used for a number of purposes. Examples include early detection of lung cancers,
monitoring and early detection of relapse following treatment of lung cancers, monitoring
response to therapy of lung cancers, determining prognosis of lung cancers, directing therapy
of lung cancers, selecting patients for postoperative chemotherapy or radiation therapy,
selecting therapy, determining tumor prognosis, treatment, or response to treatment, and early
detection of precancerous lesions of the lung. Examples of benign or precancerous lesions
include: atelectasis, emphysema, bronchitis, chronic obstructive pulmonary disease, fibrosis,
hypersensitivity pneumonitis (HP), interstitial pulmonary fibrosis (IPF), asthma, and
bronchiectasis. Other aspects of the invention will become apparent to the skilled artisan by
the following description of the invention. 

In one aspect, the present invention provides a method of detecting a lung cancer- 
associated transcript in a cell from a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridizes to a sequence at
least 80% identical to a sequence as shown in Tables 1A-16. Alternatively, the sample may
be contacted with a specific binding reagent, e.g., antibody. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 
95% identical to a sequence as shown in Tables 1A-16. In another embodiment, the 
polynucleotide comprises a sequence as shown in Tables 1A-16.

In one embodiment, the biological sample is a tissue sample, or a body fluid. In 
another embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. In 
one embodiment, the polynucleotide is immobilized on a solid surface. In one embodiment, 
the patient is undergoing a therapeutic regimen to treat lung cancer. In another embodiment,
the patient is suspected of having lung cancer. In one embodiment, the patient is a primate, 
e.g., a human. 

In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the efficacy
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of a lung cancer-associated transcript in the biological sample by contacting the
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80%
identical to a sequence as shown in Tables 1A-16, thereby monitoring the efficacy of the
therapy. Alternatively, the sample may be evaluated for protein, e.g., by contacting the sample
with an antibody.

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated transcript to a level of the lung cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment. Alternatively, the sample may be evaluated for comparison of protein.

In another aspect, the present invention provides a method of monitoring the efficacy 
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of a lung cancer-associated antibody in the biological sample by contacting the
biological sample with a polypeptide encoded by a polynucleotide that selectively hybridizes
to a sequence at least 80% identical to a sequence as shown in Tables 1A-16, wherein the
polypeptide specifically binds to the lung cancer-associated antibody, thereby monitoring the
efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated antibody to a level of the lung cancer-associated antibody 
in a biological sample from the patient prior to, or earlier in, the therapeutic treatment. 

In another aspect, the present invention provides a method of monitoring the efficacy
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of a lung cancer-associated polypeptide in the biological sample by contacting the
biological sample with an antibody, wherein the antibody specifically binds to a polypeptide
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical
to a sequence as shown in Tables 1A-16, thereby monitoring the efficacy of the therapy.

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated polypeptide to a level of the lung cancer-associated 
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

In one aspect, the present invention provides an isolated nucleic acid molecule
consisting of a polynucleotide sequence as shown in Tables 1A-16. In one embodiment, an
expression vector or cell comprises the isolated nucleic acid. In one aspect, the present
invention provides an isolated polypeptide which is encoded by a nucleic acid molecule
having a polynucleotide sequence as shown in Tables 1A-16.

In another aspect, the present invention provides an antibody that specifically binds to
an isolated polypeptide which is encoded by a nucleic acid molecule having a polynucleotide
sequence as shown in Tables 1A-16. In one embodiment, the antibody is conjugated to an
effector component, e.g., a fluorescent label, a radioisotope or a cytotoxic chemical. In one
embodiment, the antibody is an antibody fragment. In another embodiment, the antibody is
humanized.

In one aspect, the present invention provides a method of detecting lung cancer in a
patient, the method comprising contacting a biological sample from the patient with an
antibody or protein as described herein.

In another aspect, the present invention provides a method of detecting antibodies
specific to a lung cancer gene in a patient, the method comprising contacting a biological
sample from the patient with a polypeptide encoded by a nucleic acid that comprises a sequence
from Tables 1A-16.

In another aspect, the present invention provides a method for identifying a compound
that modulates a lung cancer-associated polypeptide, the method comprising the steps of: (i)
contacting the compound with a lung cancer-associated polypeptide, the polypeptide encoded
by a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a
sequence as shown in Tables 1A-16; and (ii) determining the functional effect of the
compound upon the polypeptide.

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. In one
embodiment, the functional effect is determined by measuring ligand binding to the
polypeptide.

In another aspect, the present invention provides a method of inhibiting proliferation 
or another critical process of a lung cancer-associated cell to treat lung cancer in a patient, the 
method comprising the step of administering to the subject a therapeutically effective amount 
of a compound identified as described herein. In one embodiment, the compound is an 
antibody.

In another aspect, the present invention provides a drug screening assay comprising 
the steps of: (i) administering a test compound to a mammal having lung cancer or a cell 
isolated therefrom; (ii) comparing the level of gene expression of a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables
1A-16 in a treated cell or mammal with the level of gene expression of the polynucleotide in
a control cell or mammal, wherein a test compound that modulates the level of expression of 
the polynucleotide is a candidate for the treatment of lung cancer. 

In one embodiment, the control is a mammal with lung cancer or a cell therefrom that 
has not been treated with the test compound. In another embodiment, the control is a normal 
cell or mammal, or a non-malignant lung disease.

In another aspect, the present invention provides a method for treating a mammal 
having lung cancer comprising administering a compound identified by the assay described 
herein.

In another aspect, the present invention provides a pharmaceutical composition for
treating a mammal having lung cancer, the composition comprising a compound identified by 
the assay described herein and a physiologically acceptable excipient. 

DETAILED DESCRIPTION OF THE INVENTION

In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and treatment of lung disease or cancer, as well as methods for 
screening for compositions which modulate lung cancer. "Treatment, monitoring, detection 
or modulation of lung disease or cancer" includes treatment, monitoring, detection, or
modulation of lung disease in those patients who have lung disease (whether malignant or
non-malignant, e.g., emphysema, bronchitis, or fibrosis) as well as patients with lung cancers
in which gene expression from a gene in Tables 1A-16 is increased or decreased, indicating
that the subject is more likely to have disease. In particular, while these targets are identified
primarily from lung cancer samples, these same targets are likely to be similarly found in
analyses of other medical conditions. These other conditions may result from similar
pathological processes which affect similar tissues, e.g., lung cancer, small cell lung
carcinoma (oat cell carcinoma), non-small cell carcinomas (e.g., squamous cell carcinoma,
adenocarcinoma, large cell lung carcinoma, carcinoid, granulomatous), fibrosis (idiopathic
pulmonary fibrosis (IPF), hypersensitivity pneumonitis (HP), interstitial pneumonitis,
nonspecific idiopathic pneumonitis (NSIP)), chronic obstructive pulmonary disease (COPD,
e.g., emphysema, chronic bronchitis), asthma, bronchiectasis, and esophageal cancer. See, 
e.g., the NCI webpage and USSN 60/347,349 and USSN 60/xxx,xxx (docket LFBR-001-1P, 
filed March 29, 2002), each of which is incorporated herein by reference. The treatment may 
be of lung cancer or related condition itself, or treatment of metastasis. 

In particular, identification of markers selectively expressed on these cancers allows
for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, the
invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and
small molecule agonists/antagonists, which will be useful to selectively identify those
markers. For example, therapeutic methods may take the form of protein therapeutics which
use the marker expression for selective localization or modulation of function (for those
markers which have a causative disease effect), for vaccines, identification of binding
partners, or antagonism, e.g., using antisense or RNAi. The markers may be useful for
molecular characterization of subsets of lung diseases, which subsets may actually require
very different treatments. Moreover, the markers may also be important in diseases related to
the specific cancers, e.g., which affect similar tissues in non-malignant diseases, or have
similar mechanisms of induction/maintenance. Metastatic processes or characteristics may
also be targeted. Diagnostic and prognostic uses are made available, e.g., to subset related
but distinct diseases, or to determine treatment strategy. The detection methods may be based
upon nucleic acid, e.g., PCR or hybridization techniques, or protein, e.g., ELISA, imaging, 
IHC, etc. The diagnosis may be qualitative or quantitative, and may detect increases or 
decreases in expression levels. 

Tables 1A-16 provide unigene cluster identification numbers for the nucleotide
sequence of genes that exhibit increased or decreased expression in lung cancer samples. The
tables also provide an exemplar accession number that provides a nucleotide sequence that is
part of the unigene cluster. In Table 1A, genes marked as "target 1" or "target 2" are
particularly useful as therapeutic targets. Genes marked as "target 3" are particularly useful
as diagnostic markers. Genes marked as "chron" are upregulated in chronically diseased lung
(e.g., emphysema, bronchitis, fibrosis) relative to lung tumors and normal tissue. In certain
analyses, the ratio for the "chron" category was determined using the 70th percentile of
chronically diseased lung samples divided by the 90th percentile of normal lung samples.
The ratio for the targets was determined using the 70th percentile of lung tumor samples 
divided by the 90th percentile of normal lung samples. 
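
The ratio calculation described above can be sketched in Python as follows. This is only an illustration: the function names, the linear-interpolation percentile rule, and the sample values are assumptions, since the text does not state how the percentiles were computed.

```python
import math


def percentile(values, pct):
    """Return the pct-th percentile of values (0-100).

    Assumption: linear interpolation between the two nearest ranks.
    """
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    # Fractional rank into the sorted list (0-based).
    rank = pct * (len(ordered) - 1) / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    if lower == upper:
        return ordered[lower]
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


def expression_ratio(disease_values, normal_values,
                     disease_pct=70, normal_pct=90):
    """Ratio of the disease-sample percentile to the normal-sample percentile.

    The defaults mirror the 70th/90th percentiles named in the text.
    """
    denominator = percentile(normal_values, normal_pct)
    if denominator == 0:
        raise ZeroDivisionError("normal-sample percentile is zero")
    return percentile(disease_values, disease_pct) / denominator


# Hypothetical expression intensities for one gene.
tumor = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110]
normal = [2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22]
ratio = expression_ratio(tumor, normal)
```

Comparing a moderate percentile of diseased samples against a high percentile of normal samples is a conservative choice: a gene scores well only when a substantial share of diseased samples exceeds nearly all normal samples.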

Definitions 

The term "lung cancer protein" or "lung cancer polynucleotide" or "lung cancer- 
associated transcript" refers to nucleic acid and polypeptide polymorphic variants, alleles, 
mutants, and interspecies homologs that: (1) have a nucleotide sequence that has greater than
about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%,
92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or greater nucleotide sequence identity,
preferably over a region of at least about 25, 50, 100, 200, 500, 1000, or
more nucleotides, to a nucleotide sequence of or associated with a unigene cluster of Tables
1A-16; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a
unigene cluster of Tables 1A-16, and conservatively modified variants thereof; (3)
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or
the complement thereof, of Tables 1A-16 and conservatively modified variants thereof; or (4)
have an amino acid sequence that has greater than about 60% amino acid sequence identity,
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%,
or 99% or greater amino acid sequence identity, preferably over a region of at
least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an amino acid sequence
encoded by a nucleotide sequence of or associated with a unigene cluster of Tables 1A-16. A
polynucleotide or polypeptide sequence is typically from a mammal including, but not 
limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or 
other mammal. A "lung cancer polypeptide" and a "lung cancer polynucleotide," include 
both naturally occurring or recombinant forms. 

A "full length" lung cancer protein or nucleic acid refers to a lung cancer polypeptide
or polynucleotide sequence, or a variant thereof, that contains the elements normally
contained in one or more naturally occurring, wild type lung cancer polynucleotide or 
polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
translational processing or splicing, including alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that
contains nucleic acids or polypeptides, e.g., of a lung cancer protein, polynucleotide, or
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g.,
humans, or rodents, e.g., mice and rats. Biological samples may also include sections of
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes,
archival materials, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc.
Biological samples also include explants and primary and/or transformed cell cultures
derived from patient tissues. A biological sample is typically obtained from a eukaryotic
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow;
dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or other mammal; or a bird, reptile,
or fish. Livestock and domestic animals are of interest.

"Providing a biological sample" means to obtain a biological sample for use in
methods described in this invention. Most often, this will be done by removing a sample of
cells from an animal, but can also be accomplished by using previously isolated cells (e.g.,
isolated by another person, at another time, and/or for another purpose), or by performing the
methods of the invention in vivo. Archival tissues or materials, having treatment or outcome
history, will be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic 
acids or polypeptide sequences, refer to two or more sequences or subsequences that are the 
same or have a specified percentage of amino acid residues or nucleotides that are the same
(e.g., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%,
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and
aligned for maximum correspondence over a comparison window or designated region) as
measured using, e.g., a BLAST or BLAST 2.0 sequence comparison algorithm with default
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to
be "substantially identical." This definition also refers to, or may be applied to, the
complement of a test sequence. The definition also includes sequences that have deletions
and/or insertions, substitutions, and naturally occurring, e.g., polymorphic or allelic variants,
and man-made variants. As described below, the preferred algorithms can account for gaps 
and the like. Preferably, identity exists over a region that is at least about 25 amino acids or 
nucleotides in length, or more preferably over a region that is 50-100 amino acids or 
nucleotides in length. 
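
As a rough illustration of the percent identity definition above, the following Python sketch counts matching positions between two sequences that have already been aligned. The function name, the "-" gap symbol, and the choice to count gapped columns in the denominator are assumptions; conventions for the denominator vary between tools.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns in which both sequences carry the same residue.

    Assumptions: both inputs are pre-aligned strings of equal length, gaps are
    written as '-', and columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap and residue_b == gap:
            continue  # column carries no information
        compared += 1
        if residue_a == residue_b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# One substitution in eight aligned nucleotides.
identity_substitution = percent_identity("ACGTACGT", "ACGTTCGT")
# One gap column in five aligned positions.
identity_gap = percent_identity("AC-GT", "ACTGT")
```

A caller could then compare the result against a chosen threshold (for example, 80% or 95%) over a region of the preferred minimum length.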

For sequence comparison, typically one sequence acts as a reference sequence, to
which test sequences are compared. When using a sequence comparison algorithm, test and
reference sequences are entered into a computer, subsequence coordinates are designated, if
necessary, and sequence algorithm program parameters are designated. Preferably, default
program parameters can be used, or alternative parameters can be designated. The sequence
comparison algorithm then calculates the percent sequence identities for the test sequences
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of
contiguous positions selected from the group consisting typically of from 20 to 600, usually
about 50 to about 200, more usually about 100 to about 150 in which a sequence may be
compared to a reference sequence of the same number of contiguous positions after the two
sequences are optimally aligned. Methods of alignment of sequences for comparison are
well-known in the art. Optimal alignment of sequences for comparison can be conducted,
e.g., by the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math.
2:482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol.
48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l. Acad.
Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT,
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer
Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see,
e.g., Ausubel, et al. (eds. 1995 and supplements) Current Protocols in Molecular Biology).
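
To make the local homology approach concrete, the following is a minimal Python sketch of Smith-Waterman-style local alignment scoring. It is a simplified illustration, not the implementation in any of the packages named above: the scoring values (match 2, mismatch -1, gap -2) and the linear gap penalty are assumptions chosen for readability.

```python
def smith_waterman_score(seq_a, seq_b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between two sequences.

    Assumptions: simple match/mismatch scoring and a linear gap penalty.
    Only two rows of the dynamic-programming table are kept in memory.
    """
    cols = len(seq_b) + 1
    previous = [0] * cols
    best = 0
    for i in range(1, len(seq_a) + 1):
        current = [0] * cols
        for j in range(1, cols):
            substitution = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            diagonal = previous[j - 1] + substitution  # align two residues
            up = previous[j] + gap                     # gap in seq_b
            left = current[j - 1] + gap                # gap in seq_a
            # Flooring at zero lets a local alignment restart anywhere.
            current[j] = max(0, diagonal, up, left)
            best = max(best, current[j])
        previous = current
    return best


score_identical = smith_waterman_score("ACGT", "ACGT")
score_embedded = smith_waterman_score("TTACGTTT", "GGACGTGG")
score_unrelated = smith_waterman_score("AAAA", "TTTT")
```

Flooring each cell at zero is what distinguishes local from global alignment: a shared region embedded in otherwise unrelated sequence scores the same as the region on its own.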

Preferred examples of algorithms that are suitable for determining percent sequence 
identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are 
described in Altschul, et al. (1997) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990)
J. Mol. Biol. 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described 
herein, to determine percent sequence identity for the nucleic acids and proteins of the 
invention. Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short
words of length W in the query sequence, which either match or satisfy some positive-valued
threshold score T when aligned with a word of the same length in a database sequence. T is
referred to as the neighborhood word score threshold (Altschul, et al., supra). These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing
them. The word hits are extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word
hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring residue alignments; or the end of
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences)
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a
comparison of both strands. For amino acid sequences, the BLASTP program uses as
defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix
(see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89:10915) alignments (B) of
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
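
The word-hit extension step described above can be sketched in Python as follows. This is a simplified, ungapped illustration and not the BLAST source code: the function names and the X value of 20 are assumptions, while the reward M=5 and penalty N=-4 are the nucleotide defaults stated in the text.

```python
def _extend(query, subject, q_idx, s_idx, step, start_score,
            reward, penalty, x_drop):
    """Extend in one direction; return (best score, residues kept)."""
    score = best = start_score
    best_len = 0
    length = 0
    while 0 <= q_idx < len(query) and 0 <= s_idx < len(subject):
        score += reward if query[q_idx] == subject[s_idx] else penalty
        length += 1
        if score > best:
            best, best_len = score, length
        # Halt when the score falls X below its maximum or reaches zero.
        if best - score >= x_drop or score <= 0:
            break
        q_idx += step
        s_idx += step
    return best, best_len


def extend_word_hit(query, subject, q_pos, s_pos, word_len,
                    reward=5, penalty=-4, x_drop=20):
    """Extend a seed word hit in both directions without gaps.

    Returns (score, query_start, query_end) with query_end exclusive.
    """
    seed_score = sum(
        reward if query[q_pos + k] == subject[s_pos + k] else penalty
        for k in range(word_len)
    )
    right_best, right_len = _extend(
        query, subject, q_pos + word_len, s_pos + word_len, 1,
        seed_score, reward, penalty, x_drop)
    left_best, left_len = _extend(
        query, subject, q_pos - 1, s_pos - 1, -1,
        right_best, reward, penalty, x_drop)
    return left_best, q_pos - left_len, q_pos + word_len + right_len


# Hypothetical sequences sharing the region ACGTACGT; the seed is ACGT.
query = "CCCACGTACGTTTT"
subject = "GGGACGTACGTAAA"
hsp = extend_word_hit(query, subject, q_pos=3, s_pos=3, word_len=4)
```

A larger X tolerates longer runs of mismatches before giving up, which trades speed for sensitivity, consistent with the statement that W, T, and X determine the sensitivity and speed of the alignment.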

The BLAST algorithm also performs a statistical analysis of the similarity between
two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nat'l. Acad. Sci. USA
90:5873-5877). One measure of similarity provided by the BLAST algorithm is the smallest sum
probability (P(N)), which provides an indication of the probability by which a match between
two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid
is considered similar to a reference sequence if the smallest sum probability in a comparison
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably
less than about 0.01, and most preferably less than about 0.001. Log values may be large
negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.

An indication that two nucleic acid sequences are substantially identical is that the
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a
polypeptide is typically substantially identical to a second polypeptide, e.g., where the two
peptides differ only by conservative substitutions. Another indication that two nucleic acid
sequences are substantially identical is that the two molecules or their complements hybridize
to each other under stringent conditions. Yet another indication that two nucleic acid
sequences are substantially identical is that the same primers can be used to amplify the
sequences.

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is
substantially or essentially free from components that normally accompany it as found in its
native state. Purity and homogeneity are typically determined using analytical chemistry
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from
some open reading frames that naturally flank the gene and encode proteins other than the
protein encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means
that the nucleic acid or protein is at least about 85% pure, more preferably at least 95% pure,
and most preferably at least 99% pure. "Purify" or "purification" in other embodiments
means removing at least one contaminant or component from the composition to be purified.
In this sense, purification does not require that the purified compound be homogeneous, e.g.,
100% pure.

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which
one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing
modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" refers to naturally occurring and synthetic amino acids, as well
as amino acid analogs and amino acid mimetics that function similarly to the naturally
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic
code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refer to compounds that have
the same basic chemical structure as a naturally occurring amino acid, e.g., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic
chemical structure as a naturally occurring amino acid. Amino acid mimetics refer to
chemical compounds that have a structure that is different from the general chemical
structure of an amino acid, but that function similarly to another amino acid.

Amino acids may be referred to herein by either their commonly known three letter
symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly
accepted single-letter codes.

"Conservatively modified variants" applies to both amino acid and nucleic acid
sequences. With respect to particular nucleic acid sequences, conservatively modified
variants refers to those nucleic acids which encode identical or essentially identical amino
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode
most proteins. For instance, the codons GCA, GCC, GCG, and GCU each encode the amino
acid alanine. Thus, at each position where an alanine is specified by a codon, the codon can
be altered to another of the corresponding codons described without altering the encoded
polypeptide. Such nucleic acid variations are "silent variations," which are one species of
conservatively modified variations. Every nucleic acid sequence herein which encodes a
polypeptide also describes silent variations of the nucleic acid. In certain contexts each
codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and
TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a
functionally similar molecule. Accordingly, a silent variation of a nucleic acid which
encodes a polypeptide is implicit in a described sequence with respect to the expression
product, but not necessarily with respect to actual probe sequences.
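
As a non-limiting illustration of silent variations, the following sketch enumerates every coding sequence obtainable by swapping alanine codons among GCA, GCC, GCG, and GCU. Only the alanine codon family is taken from the description above; the function name, the synonym table layout, and the example sequence are hypothetical, and codons absent from the table are simply left unchanged.

```python
from itertools import product

# The four alanine codons named in the text (RNA alphabet).
ALANINE_CODONS = ("GCA", "GCC", "GCG", "GCU")

# Hypothetical synonym table: each alanine codon maps to the whole family.
SYNONYMS = {codon: ALANINE_CODONS for codon in ALANINE_CODONS}


def silent_variants(codons, synonyms):
    """Return all sequences that differ only by synonymous codon choices.

    Codons not present in `synonyms` (e.g., AUG) are kept as they are.
    """
    choices = [synonyms.get(codon, (codon,)) for codon in codons]
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # Met-Ala-Ala: one fixed codon followed by two alanine positions.
    variants = silent_variants(["AUG", "GCA", "GCU"], SYNONYMS)
    print(len(variants))  # 1 * 4 * 4 combinations
    print(variants[:4])
```

Each returned sequence encodes the same polypeptide as the input, which is the sense in which a silent variation is implicit in a described sequence.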

As to amino acid sequences, one of skill will recognize that individual substitutions,
deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which
alter, add or delete a single amino acid or a small percentage of amino acids in the encoded
sequence are "conservatively modified variants" where the alteration results in the substitution
of an amino acid with a chemically similar amino acid. Conservative substitution tables
providing functionally similar amino acids are well known in the art. Such conservatively
modified variants are in addition to and do not exclude polymorphic variants, interspecies
homologs, and alleles of the invention. Typically, conservative substitutions for one
another include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine
(N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine
(M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S),
Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)).
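
The eight substitution groups listed above lend themselves to a simple lookup. The following is a minimal sketch, assuming one-letter amino acid codes; the names `CONSERVATIVE_GROUPS` and `is_conservative` are hypothetical. Methionine appears in two groups (5 and 8), so the check tests every group rather than mapping each residue to a single group.

```python
# The eight groups from the text, as sets of one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1) Alanine, Glycine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7) Serine, Threonine
    set("CM"),    # 8) Cysteine, Methionine
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two distinct residues share at least one listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("I", "L"))  # same group (5)
    print(is_conservative("M", "C"))  # same group (8)
    print(is_conservative("C", "V"))  # no shared group
```

Note that membership is not transitive across groups: cysteine and valine each pair with methionine but not with each other.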

Macromolecular structures such as polypeptide structures can be described in terms of
various levels of organization. For a general discussion of this organization, see, e.g.,
Alberts, et al. (1994) Molecular Biology of the Cell (3rd ed.) and Cantor and Schimmel (1980)
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules. "Primary
structure" refers to the amino acid sequence of a particular peptide. "Secondary structure"
refers to locally ordered, three dimensional structures within a polypeptide. These structures
are commonly known as domains. Domains are portions of a polypeptide that often form a
compact unit of the polypeptide and are typically 25 to approximately 500 amino acids long.
Typical domains are made up of sections of lesser organization such as stretches of β-sheet
and α-helices. "Tertiary structure" refers to the complete three dimensional structure of a
polypeptide monomer. "Quaternary structure" refers to the three dimensional structure
formed, usually by the noncovalent association of independent tertiary units. Anisotropic
terms are also known as energy terms.

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents
used herein means at least two nucleotides covalently linked together. Oligonucleotides are
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up
to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers of any
length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000,
etc. A nucleic acid of the present invention will generally contain phosphodiester bonds,
although in some cases, nucleic acid analogs are included that may have at least one different
linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or
O-methylphosphoroamidite linkages (see Eckstein (1992) Oligonucleotides and Analogues: A
Practical Approach Oxford University Press); and peptide nucleic acid backbones and
linkages. Other analog nucleic acids include those with positive backbones; non-ionic
backbones, and non-ribose backbones, including those described in U.S. Patent Nos.
5,235,033 and 5,034,506, and Chapters 6 and 7, in Sanghvi and Cook, eds. Carbohydrate
Modifications in Antisense Research, ACS Symposium Series 580. Nucleic acids containing
one or more carbocyclic sugars are also included within one definition of nucleic acids.
Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to
increase the stability and half-life of such molecules in physiological environments or as
probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made;
alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring
nucleic acids and analogs may be made.

Particularly preferred are peptide nucleic acids (PNA), which include peptide nucleic
acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
This results in two advantages. First, the PNA backbone exhibits improved hybridization
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4° C drop in Tm for an
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9° C.
Similarly, due to their non-ionic nature, hybridization of the bases attached to these
backbones is relatively insensitive to salt concentration. In addition, PNAs are not degraded
by cellular enzymes, and thus can be more stable.

The nucleic acids may be single stranded or double stranded, as specified, or contain
portions of both double stranded and single stranded sequence. As will be appreciated by those
in the art, the depiction of a single strand also defines the sequence of the complementary
strand; thus the sequences described herein also provide the complement of the sequence.
The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the
nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations
of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine,
hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally
occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes
non-naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic
acid, each containing a base, are referred to herein as a nucleoside.
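
The statement above that the depiction of a single strand also defines its complementary strand can be illustrated with a short sketch. It assumes a DNA sequence written 5' to 3' using only A, C, G, and T (upper or lower case); the function name is hypothetical, and any other character (e.g., an ambiguity code or a modified base) is passed through unchanged rather than interpreted.

```python
# Watson-Crick pairing for the four standard DNA bases.
DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the complementary strand, also written 5' to 3'."""
    return sequence.translate(DNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))     # GCAT
    print(reverse_complement("GATTACA"))  # TGTAATC
```

Applying the function twice returns the original sequence, reflecting that each strand defines the other.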

A "label" or a "detectable moiety" is a composition detectable by spectroscopic,
photochemical, biochemical, immunochemical, physiological, chemical, or other physical
means. For example, useful labels include 32P, fluorescent dyes, electron-dense reagents,
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins
or other entities which can be made detectable, e.g., by incorporating a radiolabel into the
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be
incorporated into the cancer nucleic acids, proteins, and antibodies. Many methods known in
the art for conjugating the antibody to the label may be employed, including those methods
described by Hunter, et al. (1962) Nature 144:945; David, et al. (1974) Biochemistry
13:1014-1021; Pain, et al. (1981) J. Immunol. Meth. 40:219-230; and Nygren (1982) J.
Histochem. and Cytochem. 30:407-412.

An "effector" or "effector moiety" or "effector component" is a molecule that is
bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody.
The "effector" can be a variety of molecules including, e.g., detection moieties including
radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a
radioisotope emitting "hard" e.g., beta radiation.

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding
partners binds to the other, e.g., biotin and streptavidin.

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of
binding to a target nucleic acid of complementary sequence through one or more types of
chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond
formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases
(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a
linkage other than a phosphodiester bond, preferably one that does not functionally interfere
with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent
bases are joined by peptide bonds rather than phosphodiester linkages. Probes may bind
target sequences lacking complete complementarity with the probe sequence depending upon
the stringency of the hybridization conditions. The probes are preferably directly labeled,
e.g., with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, e.g., with
biotin to which a streptavidin complex may later bind. By assaying for the presence or
absence of the probe, one can detect the presence or absence of the select sequence or
subsequence. Diagnosis or prognosis may be based at the genomic level, or at the level of
RNA or protein expression.

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid,
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant
cells express genes that are not found within the native (non-recombinant) form of the cell or
express native genes that are otherwise abnormally expressed, under expressed or not
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid,
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using
polymerases and endonucleases, in a form not normally found in nature. In this manner,
operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear
form, or an expression vector formed in vitro by ligating DNA molecules that are not
normally joined, are both considered recombinant for the purposes of this invention. It is
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the
host cell rather than in vitro manipulations; however, such nucleic acids, once produced
recombinantly, although subsequently replicated non-recombinantly, are still considered
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein
made using recombinant techniques, i.e., through the expression of a recombinant nucleic
acid as depicted above.

The term "heterologous" when used with reference to portions of a nucleic acid
indicates that the nucleic acid comprises two or more subsequences that are not normally
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g.,
a fusion protein).

A "promoter" is typically an array of nucleic acid control sequences that direct
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid
sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor
elements, which can be located as much as several thousand base pairs from the start site of
transcription. A "constitutive" promoter is a promoter that is active under most
environmental and developmental conditions. An "inducible" promoter is a promoter that is
active under environmental or developmental regulation. The term "operably linked" refers
to a functional linkage between a nucleic acid expression control sequence (such as a
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence,
e.g., wherein the expression control sequence directs transcription of the nucleic acid
corresponding to the second sequence.

An "expression vector" is a nucleic acid construct, generated recombinantly or
synthetically, with a series of specified nucleic acid elements that permit transcription of a
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be
transcribed in operable linkage to a promoter.

The phrase "selectively (or specifically) hybridizes to" refers to the binding,
duplexing, or hybridizing of a molecule selectively to a particular nucleotide sequence under
stringent hybridization conditions when that sequence is present in a complex mixture (e.g.,
total cellular or library DNA or RNA).

The phrase "stringent hybridization conditions" refers to conditions under which a
probe will hybridize to its target subsequence, typically in a complex mixture of nucleic
acids, but to essentially no other sequences. Stringent conditions are sequence-dependent and
will be different in different circumstances. Longer sequences hybridize specifically at
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in
"Overview of principles of hybridization and the strategy of nucleic acid assays" in Tijssen
(1993) Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic
Probes (vol. 24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C
lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength
and pH. The Tm is the temperature (under defined ionic strength, pH, and nucleic acid
concentration) at which 50% of the probes complementary to the target hybridize to the target
sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the
probes are occupied at equilibrium). Stringent conditions will be those in which the salt
concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion
concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for
short probes (e.g., 10 to 50 nucleotides) and at least about 60° C for long probes (e.g., greater
than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing
agents such as formamide. For selective or specific hybridization, a positive signal is
typically at least two times background, preferably 10 times background hybridization.
Exemplary stringent hybridization conditions are often: 50% formamide, 5x SSC, and 1%
SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 0.2x SSC,
and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low stringency
amplification, although annealing temperatures may vary between about 32° C and 48° C
depending on primer length. For high stringency PCR amplification, a temperature of about
62° C is typical, although high stringency annealing temperatures can range from about 50° C
to about 65° C, depending on the primer length and specificity. Typical cycle conditions for
both high and low stringency amplifications include a denaturation phase of 90° C - 95° C for
0.5 - 2 min., an annealing phase lasting 0.5 - 2 min., and an extension phase of about 72° C
for 1 - 2 min. Protocols and guidelines for low and high stringency amplification reactions
are provided, e.g., in Innis, et al. (1990) PCR Protocols: A Guide to Methods and
Applications.
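
Several of the numerical guidelines above can be collected into a small sketch: the 5-10° C window below Tm, the minimum temperatures for short and long probes, and the signal-to-background criterion. All function names are hypothetical. The Tm is assumed to be supplied by the user (no Tm formula is given in this description), probes of 50 nucleotides or fewer are treated as "short" as an assumption, and the word "about" is not modeled.

```python
def stringent_temperature_range(tm_celsius: float) -> tuple:
    """Return (low, high) in deg C: 10 to 5 degrees below the supplied Tm."""
    return (tm_celsius - 10.0, tm_celsius - 5.0)


def minimum_stringent_temperature(probe_length: int) -> float:
    """Lower bound from the text: 30 deg C for short probes, 60 deg C for long.

    Assumption: any probe of 50 nucleotides or fewer counts as short.
    """
    if probe_length <= 0:
        raise ValueError("probe length must be positive")
    return 30.0 if probe_length <= 50 else 60.0


def is_positive_signal(signal: float, background: float, fold: float = 2.0) -> bool:
    """True if the signal is at least `fold` times background (2x by default)."""
    return signal >= fold * background


if __name__ == "__main__":
    print(stringent_temperature_range(70.0))        # (60.0, 65.0)
    print(minimum_stringent_temperature(25))        # 30.0
    print(is_positive_signal(250.0, 100.0))         # True at the 2x criterion
    print(is_positive_signal(250.0, 100.0, 10.0))   # False at the preferred 10x
```

The default `fold` of 2.0 corresponds to the "at least two times background" criterion; passing 10.0 applies the preferred criterion instead.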

Nucleic acids that do not hybridize to each other under stringent conditions are still
substantially identical if the polypeptides which they encode are substantially identical. This
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy
permitted by the genetic code. In such cases, the nucleic acids typically hybridize under
moderately stringent hybridization conditions. Exemplary "moderately stringent
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37° C, and a wash in 1X SSC at 45° C. A positive hybridization is at least twice
background. Alternative hybridization and wash conditions can be utilized to provide
conditions of similar stringency. Additional guidelines for determining hybridization
parameters are provided in numerous references, e.g., Ausubel, et al. (ed.) Current Protocols in
Molecular Biology Lippincott.

The phrase "functional effects" in the context of assays for testing compounds that
modulate activity of a lung cancer protein includes the determination of a parameter that is
indirectly or directly under the influence of the lung cancer protein or nucleic acid, e.g., a
physiological, enzymatic, functional, physical, or chemical effect, such as the ability to
decrease lung cancer. It includes ligand binding activity; cell viability, cell growth on soft
agar; anchorage dependence; contact inhibition and density limitation of growth; cellular
proliferation; cellular transformation; growth factor or serum dependence; tumor specific
marker levels; invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and
protein expression in cells undergoing metastasis, and other characteristics of lung cancer
cells. "Functional effects" include in vitro, in vivo, and ex vivo activities.

By "determining the functional effect" is meant assaying for a compound that
increases or decreases a parameter that is indirectly or directly under the influence of a lung
cancer protein sequence, e.g., physiological, functional, enzymatic, physical, or chemical
effects. Such functional effects can be measured by many means known to those skilled in
the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance,
refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for
the protein, measuring inducible markers or transcriptional activation of the lung cancer
protein; measuring binding activity or binding assays, e.g., binding to antibodies or other
ligands, and measuring cellular proliferation. Determination of the functional effect of a
compound on lung cancer can also be performed using lung cancer assays known to those of
skill in the art such as in vitro assays, e.g., cell growth on soft agar; anchorage
dependence; contact inhibition and density limitation of growth; cellular proliferation;
cellular transformation; growth factor or serum dependence; tumor specific marker levels;
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein
expression in cells undergoing metastasis, and other characteristics of lung cancer cells. The
functional effects can be evaluated by many means known to those skilled in the art, e.g.,
microscopy for quantitative or qualitative measures of alterations in morphological features,
measurement of changes in RNA or protein levels for lung cancer-associated sequences,
measurement of RNA stability, identification of downstream or reporter gene expression
(CAT, luciferase, β-gal, GFP, and the like), e.g., via chemiluminescence, fluorescence,
colorimetric reactions, antibody binding, inducible markers, and ligand binding assays.

"Inhibitors", "activators", and "modulators" of lung cancer polynucleotide and
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or
compounds identified using in vitro and in vivo assays of lung cancer polynucleotide and
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the
activity or expression of lung cancer proteins, e.g., antagonists. Antisense or inhibitory
nucleic acids may seem to inhibit expression and subsequent function of the protein.
"Activators" are compounds that increase, open, activate, facilitate, enhance activation,
sensitize, agonize, or up regulate lung cancer protein activity. Inhibitors, activators, or
modulators also include genetically modified versions of lung cancer proteins, e.g., versions
with altered activity, as well as naturally occurring and synthetic ligands, antagonists,
agonists, antibodies, small chemical molecules and the like. Such assays for inhibitors and
activators include, e.g., expressing the lung cancer protein in vitro, in cells, or cell
membranes, applying putative modulator compounds, and then determining the functional
effects on activity, as described above. Activators and inhibitors of lung cancer can also be
identified by incubating lung cancer cells with the test compound and determining increases
or decreases in the expression of 1 or more lung cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20,
25, 30, 40, 50 or more lung cancer proteins, such as lung cancer proteins encoded by the
sequences set out in Tables 1A-16.

Samples or assays comprising lung cancer proteins that are treated with a potential
activator, inhibitor, or modulator are compared to control samples without the inhibitor,
activator, or modulator to examine the extent of inhibition. Control samples (untreated with
inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide
is achieved when the activity value relative to the control is about 80%, preferably 50%, more
preferably 25-0%. Activation of a lung cancer polypeptide is achieved when the activity
value relative to the control (untreated with activators) is 110%, more preferably 150%, more
preferably 200-500% (i.e., two to five fold higher relative to the control), more preferably
1000-3000% higher.
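
The comparison to control described above can be sketched as follows, purely by way of example. The function names and the "no call" label are hypothetical. It is assumed here that inhibition means a relative activity at or below 80% and activation means a relative activity at or above 110%; the description gives those figures as representative values rather than as formal inequalities, and the preferred tiers are not modeled.

```python
def relative_activity(treated: float, control: float) -> float:
    """Express treated-sample activity as a percentage of the untreated control."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * treated / control


def classify_modulation(percent_of_control: float) -> str:
    """Assumed reading of the text: <= 80% inhibition, >= 110% activation."""
    if percent_of_control <= 80.0:
        return "inhibition"
    if percent_of_control >= 110.0:
        return "activation"
    return "no call"


if __name__ == "__main__":
    pct = relative_activity(treated=40.0, control=80.0)
    print(pct, classify_modulation(pct))  # 50.0 inhibition
    print(classify_modulation(100.0))     # no call
    print(classify_modulation(150.0))     # activation
```

Values between the two cutoffs are left unclassified because the description does not characterize them.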

The phrase "changes in cell growth" refers to any change in cell growth and
proliferation characteristics in vitro or in vivo, such as cell viability, formation of foci,
anchorage independence, semi-solid or soft agar growth, changes in contact inhibition and
density limitation of growth, loss of growth factor or serum requirements, changes in cell
morphology, gaining or losing immortalization, gaining or losing tumor specific markers,
ability to form or suppress tumors when injected into suitable animal hosts, and/or
immortalization of the cell. See, e.g., Freshney (1994) Culture of Animal Cells: A Manual of
Basic Technique pp. 231-241 (3rd ed.).

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor.
"Cancer cells," "transformed" cells, or "transformation" in tissue culture, refers to
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new
genetic material. Although transformation can arise from infection with a transforming virus
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene.
Transformation is associated with phenotypic changes, such as immortalization of cells,
aberrant growth control, nonmorphological changes, and/or malignancy (see Freshney
(1994) Culture of Animal Cells: A Manual of Basic Technique (3rd ed.)).

"Antibody" refers to a polypeptide comprising a framework region from an
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen.
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta,
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG,
IgM, IgA, IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or
its functional equivalent will be most critical in specificity and affinity of binding. See Paul,
Fundamental Immunology.

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each
tetramer is composed of two identical pairs of polypeptide chains, each pair having one
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each
chain defines a variable region of about 100 to 110 or more amino acids primarily responsible
for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VH)
refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an
antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab
which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby
converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab
with part of the hinge region (see Paul (ed. 1999) Fundamental Immunology (4th ed.)). While
various antibody fragments are defined in terms of the digestion of an intact antibody, one of
skill will appreciate that such fragments may be synthesized de novo either chemically or by
using recombinant DNA methodology. Thus, the term antibody, as used herein, also includes
antibody fragments either produced by the modification of whole antibodies, or those
synthesized de novo using recombinant DNA methodologies (e.g., single chain Fv) or those
identified using phage display libraries (see, e.g., McCafferty, et al. (1990) Nature
348:552-554).

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques known in the art can be used (see, e.g., Kohler and Milstein
(1975) Nature 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al.
(1985) pp. 77-96 in Monoclonal Antibodies and Cancer Therapy; Coligan (1991 and
supplements) Current Protocols in Immunology; Harlow and Lane (1988) Antibodies, A
Laboratory Manual; and Goding (1986) Monoclonal Antibodies: Principles and Practice (2d
ed.)). Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can
be adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or
other organisms such as other mammals, may be used to express humanized antibodies.
Alternatively, phage display technology can be used to identify antibodies and heteromeric
Fab fragments that specifically bind to selected antigens (see, e.g., McCafferty, et al. (1990)
Nature 348:552-554; Marks, et al. (1992) Biotechnology 10:779-783).

A "chimeric antibody" is an antibody molecule in which, e.g., (a) the constant region,
or a portion thereof, is altered, replaced, or exchanged so that the antigen binding site
(variable region) is linked to a constant region of a different or altered class, effector
function, and/or species, or an entirely different molecule which confers new properties to the
chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the
variable region, or a portion thereof, is altered, replaced, or exchanged with a variable region
having a different or altered antigen specificity.

Identification of lung cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different patient
samples for which diagnosis information is desired, to provide expression profiles. An
expression profile of a particular sample is essentially a "fingerprint" of the state of the
sample; while two states may have any particular gene similarly expressed, the evaluation of
a number of genes simultaneously allows the generation of a gene expression profile that is
characteristic of the state of the cell. That is, normal tissue may be distinguished from
cancerous or metastatic cancerous tissue, or metastatic cancerous tissue can be compared
with tissue from surviving cancer patients. By comparing expression profiles of tissue in
known different lung cancer states, information regarding which genes are important
(including both up- and down-regulation of genes) in each of these states is obtained.
Molecular profiling may distinguish subtypes of a currently collective disease designation,
e.g., different forms of lung cancer (chronic disease, adenocarcinoma, etc.).
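
The "fingerprint" comparison described above can be illustrated with a minimal sketch. It assumes each profile is a list of expression values for the same ordered set of genes and uses Pearson correlation as the similarity measure; the measure, the function names, and the numeric values are illustrative assumptions and are not taken from the specification.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of expression values."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def closest_state(sample, reference_profiles):
    """Return the name of the reference state whose profile correlates best with the sample."""
    return max(
        reference_profiles,
        key=lambda state: pearson(sample, reference_profiles[state]),
    )


# Hypothetical expression values for the same four genes in two known states.
REFERENCE_PROFILES = {
    "normal": [1.0, 1.2, 0.9, 1.1],
    "tumor": [4.0, 0.3, 2.5, 0.2],
}

print(closest_state([3.5, 0.4, 2.0, 0.3], REFERENCE_PROFILES))
```

In practice a profile would cover many more genes, and any other similarity or classification method could be substituted for the correlation used here.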

The identification of sequences that are differentially expressed in lung cancer versus
non-lung cancer tissue allows the use of this information in a number of ways. For example,
a particular treatment regime may be evaluated: does a chemotherapeutic drug act to
down-regulate lung cancer, and thus tumor growth or recurrence, in a particular patient?
Alternatively, a treatment step may induce other markers which may be used as targets to
destroy tumor cells. Similarly, diagnosis and treatment outcomes may be done or confirmed
by comparing patient samples with the known expression profiles. Malignant disease may be
compared to non-malignant conditions. Metastatic tissue can also be analyzed to determine
the stage of lung cancer in the tissue, or origin of primary tumor, e.g., metastasis from a
remote primary site. Furthermore, these gene expression profiles (or individual genes) allow
screening of drug candidates with an eye to mimicking or altering a particular expression
profile; e.g., screening can be done for drugs that suppress the lung cancer expression profile.
This may be done by making biochips comprising sets of the important lung cancer genes,
which can then be used in these screens. PCR methods may be applied with selected primer
pairs, and analysis may be of RNA or of genomic sequences. These methods can also be
done on the protein basis; that is, protein expression levels of the lung cancer proteins can be
evaluated for diagnostic purposes or to screen candidate agents. In addition, the lung cancer
nucleic acid sequences can be administered for gene therapy purposes, including the
administration of antisense nucleic acids, or the lung cancer proteins (including antibodies
and other modulators thereof) administered as therapeutic drugs or as protein or DNA
vaccines.

Thus the present invention provides nucleic acid and protein sequences that are
differentially expressed in lung cancer relative to normal tissues and/or non-malignant lung
disease, or in different types of lung disease, herein termed "lung cancer sequences." As
outlined below, lung cancer sequences include those that are up-regulated (i.e., expressed at a
higher level) in lung cancer, as well as those that are down-regulated (i.e., expressed at a
lower level). In a preferred embodiment, the lung cancer sequences are from humans;
however, as will be appreciated by those in the art, lung cancer sequences from other
organisms may be useful in animal models of disease and drug evaluation; thus, other lung
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats,
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows,
horses, etc.) and pets (dogs, cats, etc.). Lung cancer sequences from other organisms may be
obtained using the techniques outlined below.

Lung cancer sequences can include both nucleic acid and amino acid sequences. As
will be appreciated by those in the art and is more fully outlined below, lung cancer nucleic
acid sequences are useful in a variety of applications, including diagnostic applications,
which will detect naturally occurring nucleic acids, as well as screening applications; e.g.,
biochips comprising nucleic acid probes or PCR microtiter plates with selected probes to the
lung cancer sequences can be generated.

A lung cancer sequence can be initially identified by substantial nucleic acid and/or
amino acid sequence homology to the lung cancer sequences outlined herein. Such
homology can be based upon the overall nucleic acid or amino acid sequence, and is
generally determined as outlined below, e.g., using homology programs or hybridization
conditions.

For identifying lung cancer-associated sequences, the lung cancer screen typically
includes comparing genes identified in different tissues, e.g., normal and cancerous tissues,
cancer and non-malignant conditions, non-malignant conditions and normal tissues, or tumor
tissue samples from patients who have metastatic disease vs. non-metastatic tissue. Other
suitable tissue comparisons include comparing lung cancer samples with metastatic cancer
samples from other cancers, such as breast, other gastrointestinal cancers, prostate, ovarian,
etc. Samples of non-metastatic disease tissue and tissue undergoing metastasis are applied to
biochips comprising nucleic acid probes. The samples are first microdissected, if applicable,
and treated as is known in the art for the preparation of mRNA. Suitable biochips are
commercially available, e.g., from Affymetrix, Santa Clara, CA. Gene expression profiles as
described herein are generated and the data analyzed.

In one embodiment, the genes showing changes in expression as between normal and
disease states are compared to genes expressed in other normal tissues, preferably normal
lung, but also including, and not limited to, colon, heart, brain, liver, breast, kidney, muscle,
prostate, small intestine, large intestine, spleen, bone, and/or placenta. In a preferred
embodiment, those genes identified during the lung cancer screen that are expressed in
significant amounts in other tissues (e.g., essential organs) are removed from the profile,
although in some embodiments, this is not necessary (e.g., where organs may be dispensable
at a later stage of life). That is, when screening for drugs, it is usually preferable that the
target expression be disease specific, to minimize possible side effects on other organs.
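
The removal of candidates that are expressed in significant amounts in other tissues can be sketched as a simple filter. This is an illustration only: the cutoff `max_level`, the treatment of missing measurements as zero, and all names are assumptions, since the specification does not define "significant amounts" numerically.

```python
def filter_disease_specific(candidates, normal_tissue_expression,
                            essential_tissues, max_level):
    """Keep candidate genes whose expression is at or below max_level
    in every listed essential normal tissue.

    normal_tissue_expression maps gene -> {tissue: expression level}.
    A tissue with no measurement is treated as 0.0 (an assumption).
    """
    kept = []
    for gene in candidates:
        levels = normal_tissue_expression.get(gene, {})
        if all(levels.get(tissue, 0.0) <= max_level for tissue in essential_tissues):
            kept.append(gene)
    return kept


# Hypothetical expression levels in normal tissues.
NORMAL_EXPRESSION = {
    "GENE_A": {"heart": 0.1, "brain": 0.2},
    "GENE_B": {"heart": 5.0, "brain": 0.1},
}

print(filter_disease_specific(["GENE_A", "GENE_B"], NORMAL_EXPRESSION,
                              ["heart", "brain"], max_level=1.0))
```

Leaving a tissue out of `essential_tissues` corresponds to the embodiments in which removal is not considered necessary for that organ.
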
In a preferred embodiment, lung cancer sequences are those that are up-regulated in
lung cancer; that is, the expression of these genes is higher in cancerous tissue than in normal
lung or other tissue. "Up-regulation" as used herein means, when the ratio is presented as a
number greater than one, that the ratio is greater than one, preferably 1.5 or greater, more
preferably 2.0 or greater. Another embodiment is directed to sequences up-regulated in
non-malignant conditions relative to normal. Unigene cluster identification numbers and
accession numbers herein are for the GenBank sequence database and the sequences of the
accession numbers are hereby expressly incorporated by reference. GenBank is known in the
art, see, e.g., Benson, DA, et al. (1998) Nucleic Acids Research 26:1-7 and
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g.,
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ).
In some situations, the sequences may be derived from assembly of
available sequences or be predicted from genomic DNA using exon prediction algorithms,
such as FGENESH (Salamov and Solovyev (2000) Genome Res. 10:516-522). In other
situations, sequences have been derived from cloning and sequencing of isolated nucleic
acids.

In another preferred embodiment, lung cancer sequences are those that are
down-regulated in the lung cancer; that is, the expression of these genes is lower in cancerous
tissue than in normal lung or other tissue. "Down-regulation" as used herein means, when the
ratio is presented as a number greater than one, that the ratio is greater than one, preferably
1.5 or greater, more preferably 2.0 or greater, or, when the ratio is presented as a number less
than one, that the ratio is less than one, preferably 0.5 or less, more preferably 0.25 or less.
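
As a hedged illustration of the ratio definitions above, the sketch below classifies a gene from a tumor-to-normal expression ratio. It assumes the ratio is always expressed as tumor divided by normal, and it uses the stated preferred cutoffs (1.5 or greater for up-regulation, 0.5 or less for down-regulation) as adjustable defaults; the function and parameter names are hypothetical.

```python
def classify_regulation(tumor_level, normal_level,
                        up_threshold=1.5, down_threshold=0.5):
    """Classify a gene as 'up', 'down', or 'unchanged' from the tumor/normal ratio.

    The default thresholds follow the preferred values in the text; the
    more preferred values (2.0 and 0.25) can be passed instead.
    """
    if normal_level <= 0 or tumor_level < 0:
        raise ValueError("expression levels must be positive for a ratio")
    ratio = tumor_level / normal_level
    if ratio >= up_threshold:
        return "up"
    if ratio <= down_threshold:
        return "down"
    return "unchanged"


print(classify_regulation(30.0, 10.0))   # ratio 3.0
print(classify_regulation(2.0, 10.0))    # ratio 0.2
print(classify_regulation(11.0, 10.0))   # ratio 1.1
```

When a down-regulated gene is instead reported as a normal-to-tumor ratio greater than one, the same function can be called with the arguments swapped and the result read as the opposite direction.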

Informatics 

The ability to identify genes that are over or under expressed in lung cancer can
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure,
biosensor development, and other related areas. For example, the expression profiles can be
used in diagnostic or prognostic evaluation of patients with lung cancer. Or as another
example, subcellular toxicological information can be generated to better direct drug structure
and activity correlation (see Anderson (1998) Pharmaceutical Proteomics: Targets,
Mechanism, and Function, paper presented at the BBC Proteomics conference, Coronado, CA
(June 11-12, 1998)). Subcellular toxicological information can also be utilized in a biological
sensor device to predict the likely toxicological effect of chemical exposures and likely
tolerable exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids,
saccharides, lipids, drugs, and the like).

Thus, in another embodiment, the present invention provides a database that includes
at least one set of assay data. The data contained in the database is acquired, e.g., using array
analysis either singly or in a library format. The database can be in a form in which data can
be maintained and transmitted, but is preferably an electronic database. The electronic
database of the invention can be maintained on any electronic device allowing for the storage
of and access to the database, such as a personal computer, but is preferably distributed on a
wide area network, such as the World Wide Web.

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent to those of skill in the art that similar databases 
can be assembled for assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or
absolute abundance of a variety of molecular and macromolecular species from a biological
sample representing lung cancer, i.e., the identification of lung cancer-associated sequences
described herein, provide an abundance of information, which can be correlated with
pathological conditions, predisposition to disease, drug testing, therapeutic monitoring,
gene-disease causal linkages, identification of correlates of immunity and physiological status,
among others. Although the data generated from the assays of the invention is suited for
manual review and analysis, in a preferred embodiment, data processing using high-speed
computers is utilized.

An array of methods for indexing and retrieving biomolecular information is known
in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database
system for storing biomolecular sequence information in a manner that allows sequences to
be catalogued and searched according to one or more protein function hierarchies. U.S.
Patent 5,953,727 discloses a relational database having sequence records containing
information in a format that allows a collection of partial-length DNA sequences to be
catalogued and searched according to association with one or more sequencing projects for
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene
sequence similar to a sequence data item in a gene database based on the degree of similarity
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a
multi-dimensional database comprising a functionality for multi-dimensional data analysis
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database
record are divided into two classes, navigational and informational data, with navigational
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

See also Mount, et al. (2001) Bioinformatics; Durbin, et al. (eds., 1999) Biological
Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids; Baxevanis and
Ouellette (eds., 1998) Bioinformatics: A Practical Guide to the Analysis of Genes and
Proteins; Rashidi and Buehler (1999) Bioinformatics: Basic Applications in Biological
Science and Medicine; Setubal, et al. (eds., 1997) Introduction to Computational Molecular
Biology; Misener and Krawetz (eds., 2000) Bioinformatics: Methods and Protocols; Higgins
and Taylor (eds., 2000) Bioinformatics: Sequence, Structure, and Databanks: A Practical
Approach; Brown (2001) Bioinformatics: A Biologist's Guide to Biocomputing and the
Internet; Han and Kamber (2000) Data Mining: Concepts and Techniques; and
Waterman (1995) Introduction to Computational Biology: Maps, Sequences, and Genomes.

The present invention provides a computer database comprising a computer and
software for storing in computer-retrievable form assay data records cross-tabulated, e.g.,
with data specifying the source of the target-containing sample from which each sequence
specificity record was obtained.

In an exemplary embodiment, at least one of the sources of target-containing sample
is from a control tissue sample known to be free of pathological disorders. In a variation, at
least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or
another tissue specimen to be analyzed for lung cancer. In another variation, the assay
records cross-tabulate one or more of the following parameters for each target species in a
sample: (1) a unique identification code, which can include, e.g., a target molecular structure
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample
source; and (3) absolute and/or relative quantity of the target species present in the sample.
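
The three cross-tabulated parameters listed above can be pictured as a small record type. The sketch below is one possible in-memory layout, not the database of the invention; the class name, field names, and sample values are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayRecord:
    target_id: str       # (1) unique identification code for the target species
    sample_source: str   # (2) source of the target-containing sample
    quantity: float      # (3) absolute or relative quantity in that sample


def cross_tabulate(records):
    """Build a {target_id: {sample_source: quantity}} table from assay records."""
    table = defaultdict(dict)
    for record in records:
        table[record.target_id][record.sample_source] = record.quantity
    return {target: dict(by_source) for target, by_source in table.items()}


# Hypothetical records: one control tissue and one pathological specimen.
RECORDS = [
    AssayRecord("T001", "control_lung", 1.0),
    AssayRecord("T001", "neoplastic_lesion", 4.2),
    AssayRecord("T002", "control_lung", 2.5),
]

print(cross_tabulate(RECORDS))
```

A relational table with the same three columns would serve equally well; the nested dictionary is used here only to keep the example self-contained.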

The invention also provides for the storage and retrieval of a collection of target data
in a computer data storage apparatus, which can include magnetic disks, optical disks,
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic
bubble memory devices, and other data storage devices, including CPU registers and on-CPU
data storage arrays. Typically, the target data records are stored as a bit pattern in an array of
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor
and a charge storage area, which may be on the transistor). In one embodiment, the invention
provides such storage devices, and computer systems built therewith, comprising a bit pattern
encoding a protein expression fingerprint record comprising unique identifiers for at least 10
target data records cross-tabulated with target source.

When the target is a peptide or nucleic acid, the invention preferably provides a
method for identifying related peptide or nucleic acid sequences, comprising performing a
computerized comparison between a peptide or nucleic acid sequence assay record stored in
or retrieved from a computer storage device or database and at least one other sequence. The
comparison can include a sequence analysis or comparison algorithm or computer program
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences
determined from a polypeptide or nucleic acid sample of a specimen.

The invention also preferably provides a magnetic disk, such as an IBM-compatible
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux,
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed,
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention
in a file format suitable for retrieval and processing in a computerized sequence analysis,
comparison, or relative quantitation method.

The invention also provides a network, comprising a plurality of computing devices
linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line, ISDN
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at
least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells)
composing a bit pattern encoding data acquired from an assay of the invention.

The invention also provides a method for transmitting assay data that includes
generating an electronic signal on an electronic communications device, such as a modem,
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal
includes (in native or encrypted format) a bit pattern encoding data from an assay or a
database comprising a plurality of assay results obtained by the method of the invention.

In a preferred embodiment, the invention provides a computer system for comparing a
query target to a database containing an array of data structures, such as an assay result
obtained by the method of the invention, and ranking database targets based on the degree of
identity and gap weight to the target data. A central processor is preferably initialized to load
and execute the computer program for alignment and/or comparison of the assay results.
Data for a query target is entered into the central processor via an I/O device. Execution of
the computer program results in the central processor retrieving the assay data from the data
file, which comprises a binary description of an assay result.

The target data or record and the computer program can be transferred to secondary
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or
SDRAM). Targets are ranked according to the degree of correspondence between a selected
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of
the query target and results are output via an I/O device. For example, a central processor
can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC,
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain
molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM,
SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can
be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal
adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O
device.

The invention also preferably provides the use of a computer system, such as that
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a
collection of peptide sequence specificity records obtained by the methods of the invention,
which may be stored in the computer; (3) a comparison target, such as a query target; and (4)
a program for alignment and comparison, typically with rank-ordering of comparison results
on the basis of computed similarity values.
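
The rank-ordering of comparison results on the basis of computed similarity values can be sketched as follows. Python's standard `difflib.SequenceMatcher` is used here purely as a stand-in similarity measure; it is an assumption of this example and is not one of the alignment programs named in the specification (e.g., FASTA or BESTFIT). The sequences and record names are hypothetical.

```python
from difflib import SequenceMatcher


def rank_targets(query, database):
    """Return (name, similarity) pairs for every database sequence,
    ordered from most to least similar to the query."""
    scored = [
        (name, SequenceMatcher(None, query, sequence, autojunk=False).ratio())
        for name, sequence in database.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical stored peptide records.
DATABASE = {
    "record_1": "MKTAYIAKQR",
    "record_2": "MKTAYIAKQL",
    "record_3": "GGGGGGGGGG",
}

for name, score in rank_targets("MKTAYIAKQR", DATABASE):
    print(name, round(score, 2))
```

A production system would replace the similarity function with a scored alignment that accounts for identity and gap weight, leaving the ranking step unchanged.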

Characteristics of lung cancer-associated proteins

Lung cancer proteins of the present invention may be classified as secreted proteins,
transmembrane proteins or intracellular proteins. In one embodiment, the lung cancer protein
is an intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the
nucleus. Intracellular proteins are involved in all aspects of cellular function and replication
(including, e.g., signaling pathways); aberrant expression of such proteins often results in
unregulated or dysregulated cellular processes (see, e.g., Alberts (ed. 1994) Molecular
Biology of the Cell (3d ed.)). For example, many intracellular proteins have enzymatic
activity such as protein kinase activity, protein phosphatase activity, protease activity,
nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins also serve
as docking proteins that are involved in organizing complexes of proteins, or targeting
proteins to various subcellular localizations, and are involved in maintaining the structural
integrity of organelles.

An increasingly appreciated concept in characterizing proteins is the presence in the
proteins of one or more structural motifs for which defined functions have been attributed. In
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly
conserved sequences have been identified in proteins that are involved in protein-protein
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated
targets in a sequence dependent manner. PTB domains, which are distinct from SH2
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich
targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to name only a
few, have been shown to mediate protein-protein interactions. Some of these may also be
involved in binding to phospholipids or other second messengers. As will be appreciated by
one of ordinary skill in the art, these motifs can be identified on the basis of amino acid
sequence; thus, an analysis of the sequence of proteins may provide insight into both the
enzymatic potential of the molecule and/or molecules with which the protein may associate.
One useful database is Pfam (protein families), which is a large collection of multiple
sequence alignments and hidden Markov models covering many common protein domains.
Versions are available via the internet from Washington University in St Louis, the Sanger
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman, et al. (2000)
Nuc. Acids Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al.
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res.
26:320-322).

In another embodiment, the lung cancer sequences are transmembrane proteins.
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may
have an intracellular domain, an extracellular domain, or both. The intracellular domains of
such proteins may have a number of functions including those already described for
intracellular proteins. For example, the intracellular domain may have enzymatic activity
and/or may serve as a binding site for additional proteins. Frequently the intracellular
domain of transmembrane proteins serves both roles. For example, certain receptor tyrosine
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation
of tyrosines on the receptor molecule itself creates binding sites for additional SH2
domain-containing proteins.

Transmembrane proteins may contain from one to many transmembrane domains.
For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases
and receptor serine/threonine protein kinases contain a single transmembrane domain.
However, various other proteins including channels, pumps, and adenylyl cyclases contain
numerous transmembrane domains. Many important cell surface receptors such as G protein
coupled receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they
contain 7 membrane spanning regions. Characteristics of transmembrane domains include
approximately 17 consecutive hydrophobic amino acids that may be followed by charged
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the
localization and number of transmembrane domains within the protein may be predicted (see,
e.g., PSORT web site http://psort.nibb.ac.jp/).
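
The idea that a run of roughly 17 consecutive hydrophobic residues marks a candidate transmembrane domain can be illustrated with a deliberately simple scan. This is not the PSORT method; the set of residues treated as hydrophobic and the function name are assumptions made for the example, and real predictors use hydropathy scales and additional features.

```python
import re

# Illustrative choice of one-letter residue codes treated as hydrophobic.
HYDROPHOBIC = "ACFGILMVW"


def find_hydrophobic_stretches(sequence, min_length=17):
    """Return (start, end) index pairs (0-based, end exclusive) for every run
    of at least min_length consecutive hydrophobic residues."""
    pattern = re.compile("[%s]{%d,}" % (HYDROPHOBIC, min_length))
    return [(match.start(), match.end())
            for match in pattern.finditer(sequence.upper())]


# Hypothetical sequence: charged flanks around a 19-residue hydrophobic core.
EXAMPLE = "MKRSD" + "LIVALIVALIVALIVALIV" + "KRDE"

stretches = find_hydrophobic_stretches(EXAMPLE)
print(len(stretches), stretches)
```

The number of stretches returned gives a rough count of candidate membrane-spanning regions, and their positions give the predicted localization within the sequence.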

The extracellular domains of transmembrane proteins are diverse; however, conserved
motifs are found repeatedly among various extracellular domains. Conserved structure
and/or functions have been ascribed to different extracellular motifs. Many extracellular
domains are involved in binding to other molecules. In one aspect, extracellular domains are
found on receptors. Factors that bind the receptor domain include circulating ligands, which
may be peptides, proteins, or small molecules such as adenosine and the like. For example,
growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their
cognate receptors to initiate a variety of cellular responses. Other factors include cytokines,
mitogenic factors, hormones, neurotrophic factors and the like. Extracellular domains also
bind to cell-associated molecules. In this respect, they may mediate cell-cell interactions.
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol
(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains may
also associate with the extracellular matrix and contribute to the maintenance of the cell
structure.

Lung cancer proteins that are transmembrane are particularly preferred in the present
invention as they are readily accessible targets for extracellular immunotherapeutics, as are
described herein. In addition, as outlined below, transmembrane proteins can also be useful
in imaging modalities. Antibodies may be used to label such readily accessible proteins in
situ or in histological analysis. Alternatively, antibodies can also label intracellular proteins,
in which case analytical samples are typically permeabilized to provide access to intracellular
proteins. In addition, some membrane proteins can be processed to release a soluble protein,
or to expose a residual fragment. Released soluble proteins may be useful diagnostic
markers, and processed residual protein fragments may be useful markers of lung disease.

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the lung cancer proteins are secreted proteins, the secretion of
which can be either constitutive or regulated. These proteins may have a signal peptide or
signal sequence that targets the molecule to the secretory pathway. Secreted proteins are
involved in numerous physiological events; e.g., if circulating, they often serve to transmit
signals to various other cell types. The secreted protein may function in an autocrine manner
(acting on the cell that secreted the factor), a paracrine manner (acting on cells in close
proximity to the cell that secreted the factor), an endocrine manner (acting on cells at a
distance, e.g., secretion into the blood stream), or an exocrine manner (secretion, e.g., through
a duct or to an adjacent epithelial surface, as in sweat glands, sebaceous glands, pancreatic
ducts, lacrimal glands, mammary glands, wax producing glands of the ear, etc.). Thus secreted
molecules often find use in modulating or altering numerous aspects of physiology. Lung
cancer proteins that are secreted proteins are particularly preferred in the present invention as
they serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests.
Those which are enzymes may be antibody or small molecule targets. Others may be useful
as vaccine targets, e.g., via CTL mechanisms.

Use of lung cancer nucleic acids 

As described above, a lung cancer sequence is initially identified by substantial nucleic
acid and/or amino acid sequence homology or linkage to the lung cancer sequences outlined
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence,
and is generally determined as outlined below, using either homology programs or
hybridization conditions. Typically, linked sequences on an mRNA are found on the same
molecule.

The lung cancer nucleic acid sequences of the invention, e.g., the sequences in Tables
1A-16, can be fragments of larger genes, i.e., they are nucleic acid segments. "Genes" in this
context includes coding regions, non-coding regions, and mixtures of coding and non-coding
regions. Accordingly, as will be appreciated by those in the art, using the sequences provided
herein, extended sequences, in either direction, of the lung cancer genes can be obtained,
using techniques well known in the art for cloning either longer sequences or the full length
sequences; see Ausubel, et al., supra. Much can be done by informatics and many sequences
can be clustered to include multiple sequences corresponding to a single gene, e.g., systems
such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/).

Once a lung cancer nucleic acid is identified, it can be cloned and, if necessary, its
constituent parts recombined to form the entire lung cancer nucleic acid coding regions or the
entire mRNA sequence. Once isolated from its natural source, e.g., contained within a
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the
recombinant lung cancer nucleic acid can be further used as a probe to identify and isolate
other lung cancer nucleic acids, e.g., extended coding regions. It can also be used as a
"precursor" nucleic acid to make modified or variant lung cancer nucleic acids and proteins.

The lung cancer nucleic acids of the present invention are used in several ways. In a
first embodiment, nucleic acid probes to the lung cancer nucleic acids are made and attached
to biochips to be used in screening and diagnostic methods, as outlined below, or for
administration, e.g., for gene therapy, RNAi, vaccine, and/or antisense applications.
Alternatively, the lung cancer nucleic acids that include coding regions of lung cancer
proteins can be put into expression vectors for the expression of lung cancer proteins, again
for screening purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to lung cancer nucleic acids (both the
nucleic acid sequences outlined in the figures and/or the complements thereof) are made.
The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the lung cancer nucleic acids, i.e., the target sequence (either the target
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that
hybridization of the target sequence and the probes of the present invention occurs. As
outlined below, this complementarity need not be perfect; there may be any number of base
pair mismatches which will interfere with hybridization between the target sequence and the
single stranded nucleic acids of the present invention. However, if the number of mutations
is so great that no hybridization can occur under even the least stringent of hybridization
conditions, the sequence is not a complementary target sequence. Thus, by "substantially
complementary" herein is meant that the probes are sufficiently complementary to the target
sequences to hybridize under appropriate reaction conditions, particularly high stringency
conditions, as outlined herein.
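
As an illustration only, the notion of imperfect complementarity can be sketched computationally by counting the positions at which a probe fails to pair with an equal-length target site. The function names and the mismatch threshold below are hypothetical and are not taken from this specification; actual hybridization depends on stringency conditions, base composition, and mismatch position, none of which a simple count models.

```python
# Minimal sketch: count mismatches between a probe and a target site.
# All names and the default threshold are illustrative assumptions.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def count_mismatches(probe: str, target_site: str) -> int:
    """Count positions where the probe does not pair with the target site.

    Both sequences are written 5'->3' and must have the same length.
    """
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must have the same length")
    perfect_probe = reverse_complement(target_site)
    return sum(1 for p, q in zip(probe.upper(), perfect_probe) if p != q)


def is_substantially_complementary(probe: str, target_site: str,
                                   max_mismatches: int = 2) -> bool:
    """Crude stand-in for 'hybridizes under the chosen conditions'."""
    return count_mismatches(probe, target_site) <= max_mismatches
```

For example, a probe "GCAT" pairs perfectly with the target site "ATGC", whereas "GCAA" has one mismatch against the same site.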

A nucleic acid probe is generally single stranded but can be partially single and
partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally
complements of ORFs or whole genes are not used. In some embodiments, nucleic acids of
lengths up to hundreds of bases can be used.

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for a 
particular target. The probes can be overlapping (i.e., have some sequence in common), or
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 
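
By way of a non-limiting sketch, redundant probes for a single target might be laid out by spacing fixed-length windows evenly along the target and taking the reverse complement of each window. The function name, the default probe length of 40 bases, and the even-spacing rule are assumptions made for illustration; the specification does not prescribe a particular layout.

```python
# Minimal sketch: choose several evenly spaced probes along one target.
# Names, defaults, and the spacing rule are illustrative assumptions.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def tile_probes(target: str, probe_len: int = 40,
                n_probes: int = 3) -> list[tuple[int, str]]:
    """Return (start, probe) pairs; each probe complements one target window.

    Windows overlap whenever the target is shorter than
    n_probes * probe_len, and are separate otherwise.
    """
    if n_probes < 1:
        raise ValueError("n_probes must be at least 1")
    if len(target) < probe_len:
        raise ValueError("target is shorter than the probe length")
    span = len(target) - probe_len
    if n_probes == 1:
        starts = [0]
    else:
        starts = [i * span // (n_probes - 1) for i in range(n_probes)]
    return [(s, reverse_complement(target[s:s + probe_len])) for s in starts]
```

With a 100-base target and the defaults above, the windows start at positions 0, 30, and 60, so adjacent probes share 10 bases of target sequence.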

As will be appreciated by those in the art, nucleic acids can be attached or
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical
equivalents herein is meant that the association or binding between the nucleic acid probe and the
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and
removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is typically meant one or more of
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is
the covalent attachment of a molecule, such as streptavidin, to the support and the
non-covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and
grammatical equivalents herein is meant that the two moieties, the solid support and the
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination
bonds. Covalent bonds can be formed directly between the probe and the solid support or can
be formed by a cross linker or by inclusion of a specific reactive group on either the solid
support or the probe or both molecules. Immobilization may also involve a combination of
covalent and non-covalent interactions.

In general, the probes are attached to a biochip in a wide variety of ways, as will be
appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or
other grammatical equivalents herein is meant a material that can be modified for the
attachment or association of the nucleic acid probes and is amenable to at least one detection
method. Often the substrate may contain discrete individual sites appropriate for individual
partitioning and identification. As will be appreciated by those in the art, the number of
possible substrates is very large, and includes, but is not limited to, glass and modified or
functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and
other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon, etc.),
polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including
silicon and modified silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the
substrates allow optical detection and do not appreciably fluoresce. A preferred substrate is
described in US application entitled Reusable Low Fluorescent Plastic Biochip, U.S.
Application Serial No. 09/270,214, filed March 15, 1999, herein incorporated by reference in
its entirety.

Generally the substrate is planar, although as will be appreciated by those in the art,
other configurations of substrates may be used as well. For example, the probes may be
placed on the inside surface of a tube, for flow-through sample analysis to minimize sample
volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed
cell foams made of particular plastics.

In a preferred embodiment, the surface of the biochip and the probe may be
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g.,
the biochip is derivatized with a chemical functional group including, but not limited to,
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g.,
homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases,
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be
used.

In this embodiment, oligonucleotides are synthesized, and then attached to the surface
of the solid support. Either the 5' or 3' terminus may be attached to the solid support, or
attachment may be via linkage to an internal nucleoside.

In another embodiment, the immobilization to the solid support may be very strong,
yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to
surfaces covalently coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is known in
the art. For example, photoactivation techniques utilizing photopolymerization compounds
and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in
situ, using known photolithographic techniques, such as those described in WO 95/25116;
WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of
which are expressly incorporated by reference; these methods of attachment form the basis of
the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression level of
lung cancer-associated sequences. These assays are typically performed in conjunction with
reverse transcription. In such assays, a lung cancer-associated nucleic acid sequence acts as a
template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In a
quantitative amplification, the amount of amplification product will be proportional to the
amount of template in the original sample. Comparison to appropriate controls provides a
measure of the amount of lung cancer-associated RNA. Methods of quantitative
amplification are well known to those of skill in the art. Detailed protocols for quantitative
PCR are provided, e.g., in Innis, et al. (1990) PCR Protocols, A Guide to Methods and
Applications.
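
One common way of comparing a sample to controls, offered here only as a hedged illustration, is the comparative threshold-cycle calculation, in which a target signal is normalized to a reference transcript and then to a control sample. This calculation is not described in the present specification; the function name, the use of threshold cycle (Ct) values, and the assumed per-cycle amplification factor of 2.0 (perfect doubling) are all assumptions of the sketch.

```python
# Minimal sketch of a comparative threshold-cycle (2^-ddCt) calculation.
# Not taken from the specification; names and defaults are assumptions.
def relative_expression(ct_target_sample: float, ct_ref_sample: float,
                        ct_target_control: float, ct_ref_control: float,
                        efficiency: float = 2.0) -> float:
    """Return the fold change of the target in a sample versus a control.

    Each Ct is the cycle at which signal crosses a fixed threshold;
    a lower Ct means more starting template. `efficiency` is the
    assumed fold amplification per cycle (2.0 = perfect doubling).
    """
    delta_sample = ct_target_sample - ct_ref_sample      # normalize to reference
    delta_control = ct_target_control - ct_ref_control   # same for the control
    delta_delta = delta_sample - delta_control
    return efficiency ** -delta_delta
```

For instance, with target and reference Ct values of 22 and 18 in a tumor sample and 24 and 18 in a normal control, the sketch reports a four-fold higher target level in the tumor sample, under the stated doubling assumption.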

In some embodiments, a TaqMan based assay is used to measure expression.
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be
extended due to a blocking agent at the 3' end. When the PCR product is amplified in
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3'
quenching agent, thereby resulting in an increase in fluorescence as a function of
amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com).

Other suitable amplification methods include, but are not limited to, ligase chain
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560, Landegren, et al. (1988)
Science 241:1077, and Barringer, et al. (1990) Gene 89:117), transcription amplification
(Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173), self-sustained sequence
replication (Guatelli, et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874), dot PCR, and linker
adapter PCR, etc.

Expression of lung cancer proteins from nucleic acids 

In a preferred embodiment, lung cancer nucleic acids, e.g., encoding lung cancer
proteins, are used to make a variety of expression vectors to express lung cancer proteins
which can then be used in screening assays, as described below. Expression vectors and
recombinant DNA technology are well known to those of skill in the art (see, e.g., Ausubel,
supra, and Fernandez and Hoeffler (eds 1999) Gene Expression Systems) and are used to
express proteins. The expression vectors may be either self-replicating extrachromosomal
vectors or vectors which integrate into a host genome. Generally, these expression vectors
include transcriptional and translational regulatory nucleic acid operably linked to the nucleic
acid encoding the lung cancer protein. The term "control sequences" refers to DNA
sequences used for the expression of an operably linked coding sequence in a particular host
organism. Control sequences that are suitable for prokaryotes, e.g., include a promoter,
optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to
utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding
sequence if it affects the transcription of the sequence; or a ribosome binding site is operably
linked to a coding sequence if it is positioned so as to facilitate translation. Generally,
"operably linked" means that the DNA sequences being linked are contiguous, and, in the
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have
to be contiguous. Linking is typically accomplished by ligation at convenient restriction
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in
accordance with conventional practice. Transcriptional and translational regulatory nucleic
acid will generally be appropriate to the host cell used to express the lung cancer protein.
Numerous types of appropriate expression vectors, and suitable regulatory sequences are
known in the art for a variety of host cells.

In general, transcriptional and translational regulatory sequences may include, but are
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop
sequences, translational start and stop sequences, and enhancer or activator sequences. In a
preferred embodiment, the regulatory sequences include a promoter and transcriptional start
and stop sequences.

Promoter sequences may be either constitutive or inducible promoters. The promoters
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which
combine elements of more than one promoter, are also known in the art, and are useful in the
present invention.

In addition, an expression vector may comprise additional elements. For example, the
expression vector may have two replication systems, thus allowing it to be maintained in two
organisms, e.g., in mammalian or insect cells for expression and in a prokaryotic host for
cloning and amplification. Furthermore, for integrating expression vectors, the expression
vector often contains at least one sequence homologous to the host cell genome, and
preferably two homologous sequences which flank the expression construct. The integrating
vector may be directed to a specific locus in the host cell by selecting the appropriate
homologous sequence for inclusion in the vector. Constructs for integrating vectors are well
known in the art (e.g., Fernandez and Hoeffler, supra).

In addition, in a preferred embodiment, the expression vector contains a selectable
marker gene to allow the selection of transformed host cells. Selection genes are well known
in the art and will vary with the host cell used.

The lung cancer proteins of the present invention are usually produced by culturing a
host cell transformed with an expression vector containing nucleic acid encoding a lung
cancer protein, under the appropriate conditions to induce or cause expression of the lung
cancer protein. Conditions appropriate for lung cancer protein expression will vary with the
choice of the expression vector and the host cell, and will be easily ascertained by one skilled
in the art through routine experimentation or optimization. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and
proliferation of the host cell, while the use of an inducible promoter requires the appropriate
growth conditions for induction. In addition, in some embodiments, the timing of the harvest
is important. For example, the baculoviral systems used in insect cell expression are lytic
viruses, and thus harvest time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK,
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a
macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the lung cancer proteins are expressed in mammalian
cells. Mammalian expression systems are also known in the art, and include retroviral and
adenoviral systems. Of particular use as mammalian promoters are the promoters from
mammalian viral genes, since the viral genes are often highly expressed and have a broad
host range. Examples include the SV40 early promoter, mouse mammary tumor virus LTR
promoter, adenovirus major late promoter, herpes simplex virus promoter, and the CMV
promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription termination and
polyadenylation sequences recognized by mammalian cells are regulatory regions located 3'
to the translation stop codon and thus, together with the promoter elements, flank the coding
sequence. Examples of transcription terminator and polyadenylation signals include those
derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as
other hosts, are well known in the art, and will vary with the host cell used. Techniques
include dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei.

In a preferred embodiment, lung cancer proteins are expressed in bacterial systems.
Promoters from bacteriophage may also be used and are known in the art. In addition,
synthetic promoters and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of
the trp and lac promoter sequences. Furthermore, a bacterial promoter can include naturally
occurring promoters of non-bacterial origin that have the ability to bind bacterial RNA
polymerase and initiate transcription. In addition to a functioning promoter sequence, an
efficient ribosome binding site is desirable. The expression vector may also include a signal
peptide sequence that provides for secretion of the lung cancer protein in bacteria. The
protein is either secreted into the growth media (gram-positive bacteria) or into the
periplasmic space, located between the inner and outer membrane of the cell (gram-negative
bacteria). The bacterial expression vector may also include a selectable marker gene to allow
for the selection of bacterial strains that have been transformed. Suitable selection genes
include genes which render the bacteria resistant to drugs such as ampicillin,
chloramphenicol, erythromycin, kanamycin, neomycin and tetracycline. Selectable markers
also include biosynthetic genes, such as those in the histidine, tryptophan and leucine
biosynthetic pathways. These components are assembled into expression vectors. Expression
vectors for bacteria are well known in the art, and include vectors for Bacillus subtilis, E.
coli, Streptococcus cremoris, and Streptococcus lividans, among others (e.g., Fernandez and
Hoeffler, supra). The bacterial expression vectors are transformed into bacterial host cells
using techniques well known in the art, such as calcium chloride treatment, electroporation,
and others.

In one embodiment, lung cancer proteins are produced in insect cells. Expression
vectors for the transformation of insect cells, and in particular, baculovirus-based expression
vectors, are well known in the art.

In a preferred embodiment, lung cancer protein is produced in yeast cells. Yeast
expression systems are well known in the art, and include expression vectors for
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha,
Kluyveromyces fragilis and K. lactis, Pichia guillerimondii, and P. pastoris,
Schizosaccharomyces pombe, and Yarrowia lipolytica.

The lung cancer protein may also be made as a fusion protein, using techniques well
known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope
is small, the lung cancer protein may be fused to a carrier protein to form an immunogen.
Alternatively, the lung cancer protein may be made as a fusion protein to increase expression,
for affinity purification purposes, or for other reasons. For example, when the lung cancer
protein is a lung cancer peptide, the nucleic acid encoding the peptide may be linked to other
nucleic acid for expression purposes.

In a preferred embodiment, the lung cancer protein is purified or isolated after
expression. Lung cancer proteins may be isolated or purified in a variety of appropriate
ways. Standard purification methods include electrophoretic, molecular, immunological and
chromatographic techniques, including ion exchange, hydrophobic, affinity, and
reverse-phase HPLC chromatography, and chromatofocusing. For example, the lung cancer protein
may be purified using a standard anti-lung cancer protein antibody column. Ultrafiltration
and diafiltration techniques, in conjunction with protein concentration, are also useful. For
general guidance in suitable purification techniques, see Scopes (1982) Protein Purification.
The degree of purification necessary will vary depending on the use of the lung cancer
protein. In some instances no purification will be necessary.

Once expressed and purified if necessary, the lung cancer proteins and nucleic acids
are useful in a number of applications. They may be used as immunoselection reagents, as
vaccine reagents, as screening agents, as therapeutic entities, for production of antibodies, as
transcription or translation inhibitors, etc.

Variants of lung cancer proteins

In one embodiment, the lung cancer proteins are derivative or variant lung cancer
proteins as compared to the wild-type sequence. That is, as outlined more fully below, the
derivative lung cancer peptide will often contain at least one amino acid substitution, deletion
or insertion, with amino acid substitutions being particularly preferred. The amino acid
substitution, insertion or deletion may occur at a particular residue within the lung cancer
peptide.

Also included within one embodiment of lung cancer proteins of the present invention
are amino acid sequence variants. These variants typically fall into one or more of three
classes: substitutional, insertional or deletional variants. These variants ordinarily are
prepared by site specific mutagenesis of nucleotides in the DNA encoding the lung cancer
protein, using cassette or PCR mutagenesis or other techniques, to produce DNA encoding
the variant, and thereafter expressing the DNA in recombinant cell culture as outlined above.
However, variant lung cancer protein fragments having up to about 100-150 residues may be
prepared by in vitro synthesis. Amino acid sequence variants are characterized by the
predetermined nature of the variation, a feature that sets them apart from naturally occurring
allelic or interspecies variation of the lung cancer protein amino acid sequence. The variants
typically exhibit a similar qualitative biological activity as the naturally occurring analogue,
although variants can also be selected which have modified characteristics as will be more
fully outlined below.

While the site or region for introducing an amino acid sequence variation is often
predetermined, the mutation per se need not be predetermined. For example, in order to
optimize the performance of a mutation at a given site, random mutagenesis may be
conducted at the target codon or region and the expressed lung cancer variants screened for
the optimal combination of desired activity. Techniques exist for making substitution
mutations at predetermined sites in DNA having a known sequence, e.g., M13 primer
mutagenesis and PCR mutagenesis. Screening of mutants is often done using assays of lung
cancer protein activities.

Amino acid substitutions are typically of single residues; insertions usually will be on
the order of from about 1 to about 20 amino acids, although considerably larger insertions may be
occasionally tolerated. Deletions generally range from about 1 to about 20 residues, although
in some cases deletions may be much larger.
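
Purely for illustration, the three classes of sequence variant can be expressed as simple operations on a one-letter amino acid string. The function names, the 1-based position convention, and the restriction to the twenty standard residues are assumptions of this sketch; it models only the sequence edit and says nothing about whether a given variant retains activity.

```python
# Minimal sketch: substitutional, insertional, and deletional variants
# of a protein sequence in one-letter code. Names are illustrative.
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def _check(residues: str) -> None:
    """Reject anything outside the twenty standard one-letter codes."""
    unknown = set(residues) - STANDARD_RESIDUES
    if unknown:
        raise ValueError(f"non-standard residues: {sorted(unknown)}")


def substitute(seq: str, position: int, new_residue: str) -> str:
    """Replace the single residue at a 1-based position."""
    _check(new_residue)
    if len(new_residue) != 1 or not 1 <= position <= len(seq):
        raise ValueError("need one residue and a position inside the sequence")
    return seq[:position - 1] + new_residue + seq[position:]


def insert(seq: str, position: int, residues: str) -> str:
    """Insert residues immediately after a 1-based position (0 = N-terminus)."""
    _check(residues)
    if not 0 <= position <= len(seq):
        raise ValueError("position outside the sequence")
    return seq[:position] + residues + seq[position:]


def delete(seq: str, position: int, length: int = 1) -> str:
    """Delete `length` residues starting at a 1-based position."""
    if length < 1 or position < 1 or position + length - 1 > len(seq):
        raise ValueError("deletion extends outside the sequence")
    return seq[:position - 1] + seq[position - 1 + length:]
```

Applied to the hypothetical peptide "MKTAY", substituting arginine at position 2 gives "MRTAY", inserting "GG" after position 2 gives "MKGGTAY", and deleting two residues from position 3 gives "MKY".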

Substitutions, deletions, insertions or any combination thereof may be used to arrive
at a final derivative. Generally these changes are done on a few amino acids to minimize the
alteration of the molecule. Larger changes may be tolerated in certain circumstances. When
small alterations in the characteristics of a lung cancer protein are desired, substitutions are
generally made in accordance with the amino acid substitution chart provided in the
definition section.

Variants typically exhibit essentially the same qualitative biological activity and will
elicit the same immune response as a naturally-occurring analog, although variants also are
selected to modify the characteristics of lung cancer proteins as needed. Alternatively, the
variant may be designed or reorganized such that a biological activity of the lung cancer
protein is altered. For example, glycosylation sites may be added, altered, or removed.

Covalent modifications of lung cancer polypeptides are included within the scope of
this invention. One type of covalent modification includes reacting targeted amino acid
residues of a lung cancer polypeptide with an organic derivatizing agent that is capable of
reacting with selected side chains or the N- or C-terminal residues of a lung cancer
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking
lung cancer polypeptides to a water-insoluble support matrix or surface for use in a method
for purifying anti-lung cancer polypeptide antibodies or screening assays, as is more fully
described below. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic
acid, homobifunctional imidoesters, including disuccinimidyl esters such as
3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8-octane
and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to
the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of serinyl, threonyl or tyrosyl residues,
methylation of the α-amino groups of lysine, arginine, and histidine side chains (Creighton
(1983) Proteins: Structure and Molecular Properties, pp. 79-86), acetylation of the N-terminal
amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the lung cancer polypeptide encompassed by
this invention is an altered native glycosylation pattern of the polypeptide. "Altering the
native glycosylation pattern" is intended herein to mean adding to or deleting one or more
carbohydrate moieties of a native sequence lung cancer polypeptide. Glycosylation patterns
can be altered in many ways. For example, the use of different cell types to express lung
cancer-associated sequences can result in different glycosylation patterns.

Addition of glycosylation sites to lung cancer polypeptides may also be accomplished
by altering the amino acid sequence thereof. The alteration may be made, e.g., by the
addition of, or substitution by, one or more serine or threonine residues to the native sequence
lung cancer polypeptide (for O-linked glycosylation sites). The lung cancer amino acid
sequence may optionally be altered through changes at the DNA level, particularly by
mutating the DNA encoding the lung cancer polypeptide at preselected bases such that
codons are generated that will translate into the desired amino acids.

Another means of increasing the number of carbohydrate moieties on the lung cancer
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such
methods are described in the art, e.g., in WO 87/05330, and in Aplin and Wriston (1981)
CRC Crit. Rev. Biochem., pp. 259-306.

Removal of carbohydrate moieties present on the lung cancer polypeptide may be
accomplished chemically or enzymatically or by mutational substitution of codons encoding
amino acid residues that serve as targets for glycosylation. Chemical deglycosylation
techniques are known in the art and described, for instance, by Hakimuddin, et al. (1987)
Arch. Biochem. Biophys. 259:52 and by Edge, et al. (1981) Anal. Biochem. 118:131.
Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a
variety of endo- and exo-glycosidases as described by Thotakura, et al. (1987) Meth.
Enzymol. 138:350.

Another type of covalent modification of lung cancer polypeptides comprises linking the lung
cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene
glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent
Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337.

Lung cancer polypeptides of the present invention may also be modified in a way to
form chimeric molecules comprising a lung cancer polypeptide fused to another,
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric
molecule comprises a fusion of a lung cancer polypeptide with a tag polypeptide which
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is
generally placed at the amino- or carboxyl-terminus of the lung cancer polypeptide. The
presence of such epitope-tagged forms of a lung cancer polypeptide can be detected using an
antibody against the tag polypeptide. Also, provision of the epitope tag enables the lung
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment,
the chimeric molecule may comprise a fusion of a lung cancer polypeptide with an
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known and examples
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 and metal
chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. (1988) Mol.
Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies
thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); and the Herpes
Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1990) Protein
Engineering 3(6):547-553). Other tag polypeptides include the Flag-peptide (Hopp, et al.
(1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science
255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 266:15163-15166);
and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. Natl.
Acad. Sci. USA 87:6393-6397).

Also included are other lung cancer proteins of the lung cancer family, and lung
cancer proteins from other organisms, which are cloned and expressed as outlined below.
Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be used to
find other related lung cancer proteins from primates or other organisms. As will be
appreciated by those in the art, particularly useful probe and/or PCR primer sequences
include unique areas of the lung cancer nucleic acid sequence. As is generally known in the
art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from
about 20 to about 30 being preferred, and may contain inosine as needed. PCR reaction
conditions are well known in the art (e.g., Innis, PCR Protocols, supra).
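
The primer length ranges stated above can be illustrated with a short sketch. This is only an illustration under stated assumptions: the function name `classify_primer_length` is hypothetical, the letter `I` is used here to denote inosine, and the "about" bounds in the text are treated as hard inclusive limits, which the text itself does not require.

```python
def classify_primer_length(primer: str) -> str:
    """Classify a candidate PCR primer by the length ranges given in the text.

    Assumptions (not from the source): 'I' denotes inosine, and the ranges
    'about 15 to about 35' and 'about 20 to about 30' are treated as exact,
    inclusive bounds.
    """
    allowed = set("ACGTI")
    seq = primer.upper()
    if not seq or set(seq) - allowed:
        raise ValueError("primer must contain only A, C, G, T, or I")
    n = len(seq)
    if 20 <= n <= 30:
        return "more preferred"       # about 20 to about 30 nucleotides
    if 15 <= n <= 35:
        return "preferred"            # about 15 to about 35 nucleotides
    return "outside stated range"


if __name__ == "__main__":
    print(classify_primer_length("ACGTIACGTIACGTIACGTI"))  # 20 nucleotides
```

A check of this kind says nothing about primer specificity or melting temperature; it only restates the length guidance.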

Antibodies to lung cancer proteins 

In a preferred embodiment, when a lung cancer protein is to be used to generate
antibodies, e.g., for immunotherapy or immunodiagnosis, the lung cancer protein should
share at least one epitope or determinant with the full length protein. By "epitope" or
"determinant" herein is typically meant a portion of a protein which will generate and/or bind
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies
made to a smaller lung cancer protein will be able to bind to the full-length protein,
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are well known (e.g., Coligan, supra; and
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous
or intraperitoneal injections. The immunizing agent may include a protein encoded by a
nucleic acid of Tables 1A-16 or fragment thereof or a fusion protein thereof. It may be useful
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal
being immunized. Immunogenic proteins include, e.g., keyhole limpet hemocyanin, serum
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, e.g.,
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic
trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in
the art.

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies
may be prepared using hybridoma methods, such as those described by Kohler and Milstein
(1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other appropriate host
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce
or are capable of producing antibodies that will specifically bind to the immunizing agent.
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will
typically include a polypeptide encoded by a nucleic acid of the tables, or fragment thereof,
or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are
used if cells of human origin are desired, or spleen cells or lymph node cells are used if
non-human mammalian sources are desired. The lymphocytes are then fused with an
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
hybridoma cell (Goding (1986) Monoclonal Antibodies: Principles and Practice, pp. 59-103).
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells
of rodent, bovine, or primate origin. Usually, rat or mouse myeloma cell lines are employed.
The hybridoma cells may be cultured in a suitable culture medium that preferably contains
one or more substances that inhibit the growth or survival of the unfused, immortalized cells.
For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the
growth of HGPRT-deficient cells.

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are
typically monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens or that have binding specificities for two
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a
protein encoded by a nucleic acid of the tables or a fragment thereof; the other one is for any
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit,
preferably one that is tumor specific. Alternatively, tetramer-type technology may create
multivalent reagents.

In a preferred embodiment, the antibodies to lung cancer protein are capable of
reducing or eliminating a biological function of a lung cancer protein, in a naked form or
conjugated to an effector moiety. That is, the addition of anti-lung cancer protein antibodies
(either polyclonal or preferably monoclonal) to lung cancer tissue (or cells containing lung
cancer) may reduce or eliminate the lung cancer. Generally, at least a 25% decrease in
activity, growth, size or the like is preferred, with at least about 50% being particularly
preferred and about a 95-100% decrease being especially preferred.

In a preferred embodiment the antibodies to the lung cancer proteins are humanized
antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs,
Inc.). Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab',
F(ab')2 or other antigen-binding subsequences of antibodies) which contain minimal
sequence derived from non-human immunoglobulin. Humanized antibodies include human
immunoglobulins (recipient antibody) in which residues from a complementarity determining
region (CDR) of the recipient are replaced by residues from a CDR of a non-human species
(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and
capacity. In some instances, Fv framework residues of a human immunoglobulin are
replaced by corresponding non-human residues. Humanized antibodies may also comprise
residues which are found neither in the recipient antibody nor in the imported CDR or
framework sequences. In general, a humanized antibody will comprise substantially all of at
least one, and typically two, variable domains, in which all or substantially all of the CDR
regions correspond to those of a non-human immunoglobulin and all or substantially all of
the framework (FR) regions are those of a human immunoglobulin consensus sequence. A
humanized antibody optimally will also comprise at least a portion of an
immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones, et
al. (1986) Nature 321:522-525; Riechmann, et al. (1988) Nature 332:323-327; and Presta
(1992) Curr. Op. Struct. Biol. 2:593-596). Humanization can be performed following the
method of Winter and co-workers (Jones, et al. (1986) Nature 321:522-525; Riechmann, et al.
(1988) Nature 332:323-327; Verhoeyen, et al. (1988) Science 239:1534-1536), by
substituting rodent CDRs or CDR sequences for corresponding sequences of a human
antibody. Accordingly, such humanized antibodies are chimeric antibodies (U.S. Patent No.
4,816,567), wherein substantially less than an intact human variable domain has been
substituted by corresponding sequence from a non-human species.






WO 02/086443 PCT/US02/12476 

Human-like antibodies can also be produced using various techniques known in the
art, including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381;
Marks, et al. (1991) J. Mol. Biol. 222:581). The techniques of Cole, et al. and Boerner, et al.
are also available for the preparation of human monoclonal antibodies (Cole, et al. (1985)
Monoclonal Antibodies and Cancer Therapy, p. 77 and Boerner, et al. (1991) J. Immunol.
147(1):86-95). Similarly, human antibodies can be made by introducing human
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge,
human antibody production is observed, which closely resembles that seen in humans in
nearly all respects, including gene rearrangement, assembly, and antibody repertoire. This
approach is described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126;
5,633,425; 5,661,016, and in the following scientific publications: Marks, et al. (1992)
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994)
Nature 368:812-13; Fishwild, et al. (1996) Nature Biotechnology 14:845-51; Neuberger
(1996) Nature Biotechnology 14:826; and Lonberg and Huszar (1995) Intern. Rev. Immunol.
13:65-93.

By immunotherapy is meant treatment of lung cancer with an antibody raised against
a lung cancer protein. As used herein, immunotherapy can be passive or active. Passive
immunotherapy as defined herein is the passive transfer of antibody to a recipient (patient).
Active immunization is the induction of antibody and/or T-cell responses in a recipient
(patient). Induction of an immune response is the result of providing the recipient with an
antigen to which antibodies are raised. The antigen may be provided by injecting a
polypeptide against which antibodies are desired to be raised into a recipient, or contacting
the recipient with a nucleic acid capable of expressing the antigen and under conditions for
expression of the antigen, leading to an immune response.

In a preferred embodiment the lung cancer proteins against which antibodies are
raised are secreted proteins as described above. Without being bound by theory, antibodies
used for treatment may bind the secreted protein and prevent it from binding to its receptor,
thereby inactivating the secreted lung cancer protein.

In another preferred embodiment, the lung cancer protein to which antibodies are
raised is a transmembrane protein. Without being bound by theory, antibodies used for
treatment may bind the extracellular domain of the lung cancer protein and prevent it from
binding to other proteins, such as circulating ligands or cell-associated molecules. The
antibody may cause down-regulation of the transmembrane lung cancer protein. The
antibody may be a competitive, non-competitive or uncompetitive inhibitor of protein binding
to the extracellular domain of the lung cancer protein. The antibody may be an antagonist of
the lung cancer protein or may prevent activation of a transmembrane lung cancer protein, or
may induce or suppress a particular cellular pathway. In some embodiments, when the
antibody prevents the binding of other molecules to the lung cancer protein, the antibody
prevents growth of the cell. The antibody may also be used to target or sensitize the cell to
cytotoxic agents, including, but not limited to, TNF-α, TNF-β, IL-1, INF-γ, and IL-2, or
chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate,
and the like. In some instances the antibody may belong to a sub-type that activates serum
complement when complexed with the transmembrane protein, thereby mediating cytotoxicity
or antibody-dependent cellular cytotoxicity (ADCC). Thus, lung cancer may be treated by
administering to a patient antibodies directed against the transmembrane lung cancer protein.
Antibody-labeling may activate a co-toxin, localize a toxin payload, or otherwise provide
means to locally ablate cells.

In another preferred embodiment, the antibody is conjugated to an effector moiety.
The effector moiety can be various molecules, including labeling moieties such as radioactive
labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the therapeutic
moiety is a small molecule that modulates the activity of a lung cancer protein. In another
aspect the therapeutic moiety may modulate an activity of molecules associated with or in
close proximity to a lung cancer protein. The therapeutic moiety may inhibit enzymatic or
signaling activity such as protease or collagenase activity associated with lung cancer.

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In
this method, targeting the cytotoxic agent to lung cancer tissue or cells results in a reduction
in the number of afflicted cells, thereby reducing symptoms associated with lung cancer.
Cytotoxic agents are numerous and varied and include, but are not limited to, cytotoxic drugs
or toxins or active fragments of such toxins. Suitable toxins and their corresponding
fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin,
crotin, phenomycin, enomycin, saporin, auristatin, and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against lung cancer
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to
the antibody. Targeting the therapeutic moiety to transmembrane lung cancer proteins not
only serves to increase the local concentration of therapeutic moiety in the lung cancer
afflicted area, but also serves to reduce deleterious side effects that may be associated with
the untargeted therapeutic moiety.

In another preferred embodiment, the lung cancer protein against which the antibodies
are raised is an intracellular protein. In this case, the antibody may be conjugated to a protein
or other entity which facilitates entry into the cell. In one case, the antibody enters the cell by
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, wherein the lung cancer protein can be targeted within a
cell, e.g., the nucleus, an antibody thereto may contain a signal for that target localization, e.g.,
a nuclear localization signal.

The lung cancer antibodies of the invention specifically bind to lung cancer proteins.
By "specifically bind" herein is meant that the antibodies bind to the protein with a Kd of at
least about 0.1 mM, more usually at least about 1 µM, preferably at least about 0.1 µM or
better, and most preferably, 0.01 µM or better. Selectivity of binding to the specific target
and not to related other sequences is also important.
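
The binding thresholds in this definition can be restated as a small sketch. It assumes the dissociation constant is supplied in molar units and that the thresholds read 0.1 mM, 1 µM, 0.1 µM, and 0.01 µM, with a lower Kd meaning tighter binding; the function name `binding_tier` and the tier labels are hypothetical and chosen only for illustration.

```python
def binding_tier(kd_molar: float) -> str:
    """Map a dissociation constant (in mol/L) onto the tiers named in the text.

    Assumption: 'at least about X' is read as 'Kd at or below X', since a
    smaller Kd corresponds to stronger binding.
    """
    if kd_molar <= 0:
        raise ValueError("Kd must be positive")
    if kd_molar <= 1e-8:      # 0.01 µM or better
        return "most preferred"
    if kd_molar <= 1e-7:      # 0.1 µM or better
        return "preferred"
    if kd_molar <= 1e-6:      # about 1 µM
        return "more usual"
    if kd_molar <= 1e-4:      # about 0.1 mM
        return "specific binding (minimum)"
    return "below stated threshold"


if __name__ == "__main__":
    for kd in (5e-9, 5e-8, 5e-7, 5e-5, 1e-3):
        print(f"Kd = {kd:.0e} M -> {binding_tier(kd)}")
```

Affinity alone does not establish selectivity; as the text notes, binding to related sequences has to be assessed separately.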


Detection of lung cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different
cellular states in the lung cancer phenotype. Expression levels of genes in normal tissue (e.g.,
not undergoing lung cancer), in lung cancer tissue (and in some cases, for varying severities
of lung cancer that relate to prognosis, as outlined below), or in non-malignant disease are
evaluated to provide expression profiles. A gene expression profile of a particular cell state
or point of development is essentially a "fingerprint" of the state of the cell. While two states
may have a particular gene similarly expressed, the evaluation of a number of genes
simultaneously allows the generation of a gene expression profile that is reflective of the state
of the cell. By comparing expression profiles of cells in different states, information
regarding which genes are important (including both up- and down-regulation of genes) in
each of these states is obtained. Then, diagnosis may be performed or confirmed to
determine whether a tissue sample has the gene expression profile of normal or cancerous
tissue. This will provide for molecular diagnosis of related conditions.
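
One way to picture the comparison of a sample "fingerprint" against reference profiles is the sketch below. It is not the method of the specification: Pearson correlation is swapped in as a simple similarity measure, the function names are hypothetical, and the expression values are made-up placeholders rather than data from the tables.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("profiles must not be constant")
    return cov / (sd_x * sd_y)


def closest_state(sample, references):
    """Return the name of the reference profile most correlated with the sample."""
    return max(references, key=lambda state: pearson(sample, references[state]))


# Hypothetical expression levels for the same four genes in each profile.
references = {
    "normal": [1.0, 1.2, 0.9, 1.1],
    "lung cancer": [4.0, 0.3, 2.5, 0.2],
}
sample = [3.6, 0.4, 2.2, 0.3]

if __name__ == "__main__":
    print(closest_state(sample, references))
```

In practice a profile would span many genes, and any classification rule would need validation against samples of known state.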

"Differential expression," or grammatical equivalents as used herein, refers to
qualitative or quantitative differences in the temporal and/or cellular gene expression
patterns within and among cells and tissue. Thus, a differentially expressed gene can
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus lung cancer tissue. Genes may be turned on or turned off in a particular state,
relative to another state thus permitting comparison of two or more states. A qualitatively
regulated gene will exhibit an expression pattern within a state or cell type which is
detectable by standard techniques. Some genes will be expressed in one state or cell type, but
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in
an increased amount of transcript, or downregulated, resulting in a decreased amount of
transcript. The degree to which expression differs need only be large enough to quantify via
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip™ expression arrays, Lockhart (1996) Nature Biotechnology 14:1675-1680, hereby
expressly incorporated by reference. Other techniques include, but are not limited to,
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined
above, preferably the change in expression (i.e., upregulation or downregulation) is typically
at least about 50%, more preferably at least about 100%, more preferably at least about
150%, more preferably at least about 200%, with from 300 to at least 1000% being especially
preferred.
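
The percentage thresholds above can be made concrete with a short sketch. The text does not say how a percent change is computed for downregulation, so the convention used here is an assumption: the change is measured relative to the lower of the two expression levels, which makes a 4-fold increase and a 4-fold decrease both count as 300%. The names `expression_change` and `highest_threshold_met` are hypothetical.

```python
THRESHOLDS = (50, 100, 150, 200, 300)  # percent values named in the text


def expression_change(normal: float, tumor: float):
    """Return (direction, percent change) between two expression levels.

    Assumption: percent change is measured relative to the lower level so
    that up- and downregulation are treated symmetrically.
    """
    if normal <= 0 or tumor <= 0:
        raise ValueError("expression levels must be positive")
    high, low = max(normal, tumor), min(normal, tumor)
    percent = (high / low - 1.0) * 100.0
    if tumor > normal:
        direction = "up"
    elif tumor < normal:
        direction = "down"
    else:
        direction = "none"
    return direction, percent


def highest_threshold_met(percent: float):
    """Return the largest stated threshold the change reaches, or None."""
    met = [t for t in THRESHOLDS if percent >= t]
    return max(met) if met else None


if __name__ == "__main__":
    direction, pct = expression_change(normal=10.0, tumor=40.0)
    print(direction, pct, highest_threshold_met(pct))
```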

Evaluation may be at the gene transcript or the protein level. The amount of gene
expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of the
gene transcript, and the quantification of gene expression levels, or, alternatively, the final
gene product itself (protein) can be monitored, e.g., with antibodies to the lung cancer protein
and standard immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy
assays, 2D gel electrophoresis assays, etc. Proteins corresponding to lung cancer genes, e.g.,
those identified as being important in a lung cancer or disease phenotype, can be evaluated in
a lung cancer diagnostic test. In a preferred embodiment, gene expression monitoring is
performed simultaneously on a number of genes.

The lung cancer nucleic acid probes may be attached to biochips as outlined herein for
the detection and quantification of lung cancer sequences in a particular cell. The assays are
further described below in the example. PCR techniques can be used to provide greater
sensitivity. Multiple protein expression monitoring can be performed as well. Similarly,
these assays may be performed on an individual basis as well.

In a preferred embodiment nucleic acids encoding the lung cancer protein are
detected. Although DNA or RNA encoding the lung cancer protein may be detected, of
particular interest are methods wherein an mRNA encoding a lung cancer protein is detected.
Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is complementary to
and hybridizes with the mRNA and includes, but is not limited to, oligonucleotides, cDNA or
RNA. Probes also should contain a detectable label, as defined herein. In one method the
mRNA is detected after immobilizing the nucleic acid to be examined on a solid support such
as nylon membranes and hybridizing the probe with the sample. Following washing to
remove the non-specifically bound probe, the label is detected. In another method detection
of the mRNA is performed in situ. In this method permeabilized cells or tissue samples are
contacted with a detectably labeled nucleic acid probe for sufficient time to allow the probe
to hybridize with the target mRNA. Following washing to remove the non-specifically bound
probe, the label is detected. For example, a digoxygenin labeled riboprobe (RNA probe) that
is complementary to the mRNA encoding a lung cancer protein is detected by binding the
digoxygenin with an anti-digoxygenin secondary antibody and developed with nitro blue
tetrazolium and 5-bromo-4-chloro-3-indolyl phosphate.

In a preferred embodiment, various proteins from the three classes of proteins as
described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic
assays. The lung cancer proteins, antibodies, nucleic acids, modified proteins and cells
containing lung cancer sequences are used in diagnostic assays. This can be performed on an
individual gene or corresponding polypeptide level. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes and/or corresponding
polypeptides.

As described and defined herein, lung cancer proteins, including intracellular,
transmembrane, or secreted proteins, find use as markers of lung cancer, e.g., for prognostic
or diagnostic purposes. Detection of these proteins in putative lung cancer tissue allows for
detection, prognosis, or diagnosis of lung cancer or similar disease, and perhaps for selection
of therapeutic strategy. In one embodiment, antibodies are used to detect lung cancer
proteins. A preferred method separates proteins from a sample by electrophoresis on a gel
(typically a denaturing and reducing protein gel, but may be another type of gel, including
isoelectric focusing gels and the like). Following separation of proteins, the lung cancer
protein is detected, e.g., by immunoblotting with antibodies raised against the lung cancer
protein. Methods of immunoblotting are well known to those of ordinary skill in the art.

In another preferred method, antibodies to the lung cancer protein find use in in situ
imaging techniques, e.g., in histology (e.g., Asai (ed. 1993) Methods in Cell Biology:
Antibodies in Cell Biology, volume 37). In this method cells are contacted with from one to
many antibodies to the lung cancer protein(s). Following washing to remove non-specific
antibody binding, the presence of the antibody or antibodies is detected. In one embodiment
the antibody is detected by incubating with a secondary antibody that contains a detectable
label, e.g., multicolor fluorescence or confocal imaging. In another method the primary
antibody to the lung cancer protein(s) contains a detectable label, e.g., an enzyme marker that
can act on a substrate. In another preferred embodiment each one of multiple primary
antibodies contains a distinct and detectable label. This method finds particular use in
simultaneous screening for a plurality of lung cancer proteins. Many other histological
imaging techniques are also provided by the invention.

In a preferred embodiment the label is detected in a fluorometer which has the ability 
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing lung cancer from
blood, serum, plasma, stool, and other samples. Such samples, therefore, are useful as
samples to be probed or tested for the presence of lung cancer proteins. Antibodies can be
used to detect a lung cancer protein by previously described immunoassay techniques
including ELISA, immunoblotting (western blotting), immunoprecipitation, BIACORE
technology and the like. Conversely, the presence of antibodies may indicate an immune
response against an endogenous lung cancer protein or vaccine.

In a preferred embodiment, in situ hybridization of labeled lung cancer nucleic acid
probes to tissue arrays is done. For example, arrays of tissue samples, including lung cancer
tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is then
performed. When comparing the fingerprints between an individual and a standard, the
skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It is
further understood that the genes which indicate the diagnosis may differ from those which
indicate the prognosis and molecular profiling of the condition of the cells may lead to
distinctions between responsive or refractory conditions or may be predictive of outcomes.

In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing lung cancer sequences are used in prognosis assays.
As above, gene expression profiles can be generated that correlate to lung cancer, clinical,
pathological, or other information, in terms of long term prognosis. Again, this may be done
on either a protein or gene level, with the use of genes being preferred. Single or multiple
genes may be useful in various combinations. As above, lung cancer probes may be attached
to biochips for the detection and quantification of lung cancer sequences in a tissue or patient.
The assays proceed as outlined above for diagnosis. PCR methods may provide more
sensitive and accurate quantification.

Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described
herein are used in drug screening assays. The lung cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing lung cancer sequences are used in drug screening
assays or by evaluating the effect of drug candidates on a "gene expression profile" or
expression profile of polypeptides. In a preferred embodiment, the expression profiles are
used, preferably in conjunction with high throughput screening techniques to allow
monitoring for expression profile genes after treatment with a candidate agent (e.g.,
Zlokarnik, et al. (1998) Science 279:84-8; Heid (1996) Genome Res. 6:986-94).

In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing the native or modified lung cancer proteins are used in
screening assays. That is, the present invention provides novel methods for screening for
compositions which modulate the lung cancer phenotype or an identified physiological
function of a lung cancer protein. As above, this can be done on an individual gene level or
by evaluating the effect of drug candidates on a "gene expression profile". In a preferred
embodiment, the expression profiles are used, preferably in conjunction with high throughput
screening techniques to allow monitoring for expression profile genes after treatment with a
candidate agent, see Zlokarnik, supra.

Having identified differentially expressed genes herein, a variety of assays may be
performed. In a preferred embodiment, assays may be run on an individual gene or protein
level. That is, having identified a particular gene with altered regulation in lung cancer, test
compounds can be screened for the ability to modulate gene expression or for binding to the
lung cancer protein. "Modulation" thus includes an increase or a decrease in gene
expression. The preferred amount of modulation will depend on the original change of the
gene expression in normal versus tissue undergoing lung cancer, with changes of at least
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or
greater. Thus, if a gene exhibits a 4-fold increase in lung cancer tissue compared to normal
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in lung
cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in
expression to be induced by the test compound.
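
The reasoning in the last sentence, that the desired modulation mirrors the original change, can be written out as a brief sketch. It assumes the observed change is expressed as a ratio of expression in lung cancer tissue to expression in normal tissue; the function name `target_modulation` is hypothetical.

```python
def target_modulation(cancer_to_normal_ratio: float):
    """Return the (direction, fold) change that would restore the normal level.

    Assumption: the input is expression in lung cancer tissue divided by
    expression in normal tissue, so 4.0 means a 4-fold increase and 0.1
    means a 10-fold decrease.
    """
    if cancer_to_normal_ratio <= 0:
        raise ValueError("ratio must be positive")
    if cancer_to_normal_ratio > 1:
        return "decrease", cancer_to_normal_ratio
    if cancer_to_normal_ratio < 1:
        return "increase", 1.0 / cancer_to_normal_ratio
    return "none", 1.0


if __name__ == "__main__":
    print(target_modulation(4.0))   # gene is 4-fold up in lung cancer tissue
    print(target_modulation(0.1))   # gene is 10-fold down in lung cancer tissue
```

The text describes these targets as "often desired" rather than required, so the returned value is a reference point for a screen, not a pass/fail criterion.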

The amount of gene expression may be monitored using nucleic acid probes and the
quantification of gene expression levels, or, alternatively, the gene product itself can be
monitored, e.g., through the use of antibodies to the lung cancer protein and standard
immunoassays. Proteomics and separation techniques may also allow quantification of
expression.

In a preferred embodiment, gene or protein expression of a number of
entities, i.e., an expression profile, is monitored simultaneously. Such profiles will typically
involve a plurality of those entities described herein.

In this embodiment, the lung cancer nucleic acid probes are attached to biochips as
outlined herein for the detection and quantification of lung cancer sequences in a particular
cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates, may be used
with dispensed primers in desired wells. A PCR reaction can then be performed and analyzed
for each well.

Expression monitoring can be performed to identify compounds that modify the
expression of one or more lung cancer-associated sequences, e.g., a polynucleotide sequence
set out in the tables. Generally, in a preferred embodiment, a test compound is added to the
cells prior to analysis. Moreover, screens are also provided to identify agents that modulate
lung cancer, modulate lung cancer proteins, bind to a lung cancer protein, or interfere with
the binding of a lung cancer protein and an antibody, substrate, or other binding partner.

The term "test compound" or "drug candidate" or "modulator" or grammatical
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the lung cancer phenotype or the expression of a lung cancer sequence, e.g., a
nucleic acid or protein sequence. In preferred embodiments, modulators alter expression
profiles of nucleic acids or proteins provided herein. In one embodiment, the modulator
suppresses a lung cancer phenotype, e.g., to a normal or non-malignant tissue fingerprint. In
another embodiment, a modulator induces a lung cancer phenotype. Generally, a plurality of
assay mixtures are run in parallel with different agent concentrations to obtain a differential
response to the various concentrations. Typically, one of these concentrations serves as a
negative control, i.e., at zero concentration or below the level of detection.

In one aspect, a modulator will neutralize the effect of a lung cancer protein. By
"neutralize" is meant that activity of a protein and the consequent effect on the cell is
inhibited or blocked.

In certain embodiments, combinatorial libraries of potential modulators will be
screened for an ability to bind to a lung cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a
chemical compound (called a "lead compound") with some desirable property or activity,
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve providing a
library containing a large number of potential therapeutic compounds (candidate
compounds). Such "combinatorial chemical libraries" are then screened in one or more
assays to identify those library members (particular chemical species or subclasses) that
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical compounds
generated by either chemical synthesis or biological synthesis by combining a number of
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical
library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical
building blocks called amino acids in every possible way for a given compound length (i.e.,
the number of amino acids in a polypeptide compound). Millions of chemical compounds
can be synthesized through such combinatorial mixing of chemical building blocks (Gallop,
et al. (1994) J. Med. Chem. 37(9):1233-1251).
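
By way of non-limiting illustration, the size of a linear polypeptide library formed by combining amino acids "in every possible way" for a given length can be computed, and its members enumerated, as in the following Python sketch. The function names are hypothetical, and restricting the building blocks to the 20 standard amino acids is an assumption made only for this example.

```python
from itertools import islice, product

# Assumption: the building blocks are the 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length, alphabet=AMINO_ACIDS):
    """Number of distinct linear sequences of the given length."""
    return len(alphabet) ** length


def enumerate_library(length, alphabet=AMINO_ACIDS):
    """Lazily yield every sequence of the given length, one at a time."""
    for residues in product(alphabet, repeat=length):
        yield "".join(residues)


# A pentapeptide library already contains 20**5 members.
pentapeptide_count = library_size(5)
# Peek at the first few tripeptides without materializing the whole library.
first_three = list(islice(enumerate_library(3), 3))
```

Because the library grows as the alphabet size raised to the sequence length, the members are generated lazily rather than stored in a list.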

Preparation and screening of combinatorial chemical libraries is well known to those
of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot. Res. 37:487-493,
Houghton, et al. (1991) Nature 354:84-88), peptoids (PCT Publication No. WO
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as
hydantoins, benzodiazepines and dipeptides (Hobbs, et al. (1993) Proc. Nat. Acad. Sci. USA
90:6909-6913), vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc.
114:6568), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding (Hirschmann, et
al. (1992) J. Amer. Chem. Soc. 114:9217-9218), analogous organic syntheses of small
compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 116:2661), oligocarbamates
(Cho, et al. (1993) Science 261:1303), and/or peptidyl phosphonates (Campbell, et al. (1994)
J. Org. Chem. 59:658; see, generally, Gordon, et al. (1994) J. Med. Chem. 37:1385), nucleic
acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic acid libraries (see, e.g., U.S.
Patent 5,539,083), antibody libraries (see, e.g., Vaughn, et al. (1996) Nature Biotechnology
14(3):309-314, and PCT/US96/10287), carbohydrate libraries (see, e.g., Liang, et al. (1996)
Science 274:1520-1522, and U.S. Patent No. 5,593,853), and small organic molecule libraries
(see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18, page 33; isoprenoids, U.S. Patent
No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines,
U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent No.
5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like).

Devices for the preparation of combinatorial libraries are commercially available (see,
e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY; Symphony, Rainin,
Woburn, MA; 433A, Applied Biosystems, Foster City, CA; 9050 Plus, Millipore, Bedford,
MA).

A number of well known robotic systems have also been developed for solution phase
chemistries. These systems include automated workstations like the automated synthesis
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic
systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass.; Orca,
Hewlett-Packard, Palo Alto, Calif.), which mimic the manual synthetic operations performed
by a chemist. The above devices, with appropriate modification, are suitable for use with the
present invention. In addition, numerous combinatorial libraries are themselves
commercially available (see, e.g., ComGenex, Princeton, NJ; Asinex, Moscow, RU; Tripos,
Inc., St. Louis, MO; ChemStar, Ltd, Moscow, RU; 3D Pharmaceuticals, Exton, PA; Martek
Biosciences, Columbia, MD; etc.).

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect modulation of lung cancer gene transcription, polypeptide 
expression, and polypeptide activity. 

High throughput assays for evaluating the presence, absence, quantification, or other
properties of particular nucleic acids or protein products are well known to those of skill in
the art. Binding assays and reporter gene assays are similarly well known. Thus,
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins;
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid
binding (i.e., in arrays); while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high
throughput methods of screening for ligand/antibody binding.

In addition, high throughput screening systems are commercially available (see, e.g.,
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc., Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems
typically automate procedures, including sample and reagent pipetting, liquid dispensing,
timed incubations, and final readings of the microplate in detector(s) appropriate for the
assay. These configurable systems provide high throughput and rapid start up as well as a
high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides
technical bulletins describing screening systems for detecting the modulation of gene
transcription, ligand binding, and the like.

In one embodiment, modulators are proteins, often naturally occurring proteins or
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or
random or directed digests of proteinaceous cellular extracts, may be used. In this way
libraries of proteins may be made for screening in the methods of the invention. Particularly
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins,
with the latter being preferred, and human proteins being especially preferred. Particularly
useful test compounds will be directed to the class of proteins to which the target belongs, e.g.,
substrates for enzymes or ligands and receptors.

In a preferred embodiment, modulators are peptides of from about 5 to about 30
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to
about 15 being particularly preferred. The peptides may be digests of naturally occurring
proteins, random peptides, or "biased" random peptides. By "randomized" or grammatical
equivalents herein is meant that the nucleic acid or peptide consists of essentially random
sequences of nucleotides and amino acids, respectively. Since these random peptides (or
nucleic acids, discussed below) are often chemically synthesized, they may incorporate any
nucleotide or amino acid at any position. The synthetic process can be designed to generate
randomized proteins or nucleic acids, to allow the formation of all or most of the possible
combinations over the length of the sequence, thus forming a library of randomized candidate
bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence preferences or
constants at any position. In a preferred embodiment, the library is biased. That is, some
positions within the sequence are either held constant, or are selected from a limited number
of possibilities. In a preferred embodiment, the nucleotides or amino acid residues are
randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues,
sterically biased (either small or large) residues, towards the creation of nucleic acid binding
domains, the creation of cysteines for cross-linking, prolines for SH-3 domains, serines,
threonines, tyrosines or histidines for phosphorylation sites, etc.
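
By way of non-limiting illustration, a "biased" library in which some positions are held constant and others are drawn from a limited class can be modeled as a per-position template of allowed residues. The following Python sketch is hypothetical: the template, the residue classes, the library size, and the function name are assumptions chosen for the example and do not describe any particular library of the invention.

```python
import random

# Assumed residue classes, for illustration only.
ANY_RESIDUE = "ACDEFGHIKLMNPQRSTVWY"
HYDROPHOBIC = "AVLIMFW"


def biased_peptide(template, rng):
    """Draw one peptide; each template entry lists the residues allowed there.

    A one-letter entry holds that position constant, a longer entry restricts
    the position to a defined class, and ANY_RESIDUE leaves it fully random.
    """
    return "".join(rng.choice(allowed) for allowed in template)


# Hypothetical 8-mer: cysteines fixed at both ends (e.g., for cross-linking),
# a fixed proline, two hydrophobic positions, and three fully random positions.
template = ["C", HYDROPHOBIC, ANY_RESIDUE, ANY_RESIDUE, HYDROPHOBIC, "P", ANY_RESIDUE, "C"]

rng = random.Random(0)  # seeded so the sampled library is reproducible
library = [biased_peptide(template, rng) for _ in range(1000)]
```

A fully randomized library is the special case in which every template entry is `ANY_RESIDUE`.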

Modulators of lung cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may be
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids.
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins.

In a preferred embodiment, the candidate compounds are organic chemical moieties, a 
wide variety of which are available in the literature. 

After a candidate agent has been added and the cells allowed to incubate for some
period of time, the sample containing a target sequence is analyzed. If required, the target
sequence is prepared using known techniques. For example, the sample may be treated to
lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or
amplification such as PCR performed as appropriate. For example, an in vitro transcription
with labels covalently attached to the nucleotides is performed. Generally, the nucleic acids
are labeled with biotin-FITC or PE, or with Cy3 or Cy5.

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the
target sequence's specific binding to a probe. The label also can be an enzyme, such as
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate
substrate produces a product that can be detected. Alternatively, the label can be a labeled
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or
altered by the enzyme. The label also can be a moiety or compound, such as an epitope tag
or biotin, which specifically binds to streptavidin. For the example of biotin, the streptavidin
is labeled as described above, thereby providing a detectable signal for the bound target
sequence. Unbound labeled streptavidin is typically removed prior to analysis.

Nucleic acid assays can be direct hybridization assays or can comprise "sandwich
assays", which include the use of multiple probes, as is generally outlined in U.S. Patent Nos.
5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731,
5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of
which are hereby incorporated by reference. In this embodiment, in general, the target nucleic
acid is prepared as outlined above, and then added to the biochip comprising a plurality of
nucleic acid probes, under conditions that allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention, including
high, moderate and low stringency conditions as outlined above. The assays are generally
run under stringency conditions which allow formation of the label probe hybridization
complex only in the presence of target. Stringency can be controlled by altering a step
parameter that is a thermodynamic variable, including, but not limited to, temperature,
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic
solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is generally
outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at
higher stringency conditions to reduce non-specific binding.

The reactions outlined herein may be accomplished in a variety of ways. Components
of the reaction may be added simultaneously, or sequentially, in different orders, with
preferred embodiments outlined below. In addition, the reaction may include a variety of
other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc.
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.

Screens are performed to identify modulators of the lung cancer phenotype. In one
embodiment, screening is performed to identify modulators that can induce or suppress a
particular expression profile, thus preferably generating the associated phenotype. In another
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes
important in a particular state, screens can be performed to identify modulators that alter
expression of individual genes. In another embodiment, screening is performed to identify
modulators that alter a biological function of the expression product of a differentially
expressed gene. Again, having identified the importance of a gene in a particular state,
screens are performed to identify agents that bind and/or modulate the biological activity of
the gene product, or evaluate genetic polymorphisms.

Genes can be screened for those that are induced in response to a candidate agent.
After identifying a modulator based upon its ability to suppress a lung cancer expression
pattern leading to a normal expression pattern, or to modulate a single lung cancer gene
expression profile so as to mimic the expression of the gene from normal tissue, a screen as
described above can be performed to identify genes that are specifically modulated in
response to the agent. Comparing expression profiles between normal tissue and
agent-treated lung cancer tissue reveals genes that are not expressed in normal tissue or lung cancer
tissue, but are expressed in agent-treated tissue. These agent-specific sequences can be
identified and used by methods described herein for lung cancer genes or proteins. In
particular, these sequences and the proteins they encode find use in marking or identifying
agent-treated cells. In addition, antibodies can be raised against the agent-induced proteins
and used to target novel therapeutics to the treated lung cancer tissue sample.
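
By way of non-limiting illustration, the identification of agent-specific sequences described above, i.e., genes expressed in agent-treated tissue but not in normal or untreated lung cancer tissue, reduces to a set difference over the three expression profiles. In the following Python sketch the gene names, the expression values, the detection threshold, and the function names are hypothetical assumptions made for the example.

```python
def expressed_genes(profile, detection_threshold=1.0):
    """Genes whose measured level is at or above the (assumed) detection threshold."""
    return {gene for gene, level in profile.items() if level >= detection_threshold}


def agent_specific_genes(normal, cancer, treated, detection_threshold=1.0):
    """Genes expressed in agent-treated tissue but in neither normal nor cancer tissue."""
    return (
        expressed_genes(treated, detection_threshold)
        - expressed_genes(normal, detection_threshold)
        - expressed_genes(cancer, detection_threshold)
    )


# Hypothetical profiles mapping gene identifiers to expression levels.
normal = {"geneA": 12.0, "geneB": 0.0, "geneC": 0.2}
cancer = {"geneA": 3.0, "geneB": 9.0, "geneC": 0.1}
treated = {"geneA": 10.0, "geneB": 0.5, "geneC": 7.5}

induced = agent_specific_genes(normal, cancer, treated)
```

In this example only `geneC` is detected exclusively in the agent-treated profile and would therefore be a candidate marker of agent-treated cells.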

Thus, in one embodiment, a test compound is administered to a population of lung
cancer cells that have an associated lung cancer expression profile. By "administration" or
"contacting" herein is meant that the candidate agent is added to the cells in such a manner as
to allow the agent to act upon the cell, whether by uptake and intracellular action, or by
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous
candidate agent (i.e., a peptide) may be put into a viral construct such as an adenoviral or
retroviral construct, and added to the cell, such that expression of the peptide agent is
accomplished (see, e.g., PCT US97/01019). Regulatable gene therapy systems can also be used.

Once a test compound has been administered to the cells, the cells can be washed if
desired and are allowed to incubate under preferably physiological conditions for some
period of time. The cells are then harvested and a new gene expression profile is generated,
as outlined herein.
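
By way of non-limiting illustration, the comparison of the expression profile generated after incubation with the profile of the untreated cells can be sketched as a per-gene log2 fold change. The following Python sketch is hypothetical: the gene names, the expression values, the twofold cut-off, and the function name are assumptions made for the example and are not prescribed by this disclosure.

```python
import math


def changed_genes(baseline, treated, min_abs_log2_fc=1.0):
    """Return {gene: log2 fold change} for genes whose level changed by at least
    the given magnitude (1.0 corresponds to a twofold change in either direction).

    Genes missing from the treated profile, or with a non-positive level in
    either profile, are skipped because a log ratio is undefined for them.
    """
    changed = {}
    for gene, base_level in baseline.items():
        treated_level = treated.get(gene)
        if treated_level is None or base_level <= 0 or treated_level <= 0:
            continue
        log2_fc = math.log2(treated_level / base_level)
        if abs(log2_fc) >= min_abs_log2_fc:
            changed[gene] = log2_fc
    return changed


# Hypothetical profiles before and after administration of a test compound.
baseline = {"geneA": 100.0, "geneB": 50.0, "geneC": 10.0}
treated = {"geneA": 25.0, "geneB": 55.0, "geneC": 40.0}

hits = changed_genes(baseline, treated)
agent_has_effect = len(hits) >= 1
```

Here `geneA` is suppressed fourfold and `geneC` is induced fourfold, while the small change in `geneB` falls below the assumed cut-off.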

Thus, e.g., lung cancer or non-malignant tissue may be screened for agents that
modulate, e.g., induce or suppress, a lung cancer phenotype. A change in at least one gene,
preferably many, of the expression profile indicates that the agent has an effect on lung
cancer activity. By defining such a signature for the lung cancer phenotype, screens for new
drugs that alter the phenotype can be devised. With this approach, the drug target need not be
known and need not be represented in the original expression screening platform, nor does
the level of transcript for the target protein need to change.

Measurement of lung cancer polypeptide activity, or of lung cancer or the lung cancer
phenotype, can be performed using a variety of assays. For example, the effects of the test
compounds upon the function of the lung cancer polypeptides can be measured by examining
parameters described above. A suitable physiological change that affects activity can be used
to assess the influence of a test compound on the polypeptides of this invention. When the
functional consequences are determined using intact cells or animals, one can also measure a
variety of effects such as, in the case of lung cancer associated with tumors, tumor growth,
tumor metastasis, neovascularization, hormone release, transcriptional changes to both known
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as
cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In
the assays of the invention, mammalian lung cancer polypeptide is typically used, e.g.,
mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in vitro.
For example, a lung cancer polypeptide is first contacted with a potential modulator and
incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, the
lung cancer polypeptide levels are determined in vitro by measuring the level of protein or
mRNA. The level of protein is typically measured using immunoassays such as western
blotting, ELISA and the like with an antibody that selectively binds to the lung cancer
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using
PCR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot
blotting, are preferred. The level of protein or mRNA is typically detected using directly or
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids,
radioactively or enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using a lung cancer protein
promoter operably linked to a reporter gene such as luciferase, green fluorescent protein,
CAT, or β-gal. The reporter construct is typically transfected into a cell. After treatment
with a potential modulator, the amount of reporter gene transcription, translation, or activity
is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on individual
genes and gene products (proteins). That is, having identified a particular differentially
expressed gene as important in a particular state, screening of modulators of the expression of
the gene or the gene product itself can be done. The gene products of differentially expressed
genes are sometimes referred to herein as "lung cancer proteins." The lung cancer protein
may be a fragment or, alternatively, the full length protein corresponding to a fragment shown herein.

In one embodiment, screening for modulators of expression of specific genes is
performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
the activity of the differentially expressed proteins. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure-activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or isolated
gene product is used; that is, the gene products of one or more differentially expressed
nucleic acids are made. For example, antibodies are generated to the protein gene products,
and standard immunoassays are run to determine the amount of protein present. Alternatively,
cells comprising the lung cancer proteins can be used in the assays.

Thus, in a preferred embodiment, the methods comprise combining a lung cancer
protein and a candidate compound, and determining the binding of the compound to the lung
cancer protein. Preferred embodiments utilize the human lung cancer protein, although other
mammalian proteins may also be used, e.g., for the development of animal models of human
disease. In some embodiments, as outlined herein, variant or derivative lung cancer proteins
may be used.

Generally, in a preferred embodiment of the methods herein, the lung cancer protein
or the candidate agent is non-diffusably bound to an insoluble support, preferably having
isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble
supports may be made of any composition to which the compositions can be bound, that is readily
separated from soluble material, and that is otherwise compatible with the overall method of
screening. The surface of such supports may be solid or porous and of a convenient shape.
Examples of suitable insoluble supports include microtiter plates, arrays, membranes and
beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon
or nitrocellulose, Teflon™, etc. Microtiter plates and arrays are especially convenient because
a large number of assays can be carried out simultaneously, using small amounts of reagents
and samples. The particular manner of binding of the composition is typically not crucial so
long as it is compatible with the reagents and overall methods of the invention, maintains the
activity of the composition, and is nondiffusable. Preferred methods of binding include the
use of antibodies (which do not sterically block either the ligand binding site or activation
sequence when the protein is bound to the support), direct binding to "sticky" or ionic
supports, chemical crosslinking, the synthesis of the protein or agent on the surface, etc.
Following binding of the protein or agent, excess unbound material is removed by washing.
The sample receiving areas may then be blocked through incubation with bovine serum
albumin (BSA), casein or other innocuous protein or other moiety.

In a preferred embodiment, the lung cancer protein is bound to the support, and a test
compound is added to the assay. Alternatively, the candidate agent is bound to the support
and the lung cancer protein is added. Novel binding agents include specific antibodies,
non-natural binding agents identified in screens of chemical libraries, peptide analogs, etc. Of
particular interest are screening assays for agents that have a low toxicity for human cells. A
wide variety of assays may be used for this purpose, including labeled in vitro protein-protein
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding,
functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the test modulating compound to the lung cancer
protein may be done in a number of ways. In a preferred embodiment, the compound is
labeled, and binding determined directly, e.g., by attaching all or a portion of the lung cancer
protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), washing
off excess reagent, and determining whether the label is present on the solid support. Various
blocking and washing steps may be utilized as appropriate.

In some embodiments, only one of the components is labeled, e.g., the proteins (or
proteinaceous candidate compounds) can be labeled. Alternatively, more than one
component can be labeled with different labels, e.g., ¹²⁵I for the proteins and a fluorophore for
the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by competitive
binding assay. The competitor may be a binding moiety known to bind to the target molecule
(i.e., a lung cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under
certain circumstances, there may be competitive binding between the compound and the
binding moiety, with the binding moiety displacing the compound. In one embodiment, the
test compound is labeled. Either the compound, or the competitor, or both, is added first to
the protein for a time sufficient to allow binding, if present. Incubations may be performed at
a temperature which facilitates optimal activity, typically between 4 and 40°C. Incubation
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically
between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed
away. The second component is then added, and the presence or absence of the labeled
component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by a test
compound. Displacement of the competitor is an indication that the test compound is binding
to the lung cancer protein and thus is capable of binding to, and potentially modulating, the
activity of the lung cancer protein. In this embodiment, either component can be labeled.
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the
label on the support indicates displacement.

In an alternative embodiment, the test compound is added first, with incubation and
washing, followed by the competitor. The absence of binding by the competitor may indicate
that the test compound is bound to the lung cancer protein with a higher affinity. Thus, if the
test compound is labeled, the presence of the label on the support, coupled with a lack of
competitor binding, may indicate that the test compound is capable of binding to the lung
cancer protein.

In a preferred embodiment, the methods comprise differential screening to identify
agents that are capable of modulating the activity of the lung cancer proteins. In one
embodiment, the methods comprise combining a lung cancer protein and a competitor in a
first sample. A second sample comprises a test compound, a lung cancer protein, and a
competitor. The binding of the competitor is determined for both samples, and a change, or
difference in binding between the two samples indicates the presence of an agent capable of
binding to the lung cancer protein and potentially modulating its activity. That is, if the
binding of the competitor is different in the second sample relative to the first sample, the
agent is capable of binding to the lung cancer protein.
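
By way of non-limiting illustration, the two-sample comparison described above can be expressed as the fractional change in competitor binding between the first sample (protein and competitor) and the second sample (protein, competitor, and test compound). In the following Python sketch the signal values, the 20% cut-off used to call a difference, and the function names are hypothetical assumptions made for the example.

```python
def competitor_binding_change(first_sample_signal, second_sample_signal):
    """Fractional change in competitor binding caused by adding the test compound.

    first_sample_signal:  competitor binding with the protein alone.
    second_sample_signal: competitor binding with the protein plus test compound.
    A negative value means less competitor was bound in the second sample.
    """
    if first_sample_signal <= 0:
        raise ValueError("first sample must show measurable competitor binding")
    return (second_sample_signal - first_sample_signal) / first_sample_signal


def indicates_binding(first_sample_signal, second_sample_signal, min_fraction=0.2):
    """True if competitor binding differs by at least the assumed cut-off."""
    change = competitor_binding_change(first_sample_signal, second_sample_signal)
    return abs(change) >= min_fraction


# Hypothetical read-outs (arbitrary units) for two test compounds.
strong_change = competitor_binding_change(1000.0, 250.0)
compound_1_binds = indicates_binding(1000.0, 250.0)
compound_2_binds = indicates_binding(1000.0, 950.0)
```

The cut-off is an arbitrary placeholder; in practice it would be set from the variability of replicate control samples.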

Alternatively, differential screening is used to identify drug candidates that bind to the
native lung cancer protein, but cannot bind to modified lung cancer proteins. The structure of
the lung cancer protein may be modeled, and used in rational drug design to synthesize agents
that interact with that site. Drug candidates that affect the activity of a lung cancer protein
are also identified by screening drugs for the ability to either enhance or reduce the activity of
the protein.

Positive controls and negative controls may be used in the assays. Preferably control
and test samples are performed in at least triplicate to obtain statistically significant results.
Incubation of all samples is for a time sufficient for the binding of the agent to the protein.
Following incubation, samples are washed free of non-specifically bound material and the
amount of bound, generally labeled, agent is determined. For example, where a radiolabel is
employed, the samples may be counted in a scintillation counter to determine the amount of
bound compound.
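
By way of non-limiting illustration, triplicate scintillation counts for test and negative-control samples can be summarized by their mean and sample standard deviation, with the control mean subtracted to estimate specific binding. The following Python sketch uses only the standard library; the counts, the variable names, and the function names are hypothetical assumptions made for the example.

```python
import statistics


def summarize_replicates(counts):
    """Return (mean, sample standard deviation) for at least three replicates."""
    if len(counts) < 3:
        raise ValueError("at least triplicate measurements are expected")
    return statistics.mean(counts), statistics.stdev(counts)


def specific_binding(test_counts, negative_control_counts):
    """Mean test signal minus mean negative-control (background) signal."""
    test_mean, _ = summarize_replicates(test_counts)
    control_mean, _ = summarize_replicates(negative_control_counts)
    return test_mean - control_mean


# Hypothetical counts per minute from a scintillation counter.
test_cpm = [1180.0, 1200.0, 1220.0]
negative_cpm = [95.0, 100.0, 105.0]

test_mean, test_sd = summarize_replicates(test_cpm)
net_cpm = specific_binding(test_cpm, negative_cpm)
```

The sketch only summarizes the replicates; whether a difference is statistically significant would be assessed with an appropriate test chosen for the assay design.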

A variety of other reagents may be included in the screening assays. These include
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to
facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also reagents that otherwise improve the efficiency of the assay, such as
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture
of components may be added in an order that provides for the requisite binding.

In a preferred embodiment, the invention provides methods for screening for a
compound capable of modulating the activity of a lung cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising lung cancer
proteins. Preferred cell types include almost any cell. The cells contain a recombinant
nucleic acid that encodes a lung cancer protein. In a preferred embodiment, a library of
candidate agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence of, or after previous or
subsequent exposure to, physiological signals, e.g., hormones, antibodies, peptides, antigens,
cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle process.

In this way, compounds that modulate lung cancer agents are identified. Compounds
with pharmacological activity are able to enhance or interfere with the activity of the lung
cancer protein. Once identified, similar structures are evaluated to identify critical structural
features of the compound.

In one embodiment, a method of inhibiting lung cancer cell division is provided. The
method comprises administration of a lung cancer inhibitor. In another embodiment, a
method of inhibiting lung cancer is provided. The method may comprise administration of a
lung cancer inhibitor. In a further embodiment, methods of treating cells or individuals with
lung cancer are provided, e.g., comprising administration of a lung cancer inhibitor.

In one embodiment, a lung cancer inhibitor is an antibody as discussed above. In
another embodiment, the lung cancer inhibitor is an antisense molecule.

A variety of cell growth, proliferation, viability, and metastasis assays are known to
those of skill in the art, as described below.

Soft agar growth or colony formation in suspension 
Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of lung cancer sequences, which when expressed in host cells, inhibit abnormal
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique (3rd ed.), 
herein incorporated by reference. See also the methods section of Garkavtsev, et al. (1996), 
supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 
touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with (³H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with (³H)-thymidine at saturation density is a preferred 
method of measuring density limitation of growth. Transformed host cells are transfected 
with a lung cancer-associated sequence and are grown for 24 hours at saturation density in 
non-limiting medium conditions. The percentage of cells labeling with (³H)-thymidine is 
determined autoradiographically. See, Freshney (1994), supra. 
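
By way of illustration only, the labeling index described above can be expressed as the 
percentage of counted cells that carry label. The following is a minimal sketch; the function 
name and the cell counts are hypothetical and are not taken from any cited reference. 

```python
def labeling_index(labeled_cells: int, total_cells: int) -> float:
    """Return the percentage of counted cells that are labeled."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must lie between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells


# Hypothetical autoradiography counts at saturation density.
control_index = labeling_index(labeled_cells=180, total_cells=400)
transfected_index = labeling_index(labeled_cells=60, total_cells=400)
print(f"control: {control_index:.1f}%  transfected: {transfected_index:.1f}%")
```

A lower index in the transfected cells than in the control would be consistent with 
restored density limitation of growth. 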

Growth factor or serum dependence 
Transformed cells typically have a lower serum dependence than their normal 
counterparts (see, e.g., Temin (1966) J. Natl. Cancer Inst. 37:167-175; Eagle, et al. (1970) J. 
Exp. Med. 131:836-879; Freshney, supra). This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control cells. 

Tumor-specific marker levels 

Tumor cells release a greater amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
Gullino, "Angiogenesis, tumor vascularization, and potential interference with tumor growth" 
in Mihich (ed. 1985) Biological Responses in Cancer, pp. 178-184). Similarly, tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than in their normal 
counterparts. See, e.g., Folkman (1992) "Angiogenesis and Cancer" in Sem. Cancer Biol. 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 
Strickland and Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al. (1980) Br. J. Cancer 
42:305-312; Gullino, "Angiogenesis, tumor vascularization, and potential interference with 
tumor growth" in Mihich (ed. 1985) Biological Responses in Cancer, pp. 178-184; and Freshney 
(1985) Anticancer Res. 5:111-130. 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate lung 
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the level of 
invasion of host cells can be measured by using filters coated with Matrigel or some other 
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 
filter, is rated as invasiveness, and rated histologically by number of cells and distance 
moved, or by prelabeling the cells with ¹²⁵I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

Effects of lung cancer-associated sequences on cell growth can be tested in transgenic 
or immune-suppressed mice. Knock-out transgenic mice can be made, in which the lung 
cancer gene is disrupted or in which a lung cancer gene is inserted. Knock-out transgenic 
mice can be made by insertion of a marker gene or other heterologous gene into the 
endogenous lung cancer gene site in the mouse genome via homologous recombination. 
Such mice can also be made by substituting the endogenous lung cancer gene with a mutated 
version of the lung cancer gene, or by mutating the endogenous lung cancer gene, e.g., by 
exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g., Capecchi, et al. (1989) Science 244:1288). Chimeric targeted mice can be 
derived according to Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory 
Manual, Cold Spring Harbor Laboratory; and Robertson (ed. 1987) Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, IRL Press, Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be 
used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J. 
Natl. Cancer Inst. 52:921), a SCID mouse, a thymectomized mouse, or an irradiated mouse 
(see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263; Selby, et al. (1980) Br. J. Cancer 41:52) 
can be used as a host. Transplantable tumor cells (typically about 10⁶ cells) injected into 
isogenic hosts will produce invasive tumors in a high proportion of cases, while normal cells 
of similar origin will not. In hosts which developed invasive tumors, cells expressing a lung 
cancer-associated sequence are injected subcutaneously. After a suitable length of time, 
preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two largest 
dimensions) and compared to the control. Tumors that have statistically significant reduction 
(using, e.g., Student's t test) are said to have inhibited growth. 
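
The comparison of tumor growth against the control can be sketched as follows. This is an 
illustrative example only. It assumes the commonly used ellipsoid approximation 
V = L x W^2 / 2 for estimating volume from the two largest dimensions, and a pooled-variance 
(equal variance) two-sample Student's t statistic; neither choice is prescribed by the text 
above, and all names and measurements are hypothetical. 

```python
from math import sqrt
from statistics import mean, variance


def tumor_volume(length_mm: float, width_mm: float) -> float:
    """Estimate volume from the two largest dimensions.

    Assumption: ellipsoid approximation V = L * W**2 / 2.
    """
    return length_mm * width_mm ** 2 / 2.0


def student_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom) for a pooled-variance t test."""
    n_a, n_b = len(sample_a), len(sample_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two measurements")
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(sample_a)
                  + (n_b - 1) * variance(sample_b)) / df
    t = (mean(sample_a) - mean(sample_b)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return t, df


# Hypothetical (length, width) measurements in mm after the growth period.
control = [tumor_volume(l, w) for l, w in [(12, 9), (11, 10), (13, 9), (12, 10)]]
treated = [tumor_volume(l, w) for l, w in [(8, 6), (9, 6), (7, 5), (8, 7)]]

t_stat, dof = student_t(treated, control)
print(f"t = {t_stat:.2f} with {dof} degrees of freedom")
```

The resulting statistic would then be compared against a critical value of the t 
distribution for the chosen significance level and degrees of freedom. 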

Polynucleotide modulators of lung cancer 

Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a lung cancer-associated protein is 
downregulated, or entirely inhibited, by the use of antisense or an inhibitory polynucleotide, 
i.e., a nucleic acid complementary to, and which can preferably hybridize specifically to, a 
coding mRNA nucleic acid sequence, e.g., a lung cancer protein mRNA, or a subsequence 
thereof. Binding of the antisense polynucleotide to the mRNA reduces the translation and/or 
stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring subunits 
or their close homologs. Antisense polynucleotides may also have altered sugar moieties or 
inter-sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing 
species which are known for use in the art. Analogs are comprehended by this invention so 
long as they function effectively to hybridize with the lung cancer protein mRNA. See, e.g., 
Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant means, 
or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense oligonucleotides. 
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the 
antisense strand. The antisense and sense oligonucleotides comprise a single-stranded nucleic 
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for lung cancer molecules. A preferred antisense molecule is for a lung 
cancer sequence in the tables, or for a ligand or activator thereof. Antisense or sense 
oligonucleotides, according to the present invention, comprise a fragment generally at least 
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an 
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein, 
is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659 and van der Krol, et al. 
(1988) BioTechniques 6:958. 
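
As a purely illustrative sketch of deriving candidate antisense oligonucleotides from a 
cDNA sequence, the following enumerates every window of a chosen length (within the 14 to 
30 nucleotide range stated above) and returns its reverse complement as a DNA oligonucleotide. 
The function name and the input sequence are hypothetical; no selection for specificity, 
secondary structure, or chemistry is attempted. 

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligos(cdna: str, length: int = 20):
    """Return (start, oligo) pairs; each oligo is the reverse complement
    of the sense-strand window beginning at `start`."""
    cdna = cdna.upper()
    if not 14 <= length <= 30:
        raise ValueError("length must be between 14 and 30 nucleotides")
    if set(cdna) - set("ACGT"):
        raise ValueError("cdna may contain only A, C, G, and T")
    return [
        (start, cdna[start:start + length].translate(COMPLEMENT)[::-1])
        for start in range(len(cdna) - length + 1)
    ]


# Hypothetical sense-strand fragment, not a sequence from the tables.
for start, oligo in antisense_oligos("ATGGCCAAGTTCGAGCTGACC", length=18):
    print(start, oligo)
```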

RNA interference is a mechanism to suppress gene expression in a sequence specific 
manner. See, e.g., Brummelkamp, et al. (2002) Sciencexpress (21 March 2002); Sharp (1999) 
Genes Dev. 13:139-141; and Carthew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian 
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to 
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 411:494- 
498. The mechanism may be used to downregulate expression levels of identified genes, e.g., 
for treatment of or validation of relevance to disease. 



Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of lung cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 
been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axehead ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 
25:289-317 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 
5,254,678. Methods of preparing them are well known to those of skill in the art (see, e.g., WO 
94/26877; Ojwang, et al. (1993) Proc. Natl. Acad. Sci. USA 90:6340-6344; Yamada, et al. 
(1994) Human Gene Therapy 1:39-45; Leavitt, et al. (1995) Proc. Natl. Acad. Sci. USA 
92:699-703; Leavitt, et al. (1994) Human Gene Therapy 5:1151-120; and Yamada, et al. 
(1994) Virology 205:121-126). 

Polynucleotide modulators of lung cancer may be introduced into a cell containing the 
target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of lung 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that antisense molecules or knock-out and knock-in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating lung cancer in cells or organisms 
are provided. In one embodiment, the methods comprise administering to a cell an anti-lung 
cancer antibody that reduces or eliminates the biological activity of an endogenous lung 
cancer protein. Alternatively, the methods comprise administering to a cell or organism a 
recombinant nucleic acid encoding a lung cancer protein. This may be accomplished in any 
number of ways. In a preferred embodiment, e.g., when the lung cancer sequence is 
down-regulated in lung cancer, such state may be reversed by increasing the amount of lung cancer 
gene product in the cell. This can be accomplished, e.g., by overexpressing the endogenous 
lung cancer gene or administering a gene encoding the lung cancer sequence, using known 
gene-therapy techniques. In a preferred embodiment, the gene therapy techniques include the 
incorporation of the exogenous gene using enhanced homologous recombination (EHR), e.g., 
as described in PCT/US93/03868, hereby incorporated by reference in its entirety. 
Alternatively, e.g., when the lung cancer sequence is up-regulated in lung cancer, the activity 
of the endogenous lung cancer gene is decreased, e.g., by the administration of a lung cancer 
antisense or RNAi nucleic acid. 

In one embodiment, the lung cancer proteins of the present invention may be used to 
generate polyclonal and monoclonal antibodies to lung cancer proteins. Similarly, the lung 
cancer proteins can be coupled, using standard technology, to affinity chromatography 
columns. These columns may then be used to purify lung cancer antibodies useful for 
production, diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies 
are generated to epitopes unique to a lung cancer protein; that is, the antibodies show little or 
no cross-reactivity to other proteins. The lung cancer antibodies may be coupled to standard 
affinity chromatography columns and used to purify lung cancer proteins. The antibodies 
may also be used as blocking polypeptides, as outlined above, since they will specifically 
bind to the lung cancer protein. 

Methods of identifying variant lung cancer-associated sequences 

Without being bound by theory, expression of various lung cancer sequences is 
correlated with lung cancer. Accordingly, disorders based on mutant or variant lung cancer 
genes may be determined. In one embodiment, the invention provides methods for 
identifying cells containing variant lung cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous lung cancer gene in a cell. In a preferred embodiment, 
the invention provides methods of identifying the lung cancer genotype of an individual, e.g., 
determining all or part of the sequence of at least one lung cancer gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced lung cancer gene to a known lung cancer gene, i.e., 
a wild-type gene. 

The sequence of all or part of the lung cancer gene can then be compared to the 
sequence of a known lung cancer gene to determine if any differences exist. This can be 
done using known homology programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the lung cancer gene of the patient and the 
known lung cancer gene correlates with a disease state or a propensity for a disease state, as 
outlined herein. 
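
The comparison of a determined sequence with a known sequence can be sketched as follows. 
This illustration is not Bestfit and does not perform a scored biological alignment; it 
assumes that a simple longest-matching-block comparison from the Python standard library 
(difflib) is adequate for short, closely related sequences. The function name and the 
sequences are hypothetical. 

```python
from difflib import SequenceMatcher


def sequence_differences(reference: str, sample: str):
    """List (kind, reference_position, reference_bases, sample_bases)
    for every region where the sample departs from the reference."""
    matcher = SequenceMatcher(None, reference, sample, autojunk=False)
    return [
        (tag, i1, reference[i1:i2], sample[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


# Hypothetical wild-type fragment and a sample with one substitution.
wild_type = "ATGGCCAAG"
patient = "ATGGACAAG"
for difference in sequence_differences(wild_type, patient):
    print(difference)
```

An empty list indicates that no differences were found between the two sequences. 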

In a preferred embodiment, the lung cancer genes are used as probes to determine the 
number of copies of the lung cancer gene in the genome. 

In another preferred embodiment, the lung cancer genes are used as probes to 
determine the chromosomal localization of the lung cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis, in particular when 
chromosomal abnormalities such as translocations and the like are identified in the lung 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a lung cancer protein or 
modulator thereof is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces the effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel, et al. (1992) Pharmaceutical Dosage Forms and Drug 
Delivery; Lieberman, Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd (1999) The Art, Science and Technology of 
Pharmaceutical Compounding; and Pickar (1999) Dosage Calculations). Adjustments for 
lung cancer degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
drug interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

A "patient" for the purposes of the present invention includes both humans and other 
animals, particularly mammals. Thus the methods are applicable to both human therapy and 
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the lung cancer proteins and modulators thereof of the present 
invention can be done in a variety of ways, including, but not limited to, orally, 
subcutaneously, intravenously, intranasally, transdermally, intraperitoneally, intramuscularly, 
intrapulmonary, vaginally, rectally, or intraocularly. In some instances, e.g., in the treatment 
of wounds and inflammation, the lung cancer proteins and modulators may be directly 
applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a lung cancer 
protein in a form suitable for administration to a patient. In the preferred embodiment, the 
pharmaceutical compositions are in a water soluble form, such as being present as 
pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; 
coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that lung cancer protein modulators (e.g., antibodies, antisense 
constructs, ribozymes, small organic molecules, etc.) when administered orally, should be 
protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a lung cancer protein 
modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier. 
A variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions 
are sterile and generally free of undesirable matter. These compositions may be sterilized by 
conventional, well known sterilization techniques. The compositions may contain 
pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents and the like, 
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate 
and the like. The concentration of active agent in these formulations can vary widely, and 
will be selected primarily based on fluid volumes, viscosities, body weight and the like in 
accordance with the particular mode of administration selected and the patient's needs (see, e.g., 
Remington's Pharmaceutical Science (15th ed., 1980) and Hardman, et al. (eds. 1996) 
Goodman and Gilman: The Pharmacological Basis of Therapeutics). 

Thus, a typical pharmaceutical composition for intravenous administration would be 
about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient per 
day may be used, particularly when the drug is administered to a secluded site and not into 
the blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 
administrable compositions will be known or apparent to those skilled in the art, e.g., 
Remington's Pharmaceutical Science and Goodman and Gilman, The Pharmacological Basis 
of Therapeutics, supra. 

The compositions containing modulators of lung cancer proteins can be administered 
for therapeutic or prophylactic treatments. In therapeutic applications, compositions are 
administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficient to 
cure or at least partially arrest the disease and its complications. An amount adequate to 
accomplish this is defined as a "therapeutically effective dose." Amounts effective for this 
use will depend upon the severity of the disease and the general state of the patient's health. 
Single or multiple administrations of the compositions may be given depending on the 
dosage and frequency as required and tolerated by the patient. In any event, the composition 
should provide a sufficient quantity of the agents of this invention to effectively treat the 
patient. An amount of modulator that is capable of preventing or slowing the development of 
cancer in a mammal is referred to as a "prophylactically effective dose." The particular dose 
required for a prophylactic treatment will depend upon the medical condition and history of 
the mammal, the particular cancer being prevented, as well as other factors such as age, 
weight, gender, administration route, efficiency, etc. Such prophylactic treatments may be 
used, e.g., in a mammal who has previously had cancer to prevent a recurrence of the cancer, 
or in a mammal who is suspected of having a significant likelihood of developing cancer 
based, at least in part, upon gene expression profiles. Vaccine strategies may be used, in 
either a DNA vaccine form, or protein vaccine. 

It will be appreciated that the present lung cancer protein-modulating compounds can 
be administered alone or in combination with additional lung cancer modulating compounds 
or with other therapeutic agents, e.g., other anti-cancer agents or treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in the tables, such as antisense or RNAi 
polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present 
invention provides methods, reagents, vectors, and cells useful for expression of lung 
cancer-associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo, or in vivo 
(cell or organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasmid vectors, viral vectors and other well known methods for introducing cloned genomic 
DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., 
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 (Berger); Ausubel, et al. (eds. 1999) Current Protocols (supplemented through 
1999); and Sambrook, et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 
1-3)). 

In a preferred embodiment, lung cancer proteins and modulators are administered as 
therapeutic agents, and can be formulated as outlined above. Similarly, lung cancer genes 
(including both the full-length sequence, partial sequences, or regulatory sequences of the 
lung cancer coding regions) can be administered in a gene therapy application. These lung 
cancer genes can include antisense or inhibitory applications, e.g., as inhibitory RNA or gene 
therapy (e.g., for incorporation into the genome) or as antisense compositions. 

Lung cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses. Such vaccine compositions 
can include, e.g., lipidated peptides (see, e.g., Vitiello, et al. (1995) J. Clin. Invest. 95:341), 
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 
(see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine 
12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions contained in 
immune stimulating complexes (ISCOMS) (see, e.g., Takahashi, et al. (1990) Nature 
344:873-875; Hu, et al. (1998) Clin. Exp. Immunol. 113:235-243), multiple antigen peptide 
systems (MAPs) (see, e.g., Tam (1988) Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413; Tam 
(1996) J. Immunol. Methods 196:17-32), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 
vectors (Perkus, et al., p. 379 In: Kaufmann (ed. 1996) Concepts in vaccine development; 
Chakrabarti, et al. (1986) Nature 320:535; Hu, et al. (1986) Nature 320:537; Kieny, et al. 
(1986) AIDS Bio/Technology 4:790; Top, et al. (1971) J. Infect. Dis. 124:148; Chanda, et al. 
(1990) Virology 175:535), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) 
J. Immunol. Methods 192:25; Eldridge, et al. (1993) Sem. Hematol. 30:16; Falo, et al. (1995) 
Nature Med. 7:649), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 4:369; Gupta, et 
al. (1993) Vaccine 11:293), liposomes (Reddy, et al. (1992) J. Immunol. 148:1585; Rock 
(1996) Immunol. Today 17:131), or naked or particle absorbed cDNA (Ulmer, et al. (1993) 
Science 259:1745; Robinson, et al. (1993) Vaccine 11:957; Shiver, et al., p. 423 In: 
Kaufmann (ed. 1996) Concepts in vaccine development; Cease and Berzofsky (1994) Annu. 
Rev. Immunol. 12:923; and Eldridge, et al. (1993) Sem. Hematol. 30:16). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 
mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 
encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et al. (1990) Science 247:1465 as well as 
U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; WO 
98/04720; and in more detail below. Examples of DNA-based delivery technologies include 
"naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery, cationic lipid 
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. 
Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include 
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode lung cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover, et al. (1991) Nature 351:456-460. A wide variety of other vectors useful 
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus 
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the 
like, will be apparent to those skilled in the art from the description herein (see, e.g., Shata, et 
al. (2000) Mol. Med. Today 6:66-71; Shedlock, et al. (2000) J. Leukoc. Biol. 68:793-806; 
Hipp, et al. (2000) In Vivo 14:571-85). 

Methods for the use of genes as DNA vaccines are well known, and include placing a
lung cancer gene or portion of a lung cancer gene under the control of a regulatable promoter
or a tissue-specific promoter for expression in a lung cancer patient. The lung cancer gene
used for DNA vaccines can encode full-length lung cancer proteins, but more preferably
encodes portions of the lung cancer proteins including peptides derived from the lung cancer
protein. In one embodiment, a patient is immunized with a DNA vaccine comprising a
plurality of nucleotide sequences derived from a lung cancer gene. For example, lung
cancer-associated genes or sequences encoding subfragments of a lung cancer protein are
introduced into expression vectors and tested for their immunogenicity in the context of
Class I MHC and an ability to generate cytotoxic T cell responses. This procedure provides
for production of cytotoxic T cell responses against cells which present antigen, including
intracellular epitopes.

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the lung cancer polypeptide encoded by the DNA vaccine. 
Additional or alternative adjuvants are available. 

In another preferred embodiment, lung cancer genes find use in generating animal
models of lung cancer. When the lung cancer gene identified is repressed or diminished in
metastatic tissue, gene therapy technology, e.g., wherein antisense or inhibitory RNA is
directed to the lung cancer gene, will also diminish or repress expression of the gene. Animal
models of lung cancer find use in screening for modulators of a lung cancer-associated
sequence or modulators of lung cancer. Similarly, transgenic animal technology including
gene knockout technology, e.g., as a result of homologous recombination with an appropriate
gene targeting vector, will result in the absence or increased expression of the lung cancer
protein. When desired, tissue-specific expression or knockout of the lung cancer protein may
be necessary.

It is also possible that the lung cancer protein is overexpressed in lung cancer. As
such, transgenic animals can be generated that overexpress the lung cancer protein.
Depending on the desired expression level, promoters of various strengths can be employed
to express the transgene. Also, the number of copies of the integrated transgene can be
determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods will find use as animal models of lung cancer and are
additionally useful in screening for modulators to treat lung cancer.

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits are
also provided by the invention. In diagnostic and research applications such kits may include
at least one of the following: assay reagents, buffers, lung cancer-specific nucleic acids or
antibodies, hybridization probes and/or primers, antisense polynucleotides, ribozymes, RNAi,
dominant negative lung cancer polypeptides or polynucleotides, small molecule inhibitors of
lung cancer-associated sequences, etc. A therapeutic product may include sterile saline or
another pharmaceutically acceptable emulsion and suspension base.

In addition, the kits may include instructional materials containing instructions (e.g.,
protocols) for the practice of the methods of this invention. While the instructional materials
typically comprise written or printed materials, they are not limited to such. A medium
capable of storing such instructions and communicating them to an end user is contemplated
by this invention. Such media include, but are not limited to, electronic storage media (e.g.,
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such
media may include addresses to internet sites that provide such instructional materials.

The present invention also provides for kits for screening for modulators of lung
cancer-associated sequences. Such kits can be prepared from readily available materials and
reagents. For example, such kits can comprise one or more of the following materials: a lung
cancer-associated polypeptide or polynucleotide, reaction tubes, and instructions for testing
lung cancer-associated activity. Optionally, the kit contains biologically active lung cancer
protein. A wide variety of kits and components can be prepared according to the present
invention, depending upon the intended user of the kit and the particular needs of the user.
Diagnosis would typically involve evaluation of a plurality of genes or products. The genes
typically will be selected based on correlations with important parameters in disease which
may be identified in historical or outcome data.

Example 1: Gene Chip Analysis

Molecular profiles of various normal and cancerous tissues were determined and
analyzed using gene chips. RNA was isolated and gene chip analysis was performed as
described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev.
14:981-993).

Tables 1A and 1B were previously filed on April 18, 2001 in USSN 60/284,770
(18501-001500US) and on November 29, 2001 in USSN 60/334,370 (18501-001520US).










Table 1A 












PKey 


ExAccn 


UnigeneID


Unigene Title


7 AO/ .dnniong/ kll 

/o% cnron/yu% nl 


yryar or»A HJOfW KM 


100134 


013264 


Hs.49 


macrophage scavenger receptor 1 


l.Ol 


n 7 a 


100780 


HG3731-HT4001 


""Immunoglobulin Heavy Chain, Vdjrc Reg 


9 CQ 


9 90 

QUO 


100971 


J02874 


Hs.83213 


fatty acid binding protein 4; adipocyte


1.30 


U.14 


101088 


L05568 


Hs.553 


solute carrier family 6 (neurotransmitte


n to 
U./9 




101102 


UD7594 


Hs.79059 


transforming growth factor; beta recepto 


9 cc 


4 

1 


101168 


L15388 


Hs.211569 


G protein-coupled receptor kinase 5 


U.00 


ft 97 


101277 


L38486 


Hs.118223 


microfibrillar-associated protein 4


U.09 


ft 9R 


101330 


L43821 


Hs.80261 


enhancer of filamentation 1 (cas-like do


n co 
u.by 


ft 9Q 


101336 


L49169 


Hs.75678 


FBJ murine osteosarcoma viral oncogene h 


1 1 tr 
1.10 


ft A*. 
U.41 


101345 


L76380 


Hs.152175 


calcitonin receptor-like


n 01 


ft 94 


101678 


M62505 


Hs.2161 


complement component 5 receptor 1 (C5a I 


1 94 

l.«Jl 


n 77 

0.// 


101764 


M80563 


Hs.81256 


S100 calcium-binding protein A4 (calcium


1 AA 
1.44 


ft 09 


101771 


M81750 


Hs.153837 


myeloid cell nuclear differentiation ant


0.96 


0.45 


101642 


M93221 


Hs.75182 


mannose receptor; C type 1


4 T7 


0.37 


102283 


U31384 


Hs.83381 


guanine nucleotide binding protein 11 


1.04 


0.3 


102363 


U39447 


Hs.198241 


amine oxidase; copper containing 3 (vase 


0.96 


0.26 


109017 


U52154 


Hs. 193044 


potassium inwardly-rectifying channel; s


2.81 


3.45 


102698 


U75272 


Hs.1867 


progastricsin (pepsinogen C) 


0.95 


0.23 


103025 


X54131 


Hs.123641 


protein tyrosine phosphatase; receptor t 


1.62 


0.21 


103280 


X79981 


HsJ6206 


cadherin 5; VE-cadherin (vascular epithe 


0.9 


0.41 


103496 


Y09267 


Hs. 132821 


flavin containing monooxygenase 2 


1.27 


0.49


103541 


211697 , 


Hs.79197 . 


CD83 antigen (activated B lymphocytes; i


1.86 


1 


103554 


Z18951 


Hs.74034 


caveolin 1; caveolae protein; 22kD


1.27 - 


A At 
U.47 


104212 


AB002298 


Hs.173035 


KIAA0300 protein 


1.17 


0.16


104691 


AA011176 


Hs.37744 


bo IS 


1 no 
I.Uo 


ft 9C 
U.CO 


104825 


AA035613 


Hs.141883 


ESTs 


0.75 


0.27 


104857 


AA043219 


Ks.19058 


ESTs 


2.6 


3.3 


1 (Unfit; 


M045136 


Hs.22575 


ESTs 


1.23 


0.49 


104989 


AA102098 


Hs.1 18615 


ESTs 


0.63 


0.32 


10C79Q 


AA292694 


Hs.3807 


ESTs; Weakly similar to PHOSPHOLEMMAN PR


0.86 


0.34 


10SJ47 


AA398606 


Hs.32241 


ESTs 


1.32 


0.4 




AA400979 


Hs.25691 


calcitonin receptor-like receptor activi 


0.78 


0.28 


1f1R4Qfl 


AA451861 


Hs.1 15537 


ESTs; Weakly similar to dipeptidase prec


1.2 


0.47 


tofiC9fi 
tuDOOD 


AA453997 


Hs.23804 


ESTs 


0.82 


0.15 


IUO0U9 


AA457718 


Hs.21103 


Homo sapiens mRNA; cDNA DKFZp564B076 (fr


0.99 


0.07 


106667 


AA461086 


Hs.16578 


ESTs 


1.17 


0.4 


106773 


AA478109 


Hs.1 88833 


ESTs 


1.46 


0.43 


1 0*770.7 
lUOf 9/ 


AM78962 


Hs.169943 


ESTs 


1.18 


0.32 


106844 


AA485055 


Hs.158213 


sperm associated antigen 6 


0.98 


0.51 . 


106870 


AA487576 


Hs.26530 


serum deprivation response (phosphatidyl 


1.05 


0.14 


105954 


AA496980 


Hs.204038 


ESTs 


1.25 


0.33 


107054 


AA600150 


Hs.14366 


ESTs 


1.11 


0.4 


WJttvDt, 


T30407 


Hs.4789 


ESTs; WeaWy similar to coadafive-stress 


1.07 


2.58 


IU/39# 


AA03681J 


Hs.165030 


ESTs 


0.7 


0.21


107997 


AA037388 


Hs.82223 


Human DNA sequence from clone 141H5 on c


1.02 


0.48 


108041 


AA041552 


Hs.61957 


ESTs 


1.44 


0.51 


108087 


AA045709 


Hs.40545 


ESTs 


1.98 


1 




AA074685 


Hs.67726 


macrophage receptor with collagenous str 


1.52 


0.72 


iftQA9c 

luO*WO 


AA078787 


Hs.194101 


ESTs 


2.53 


4 CO 

1.53 




AA081093 


Hs.68055 


ESTs 


1.56 


0.48 


109252 


AA194830 


Hs.85944 


ESTs 


2.69 


3.18 


1U3D3U 


F01534 


Hs.26981 


ESTs 


1.19 


0.65 


mofii9 


F03031 


Hs.27519 


ESTs 


1.01 


0.29 




H00656 


Hs.29792 


ESTs 


0.81 


0.15 


109893 


K04768 


Hs.30484 


ESTs 


1.44 


0.32 


1000 RZ 

IU39D4 


K09594 


Hs.1 0299 


ESTs 


0.62 


0.14 


unrwo. 


H16568 


Hs.23748 


ESTs 


1.01 


0.28 


11/VW7 
1 lUOJ/ 


N30796 


Hs.17424 


ESTs; Weakly similar to semaphorin F [H


1.1 


0.22 




M69825 


Hs.16762 


Homo sapiens mRNA; cDNA DKFZp564B2062 (f


1.26 


0.26 


11 11A1 


N80935 


Hs.22483 


ESTs 


1.57 


0.52 


11 ic.m 

11 IDiU 


R07856 


Hs.16355 


ESTs 


3.95 


1 


1117T7 
lll/«3f 


R25410 


Hs.9218 


ESTs 


0.97 


0.24 


■M9105 
1 10199 


T57112 




"■yc20g1 U1 Stratagene lung (#937210) 


1.22 


0.35 


11?9Vt 


T62979 


Hs.189813 


ESTs 


. 2.27 


0.45 


1 IOO*tU 


T90496 


Hs.16757 


ESTs 


1.06 


0.22 


113552 


T908B9 


Hs.16026 


ESTs 


1.16 


0.42 


113606 


T93093 


Hs.17125 


ESTs 


1.48 


0.7 


113695 


T96965 


Hs.17948 


CCTp 

cols 


1.54 


ft TO 

U.ZO 


113946 


W84753 


Hs.37896 


ESTs 


1.79 


0.72 


114251 


Z39898 


Hs.21948 


ESTs 


1.95 


0.25 


114359 


Z41589 


Hs.153483 


ESTs; Moderately similar to H1 chloride


1.42 


0.13 


115230 


AA278300 


Hs.182980 


ESTs 


2.62 


0.42 


115279 


AA279760 


Hs.63671 


ESTs 


1.79 


0.91 


115566 


AA398083 


Hs.43977 


ESTs 


0.86 


0.2 


115965 


AA446661 


Hs.173233 


ESTs 


0.79 


0.04 


116166 


AA461556 


Hs.202949 


KIAA1102 protein


2.29 


0.68 


116279 


M486073 


Hs.57362 


ESTs 


Z27 


0.78 • 


117023 


H88157 


Hs.41105 


ESTs 


1.36 


0.16 






WO 02/086443 



PCT/US02/12476 






11 Tina 
11/4U9 


U0OORQ 
1033339 


rts.44/oo 


PQTe 
CO IS 


1 Afi 

1.40 


OAR 


11 ROM 

1 I03UI 


(VSUf 19 


Hs.94445 


ESTs 


1.51 


4 


1 10301 


* W00Q0Q 
NS0003 


HS.034O0 


COTe 
CO IS 


1 3A 

1.04 


U.HO 


110/171 
1 IbU/ 0 


K04034 


115.433 1 4 


v-6fe svisn Giythroblsstosls virus E26 o 


1 14 


097 

v.4/ 


113441 


K901U3 




***iirOrtn4 4 e1 Cearoc falfif fn/ar enfoan 

yroug 1 1 .s i ooa/cs reia uver sptcen 


| a 04 


0.53 


11QR9A 
1 13044 


lA/7/COR 
W/4300 


U e 4QA 


sdvanced giycosytsbon end product-sped 


■J 


0.19 


11QRR1 
i 19001 


TVOUf 13 




PSTe- Morfaraffth/ fiknibv to tni ALU SUB 


1.83 


0.45 


120041 


W92775 


Ur COORD 
(15*39090 


ESTs 




0.55 


190,109 

1 4U1 04 


700O0Q 
400003 


US. 143w 1 3 


co i s, niyiuy sui war to im/vwooo ptuieui 


U.3 1 


0 37 

U.O/ 


120457 


AA9R1R7Q 

/v\43l3/ 3 


Lk 1R7R9A 
rl5. 10/ 04O 


ESTs 


1.87 


1.91 


191HA 
14 101** 


AA4YI97QQ 
MM4U4/39 


ns. 104000 


cois 


13 


0.31 


191RA1 
141040 


AAA17fY7R 
/W4 1 /U/O 


Ur 4Q07R7 


CO IS 


2.31 


0.68 


14109U 


AAA1fln7A 
AA410U/4 


MS. 1 1U400 


PQTe 
COlS 


1 A7 

1.4/ 


fiR1 
U.3 1 


19OC0O 
144000 


A/V431U0U 


Ue OjtQRO 

nS-J4O30 


innioiior oi una uinuing 4, ouiTunani neg 


1 0.1 
1. 01 


U.00 


100O7Q 
1403/0 


r»onRRi 

U4UO30 


Ue 17/19711 

nS.1 /U4/0 


PQTe 
to 1 S 


1 R.9 
1.34 


n ^9 

U.04 


144414 


nooouo 


Ue 1C10OO 

hS.1 31 04O 


CCTe • 

CO IS 


n oo 

U.30 


n or 

U.03 


19A0R7 
14403/ 


N444U1 




ywo/gu/^i naQuDii rwm uocnisa noinu 


1 90 
1.49 


•j 


144400 


N4U100 


Ue inORRfl 
n5. IU433U 


PQTe 
CO IS 


1.36 


0.7 


19R1R7 
14310/ 


VV4330U 


Uc 1 f!9<U1 
US. 1U4041 


PQTe 
CO IS 


1 AR 
1.40 


DRQ 

U.09 


19R17A 
1431/4 


wriror 

W31003 


Ue 931 n&9 
riS.40 IU04 


PQT 
COI 


o<n7 


3.76 


19RA09 


aaojvwoq 

AA3U0443 


Ue 1R0717 

ns.130/1/ 


PQTe 
CO IS 


1 *U 


0.3 


19RRA1 
143001 


AIA17RR7 
AJ41/DD/ 


Ue 0OO70 


PQTe 
CO IS 


1 RQ 
1X9 


n r_o 

U.00 


19RA04 
I 43601 


UOU300 




aM ktl IM1 ZRRftqp. nnnffvh hitman fohri hrarn 
MUMi 430U9O Vfionisun mflnan leiaj urain 


n qa 

U.94 


0.35 


1970/19 

14/ JU4 


RORORn 
rvoooou 


Lie 04070 
IT5.443/9 


ESTs 


3.02 


4.06 


197007 
14/ ou/ 


AA1RQ1R7 
SVW0330/ 


U» 19R749 
ITS. 140/ 14 


PQTe WeaWu etmilar In nil 9 hurmfhpHra 


1.01 


0.69 


127609 


AAfiOOCCQ 
MM044333 


ns. iouoio 


ESTs 


1.21 


0.32 


14/939 


AI0T19A71 
Ml 0U44/ 1 


u« 1 049Q9 
no. 144494 


PQTe 
CO IS 


2.5 


1 


140430 


ncoioo 

U04130 


Ue RfiOin 
nS.0a04U 


PQTe 
CO IS 


1 10. 

1. 10 


U.00 


140044 


AA4700AQ 


Ue m9RZ7 
nS.IU404/ 


PQTe 
CO 1 5 


1.45 


0.58 


140/09 


AAAfifiRfi7 
MA4O030/ 


Uc m^fiQR 

nS. 1 U3033 


P9Te 

CO IS 


1.1 


0.34 


140 /bo 


ArUl4300 


Ue 1 ARQOO 


cny moKiriB \yr\j mourj reccpior hikb 4 


1 1R 
1. 10 


n rr 

U.33 


140304 


DR1/V7R 
KOlU/o 


Ue 1/Y70R1 
nS.lU/001 


C0Te« Ut/iSh/ eiVnlfar it* Dan 9 lniararf!nrt 

CO I S, Mty lay Similar 10 KSpc inic/acuiiy 


OfU 
4.v4 


9A 
4.4 


19QflR7 

149U3/ 


A0440Q 


He 9147A9 
ns^i (4/44 


rJlWR9 anflnan /CAMPATU-1 on1 : n«nl 
WJW34 anuyoil \UrMVJrn 1 rr 1 allbywij 


1.77 


0.73 


ioooin 
14341U 


AA4U1D04 


Ue onoo/o 
nS.4U4343 


N/v\i 1U4 protein 


1 11 

l.ii 


n or 

U.00 


loooAn 

14944U 


W440OU 


Ue 907flRB 
n5-40/o00 


interleukin 7 receptor


noi 

U.ut 


n A1 

U.4I 


1494U4 


TR07R1 
100/01 




" n ur9inm e1 Qtrntan^np It inn OtOffliCW 
yc4 igu 1.51 ouaiag«no lung \ir90f tiuj 


1.36 


0.43 


190RRR 

143000 


X77777 


Ue 1QR70R 
nS.190f 40 


VooOaCuVc iniBSulkil pupiiUo leCcpUN 1 


0R7 

u.o/ 


u.uo 


143393 


AA40/U10 


Ue OQ91i4 
I1S.30O14 


Unmn cardane mPMA> eDMA rWP7nRfifil M9/1 /f 

nomo sapiens rnnivs curtM ur^Lpooouj i4u ^i 


1 ? 

- l.O 


U.44 


19QR9R 
143040 


/W»4f4IU 


Ue 11719 

ns. 1 1 / 14 


PQTe* VfaaMv cjmllar tn ltd Al 11 Ql IRFAW1 
CO IS, Wooniy Sllllllar \U ll.l rU-U OUurrUnl 


1.28 


0.46 


190RQQ 


AAARftR7fi 
AA4303/0 


Ue 19(117 

ns.i4ui / 


klA AflAOO nmtain* hnmnlrm rtf irbad hhlmt 

IWVW409 proioui, nomoiog 01 yeasi uoiqu 


1 Rfl 

1.30 


1 

1 




N4B595 


He IWIR 
n». 1 3430 


ESTs 


1.13 


0.53 


143900 


1 90.RQ1 
L4U33 f . 


He 1070 

ns.io/o 


annoxin mo 


U.O 1 


n ?1 

U.O 1 


19/W70 
lOUA/O 


1 1ROQ4 / 
U39314 


Ue 4C0DCO 

ns.looobo 


MAD {mothers agslnsl decapentdplegk^ Of 


n rq ' 


n 99 

U.44 


130655 


N92904 


Hs.17409 


cysteine-rich protein 1 (intestinal)


1 AA 
1.44 


n 7R 
u./o 


l 30657 


1 34434 


U« OA1CQ4 


COTe 

CoIS 


n qr 

U.3D 


n A9 

U.44 


lOlUOl 


N&4040 


Ue 90CC7 
nS.4430/ 


CCTe. kb*Anr**oki rimilw In UVDATUCTIPAI 

to is, MOQeraiery amuarto MYruintiiuAL 


1 R1 

1.01 


n ar 

U.43 


101DRR 


ruyuuo 


Uc 99RBR 
nSJC4300 


PQTe 
COIS 


nQ7 

U.3 1 


n 17 

u.o/ 


1019CO 

10 1400 


DO Q 004 
K0OOO4 


Ue o/ocn 

nS.4493V 


regulator of G-protein signalling 5


90A 
4.04 


9 R9 
4.04 


101 COO 
101309 


U341UU 


Ue 00401 
nS^49l3l 


epithelial membrane protein 2


1 9 

1.4 


n R9 

U.04 


101CQC 
lOlOOO 


A A1C7A0D 
AAlD/440 


u. 9nCD7 


Lao4-associateo oinoer 4 


AQR 
U.90 


n oo 

U.00 


1117*51 
I01 /3 1 


uiooor 
n toooD 


He 01 RR9 
nS.O 1 304 


PQTe 
COIS 


1.47 


0.52 


104401/ 


I40D0U 


Ue 9RAR7R 


PQT 
CO I 


1 RR 


2.09 


1 09/7R 
1344/0 


no/194 


Ue AQA7R 
nS.434/0 


Ufvmrt e^ntedte elArwi T1 1 A Q Pri Ail netfil 

nomo sapiens cione t uno L^n-ou-cnei rey i 


1 71 
l./O 


n rr 

U.30 


132636 


F09557 


He R7Q0Q 
1)5^/ 349 


eBt mmennhlloA hnmnlnn ^ 


0.91 


0^9 


10, von 

10J14U 


A043D3 


Uc RRAOA 
MS. 03444 


tEtranecBn [ptssnnnogsn-tinding protein 


n R9 

U.04 


n 9 

U.4 


100400 


L/430/U 


Ue 7A19n 
ns./4 14U 


adipose specific 2 


1 90 
1.49 


n ar 


Ioorrr 

100300 


noiuoo 


ns^irtooi 


PQTe 

CO lb 


2,25 


0.57 


101RR1 

100031 


1 !Q7in«i 
U9/1U3 


Uc 17O0R1 

ns. i / ooo i 




1.65 


0.62 


101R0R 

100003 


AAnCQ/QQ 
MAU39409 


Ue 7fifiin 


PQTe* UJnSKf elmllar In RfiO 09 1R nrnvon 

co 1 5, mgniy sutu wr id r\ou-Oi [ruiwrveg 


1.16 


0.34 


19.007ft 
1003/0 


Yit 0033 


He 7flJTR1 

ns./ ocio i 


transcription factor 21


0.79 


0.27 


10O0QC 
I009O3 


L0400/ 


Ue 7R1>4C 


platelet/endothelial cell adhesion molec


noo 

U.33 


n 9fl 

U.40 


104233 


AA40/030 


Ue 04 OC 

MS. 0103 


CCTe 
CO IS 


1 M 
1.U4 


n ar 

U.40 


1040UU 


1 101QQA 
U0l9O4 


Ue ICRHQO 

M5.1D0UO4 


enooine&ai kao oomam proiein x 


n rr 

U.00 


n A9 

U.44 


104 TOO 
I 04040 


AAn0ftQ7ft 


Uc A17R 
MS.0 1 id 


Urvmn can] one mDKIA' eTlMA nKP7nRRAMn7R0 ff 

noiuo sapiens rrmrMM, curtn ui\r z-povwivi do {i 


1 1Q 
1. 19 


n 97 


10/tOJO 

104040 


UOUOOO 


Ue ROPOfl 

MS.04U40 


uHrtsionTtirig growui lactor; oeta recepiD 


1 91 


n R7 

U.O/ 


10441/ 


U0/9O9 


Ue R9001 
nS.04341 


soi u is earner Tanary oo \umr-siaiic aa 


1 9R 

l .40 


4 

1 


4 0.4CR4 
104301 


1 17R/I01 


Ue ncono 

MS.030U4 


eaenosine oeanunsse, Kw^specinc, di (n 


9 19 
4.14 


n rr 

U.33 


104044 


W0/14f 


Ue 07nn 
MS.o/UU 


deleted in liver cancer 1


4.03 


9 7A 
4./4 


10ARQR 
104090 


UOO0K4 
Mo 0034 


Ue ftftftl 

ns.oooi 


PCTe 

cois 


1 oj; 

1.33 


noo 

U.00 


104/49 


L1U333 


M&.03400 


fir K/^ fite dfihurfraea /W 

GaDumc annyorasB iv 


0.89 


0.2 


11A7RR 


1 (1R1 0.Q 
LUO 1 03 


He pqfiAfj 
ns.0904'0 


TCV lumelna IrhtaeA* anftnlheltal /upnniie 

i crv tyiuauie luiuuo, ciiuuu itaiai yi cinua 


0.48 


0.21 


104003 


I 03400 


He QflA91 
nS.9U44 1 


PQTe* Mnrfprafplv eJmtlar fn IU1 Al U Rl IR 


ZH 


2.64 


10R14R 
103040 


M21056 


Hs.992 


phospholipase A2; group IB (pancreas) 


0.63 


0.13 


innm 


D00591 


Hs.84746 


wuoniosurriy cunuensouuii i 


•j 


2.15 


1/M1A7 

iuyi«*f 


D13666 


HS.13834B 


Umia eantane mRMA fnr neionhlael cnprifi 

nonuj sapiens iutvmm hji uoieuukioi opcun 


0.5 


2 


1UU40U 


D42085 


Hs.155314 


KIAA0095 gene product


1 n9 

1.U4 


1 oo 

1.09 


1fW30R 
I WOOD 


063391 


Hs.6793 


piaieiei~acuvaiing lacvoi ouoiyuiyuioia 


•J 


5.58 


100360 


D78335 


Ks.75939 


Uridine monophosphate kinase 


0.91 


104 


100372 


D79997 


Hs.184339 


KIAA0175 gene product


0.75 


Z03 


100466 


H61112-HT1112 


TIGR: ras^ke protein TC4* 
'collagen, type VII, dpha 1" 


1.09 


1.93 


100559 


HG21S7-HT2267 


0.97 


3.6 


100576 


HG2290-KT2386 


'calcitonin/alpha-CGRP, ait transcript 


1 


1 


100668 


HG2981-HT3938 


TIGR: CD44(epfcan.alt transcript 12 


0.85 


1.9 


100906 


HG4716-HT5158 


Guanosine 5'-Monophosphate Synthase


1.18 


Z29 


100930 


HG72LHT4827 




"TIGR: placental protein 14\ endometrial 


1 


1.45 






ioooro 


J00124 


Hs 117729 

rw» i if i bv 


keratin 14 (epidermolysis bullosa simple


0.84 


2.6 


101071 




Hs 151738 


"Malm mfiiannnmtftlnjisft 0 fnfliafinasa 


0.77 


1.52 


101111 
1U1 111 


LQB424 


Hs.1619 


Arhaptft-erartfi mmnlpt fnmeftrflfla^ homo! 


1 


1 


101194 


L10343 


Hs 112341 


'Pm^P.s^p fnhfhifnr 7 eknwtortveff ^SKA 


0.62 


2.67 


101175 


L18920 


Hs.36980 


"Mplanoma anfiaen familv A_ 2" 


1 


1 


101204 


L24203 


Hs 82237 


AlairfaJHbmnifvtssta nroi in D-associaied 


0.74 


4.1 


101431 


M19£ffi8 


Hs.1076 


Rm^ll nmitnft-rirh nrolfiffi 1B fcofniftnl 


0.85 


251 


101448 


M21389 


Hs.1 95850 


keratin 5 (epidermolysis bullosa simplex


. 0.61 


8.83 


101511 


M27826 


Hs .267319 


Endogenous retroviral protease


1.03 


1.13 


101526 


M29540 


Hs 220529 


Pflfrinownhivflntf* antiofirwRlated cell ad 


1.07 


4.61 


101548 


M31328 


Hs.71642 


"Guanine nucleotide binding protein (G p 


0.97 


1.13 


101625 


M57293 




"Human parathyroid bormone-relalBd peptl * 


1 


1 


101649 


M6D047 


Hs.1690 


Heparin-binding growth factor binding pr


1 


Z7 


101724 


M59225 


Hs.620 


bullous pemphigoid antigen 1 (230/240kD)


1 


8.98 


101748 


M76482 


Hs 1925 

no* 1 3£w 


Dfi<;mnnlpin 7 fnerrmhiaue vuloaris anfiopn 


1 


278 


IU1/09 


MR0944 


Ue 1R4501 
no. t o*tuu i 


■^ftltrtft nflmpr fnmSlv 7 fcaRnnlr amino 


1.07 


245 


101804 


M86699 


Hs 169840 


TTK protein kinase


1 


1 


101ROR 
I VJOUO 


MRR757 

MOO/ 3/ 


Hs,1 12408 


S100 calcium-binding protein A7 (psorias


0.74 


1.76 


101ROO 


MRRR4Q 




"Hnmn ^anipnc rnnnpyin 9R /GJ9?\ mRNA c 


1 


7 


101R45 


MQ3495 


He 7RR57 


"Pmtp in iumeJno nhnenhataep nprpntor- 


1 


1 


lUlOOl 


UQjI950 


He R9045 


KJTIrfVlro /nuitnto rrrou/JhunrnmAtinn fiWnr 
MUUUilc yllcUillc yiUWUr^JJUlIluUliy laWUJ 


1.13 


26 


1fi90R3 
lUZvOO 


1 110197 
U 1 U040 


Ue 75117 


"Intpripiikin pnhannpr hlnHmn fartnr 9 


103 


1.51 


IU4 (On 


1117750 
Uif row 


He 75517 


"f flminin hpJa 1 Inlcenn M 95kDl )ca!!ni 


0.94 


3.62 


109107 
IUZ ISO 


1 1907 55 
waif DO 


Hs.313 


cprjntpd nhncnhn nmtpin 1 fnetnftnonRn* 


0^34 


4.59 


409"*05 


U0O40D 


He Q0077 


ohmmnenmo cpnronatinn 1 h/pnet hnmnlnn^ 
CniDTiKfisOITra Sc^lcQaUUn 1 yFtSsPl (lUUKJtUyy 


1.45 


297 


102348 


U37519' 


He R757Q 


AlHphvHo Hphvrlrnnpnasp R 


0.52 


225 


109501 


UOl 149 


He 7795R 

ns./ ( 400 


Cnhanrfsr nf tocIo /rVr>ennhtla\ hnmn!(Vi 9 

cnndnuDi w 4csio ^uiuwpnudj uuiiuiuy c 


0.91 


246 


102610 


U65011 


Hs 30743 


Preferentially expressed antigen in mela


1 


3.88 


102623 


U66083 


Hs 37110 


"Melanoma antigen, family A, 9 (MAGE-9)" 


1 


1 


102669 


U71207 




Pvpe nKeftnt fDrnqonhila^ hnrrcloa 2 


1 


1 


109RQR 


1 17451 9 


Hs.239 


Forkhead box M1 


1.06 


277 


409R9Q 
1UZ043 


II0151R 


He R0QR9 


Mai rmloricifi 


\ 


1 


102888 


X04741 


He. 7R11R 


Ubiquitin carboxyl-terminal esterase L1


1.13 


259 


10901^ 
iuzy io 


Y07RQ5 
AIJS Dav 


He R0749 


U Pratt n 15 


0.7 


4.72 


10,9015 


Yn7fl90 


He 995R 
nS.4400 


Mafrre Mnfalirmm^nflce 10 ^trnmnlv^n 

WiairiA flftcldlfupiULclltdjU IU (OUUIIIUIjrottl 


1.15 


3.35 


109QR7 


¥1504? 


He 7705R 


"PalrKrmin/ralHfnnin-nMatpH nntvnprrfi 
"Intfiarln hpta4" 


\ 


1 


103021 


X53587 


He R5955 


1.38 


234 


IUOUjD 


¥54095 
A0n?40 


He R71RQ 


Matny mpfallnnmfpflen 1 fintpretHial c 
OUduiUI . 


1 


14.93 


inin^B 

IUOUOO 


¥57Q4fi 
AO'0*h) 


He 1RA510 


1.25 


4.17 


103060 


¥57755 


Hs.1 55394 


matrix metalloproteinase 11 (stromelysin
"Cadherin3 P-cariherin fnlacentafi" 


1 


1.72 


10111 B 




Hs.2877 


1.16 


7.38 


103206 


X72755 


Hs.77^7 


mnnnkinp InHiinfiH hv oamrm infftrferon 


0.71 


1.48 


103242 


X76342 


Hs,389 


'AbWinl dphvrirftopnasfi 7 f class l\A mil 


1 


1 


10^319 




He 71 Rfi 
na.o i oo 


■| umnhiwfp aniinftn fi mrnntpx Ificus D* 


0.92 


1.28 


103478 


Y07755 


Hs 38991 ' 


S100 calcium-binding protein A2


l!05 


5.81 


10355R 


71Q574 
4.190/4 


Hs.2785 


koratin 17 

PlOI dUI Mr 


0.65 


6.68 
1.73 


10357R 


79R717 


He9fil1 


npCTTWtlptn 9 
UBallFUyiCiM £ 


0.79 


ioi«ai7 

IUJ007 


79Q0P7 


■ He R91 9R 
rid. 04 140 


5T4 Dnrvrffital ontinon 
0 1 ^ vllwHclal aliuycll 


{ 


3.93 


1U03?H 


771550 


HeR1R 


•Qpv /cav rlotatmfninn ran inn W-hoY 9 n 


0.71 


7.23 


tUOYOO 


AA0R0QQ7 




"P<TTe Hinhh/ cimiJflf fn Intenrfl) membra 


0.99 


1.8 


1041 5R 


AA454Q0R 


Hs.8127 


IOAA0144 npnp nrndtirf 


o!96 


1.29 


lWooo 


DC5570 
FOOD So 




niimoJi unf\ sequence? hodi aonc ?or ru. i ui 


1.23 


7.23 


IfMRRQ 
lUWo? 


A AM 0555 




CO 18 


0.96 


211 


1047?^ 


AAMQ4QR 


He 97071 


ESTs 


l!l8 


1.88 


lUtaUO 


AA055ROO 

/vwooovy 


He 9RR09 


Pmfpin IrinacA nVtmatns nnntainfnfl [tfnfnf 


1.11 


3.15 


1U43/0 


AA0RRA5R 


He 10799 
fid. I9J44 


F^Tc' Wnaklv elmllar Id (HI Al IJ SUBFAM1 


1.64 


289 


105019 


AA1 15075 


He Q79Q • 


"Hnmn earrfpnc mRKIA fnr f1e751 mmnlplfi 


1.19 


3,91 


105175 


AA1R5R0A 
MnlODDUf 


He 95740 
ns.40/ w 


CCTp- Wnaktv c4milar fn tmknnwn rSjCbrav 
to io, iiwnij annual vj mnuHMVii 


0.9 


4.63 


1059RT 


AA227926 


He Rfift9 


ESTs 


0.95 


287 




&A977/IR0 

AA40M00 


Ue 95150 


C0 15 


\ 


1.13 


105719 
I UOO I Z 


AA973R54 


He 977AR 


Q-nhaeA Mnaefvaee/y<ia|Ad nmtpJn 2 1 d45 


1.32 


ioi 


105710 

I US' 10 


AA9Q1R44 


Hs.36793 


Hvnnthflltral nmfptn FL1931RR 


1.28 


231 


10574* 
lUO/40 


AA9Q7700 


He Q5QR 


CO 1 5 


1 


1 


105019 

7UDU 14 


AAA1 1 591 


He RRQ5 
ns.oo 3D 


FQTe' camp ae RFHR9 
CO 1 Si omi U3 da Dr no r 


0.94 


204 


mR971 


AA490571 


He 7R009 


10AA1\55 nmtfitn 


1.04 


1.5 


ioR54n 


AA454R07 


He 7R11Z 
ns.oo i m 


Hvnnthfllir al nrntflln FL1 1 1 100 


1.26 


226 


105575 
lUOOr O 


AA45R03Q 


He 105491 


COla 


y 


2 


1U00J4 


AAJCQOQ7 


He 11050 
ns. 1 law 


fZPLonr-hrimrl motoetae?e_aeenniatpjd nmfp 


0.87 


1.32 


105797 


AA455349 


He 74045 


Hvnnthpliral nrftteln F LI 90754 


0.87 


1.59 


1U03UO 


AAAQ0977 


He 999094 


Tranerrinfinn ferlnr RMAI 9 /rvHfi-HkR f 


0.61 


1.6 


107059 


AAR0R545 


Hq 91044 


RAD51 np.rpvf<;iae^ homntao fE co!i Re 


0.48 


267 


1071 04 


AA5nQ7R5 


He 1 5941 

ns. i o4to 


Mttrlonlar nmtoin 1 M9tTkm 


1.01 


1.44 


107151 
Ivf 10 1 


AA59115Q 
/vV04 1 ioy 


Hs.8667 


F^Te* nmrvtHanAn l-fvl nrntalnasR 


0.97 


289 






He 9Q1004 
nS-43 (HUH 


A/v^eenrw nmtpine RAP^1/RAP90 
fVOOSSSOff pUMCl'la Oflr J Itarv £.9 


1.15 


3.65 
3^44 


107001 


AA09R41R 


Ue 01570 

ni.5 1 iKJ3 


ESTs 


072 


107Q99 


AA09R09R 


He R14RD 
ris.o * *+o u 


In cunarfamilv ramntof I MIR Df&cursor 




248 


107932 


AA090717 


Hs 1B87R 
no* i uo t o 


HynnlheffTfll nrote'm FL121620 


1 


1 


108695 


AA1 21315 


Hs.70823 


KIAA1077 protein


0.91 


3.53 


108857 


AA133250 


Hs.62160 


ESTs 


1 


1 


108860 


AA133334 . 


Hs.129911 


ESTs 


0.73 


7.3 


108990 


AA152296 


Hs.72045 


ESTs 


1 


1 


109166 


AA1 79845 


Hs.73625 


"RAB6 interacting, kinesin-like (rabkine 


1 


4.55 


109424 


AA227919 


Hs.85962 


Hyaluronan synthase 3 


1 


1.28 


109665 


F05012 


H^27Q27 


Hypothetical protein DKFZp762H1311


1.42 


2 


109970 


H09281 


Hs.13234 


ESTs 


1.13 


216 






110015 


H10QQA 


H&7164 


A ffeinfporiri and mptalln nrri At n p*?p rioms 


0.84 


1.95 


110156 


H1R067 
n 1 030/ 


Hs.4213 


ESTs 


0.94 


1.41 


110561 


H59617 


Hs.5199 


HSPC1 50 pro lain slmn^ to ubkjuifin-con 


0.91 


3.18 


111223 


N68921 




PSTs* Woaktv similar to nBOapntn fH^aol 


0.91 


3,13 


111345 


N89820 


Hs.14559 


Hvnoiheikai orotein FL1 1 0540 
'ESTs Weaktv similar to outalrvBDl 50 f 


1 


1.25 


111876 


R38239 


Hs^93246 


0.83 


1.27 


111902 


R39191 


Hs.109445 


KIAA1020 protein


0.91 


0.91 


112244 




Hs70823 


KIAA1077 protein


0.77 


3.01 


112973 


T17271 




'cDMA FU1330Bfis. done 0VARC1 001436 


1 


1 


112989 


T23482 


Hs.69981 




0.55 


1.03 


113047 


T25867 


Hs.7549 


ESTs 


0.87 


2 


113095 


T40920 


Hs 126733 


ESTs 


1 


1 


113531 


T90345 


Hs. 16740 


Hypothetical protein F U 1 1 036 


0.42 


1.44 


113970 


W86748 


Hs.8109 


ESTs 


1.17 


1.73 


114346 


241450 


Hs. 130469 


"ATPase, amlTOphosphofipid trans porter-) 


0.86 


0.82 


114407 


AA010188 


Hs.103305 


ESTs 


0.8 


1.88 


114471 


AA02B074 


Hs. 104613 


RP42 homolog


1.06 


1.34 


114509 


AA043661 


Hs 101799 


KIAA1350 protein


1.82 


2.32 


115060 


AA253214 


Hs. 198249 


"Gap junction protein, beta 5 (connexln 


0,79 


1.49 


115091 


AA266Q0n 


Hs 184523 


KIAA0965 protein


0.72 


1.92 


115123 


AA256642 


Hs 236894 


'ESTs HiahsimtoLRPI hu tow density 1 


0.59 


1.97 


115291 


AA279943 


Hs 122579 


ESTs 


1 


1.25 


11^06 


AAOQOK07 


H«t 4*5907 


Hypothetical protein KIAA1335


1.15 


1.48 


116699 


AA3313Q3 

AnJO IOOO 


Hs.47378 


ESTs 


0.5 


3.29 


115536 


AA347193 


Hs 62180 


ESTs 


1 


1 


115697 


AM1 1502 


Hs 63325 




1 


6.53 


115909 


AA436666 


Hs 59761 

1 iO.«JO( vJ 1 


ESTs 


1 


6.98 


1 15978 


Ai 447699 


Hs 69517 




1 


2.31 


116028 


AA452112 


Hs.42644 


thlftrAfinvinJtkfi 


0.99 


1.68 


116107 


AA456QS8 


Hs.92030 


ESTs 


1.14 


1.8 


116134 


AA46094R 


Hs.50441 


CGL04 nmfeln 


1.11 


1.86 


116157 


AA4R10R3 


Hs.44298 


Hypothetical protein


0.99 


1.9 


11R1 6A 
1 ID 190 


AA4611R7 

/WtOI 10/ 


|4c R1762 


Hvnmda-inriiirlWp nmlArn 2 


0.44 


0.86 


1 IOOOO 


AA495830 


Hs 87013 


■Homo eflniens cDNA FU10238fis clone. H 


0.62 


3,89 


11D400 


P1/OQ9 


He 7fi11fl 
no.r01 10 


1 IWniiHIn narhnwLiormlnal oetarasA 1 1 
uuAjunin Coi iHMjruflii unci doua 000 l i 


1.04 


2.36 


117320 




Hs 21 1092 


LUNX Dfotebr PLUMC/nalatB luna & nasal 




0,64 


117557 


M33920 


Hs.44532 


Dinbimiiiin 


1.11 


2.63 


117fioq 

11/090 


1440939 


He 119110 


PTD007 protein


0.98 


1.79 


117RA1 

IllOOl 


IwUUf O 




RiitvralfvinHitr.f^l trsmserint 1 


1 


1.43 




N64339 


Hs 46956 


ESTs 


0.67 


2.86 


118566 


N68558 


Hs.42824


Hypothetical protein FLJ10716


1.21 


0.83 


1 10090 


1^71781 


He snnfii 


KIAA1 199 see CVA7 doe 

iwvii too we vvni .UVw 


0.8B 


1.63 




W72967 


Hs 191381 

MS. 1 O 1 ww 1 


ESTs; Weakly similar to hypothetical pro


1 


1 


1 1 30*10 


W7QQ90 

VV/OO&U 


nb. OOOO 1 


A nmf ArrwYW ir^vf nwMnfnr 67 
\j ytwait i^MuytfM lobopiui 01 


1 


1 


IZU IUt 


WQ649R 


He 139997 
no. ivwti 


"FSTe Modaratalv ermflar to nS3 nRfliifat 




1 




VVJjO**/ / 


He 1RA479 


ESTs 


0.69 


3.07 


120486 


AA253400 


Hs.137569


Tumor protein 63 kDa with strong homolog


toe 


12.05 


100050 


AA3501 58 


Hs.1619 


Achaete-scute complex (Drosophila) homol


1 


1 


l£UOOU 


AA360240 


Hs 97019 


EST 


1 


1 


190Q4R 


AA397B22 


He. 104650 
no* unuju 


HvnnihRficat nrnWn PI J 10999 


1.04 


215 


HOoOO 




He Q76R7 
no. 3/ oo / 


EST 


1 


1 


121362 


AA405500 


Hs.97932 


Chondromodulin I precursor


1 


1 


121369 


AA405657 


Hs.128791 


HC3-09 nrotein 


1 


1.8 


191701 


A A/J00Q7R 


He 293317 


■ccTe WmWv similar to JM27 fH sarnans 


1 


1 


193006 

ItOUUO 


AAA7Q796 


He 105577 
no. ivoo/ / 


ESTs 


1 




l£OU*H 


A AAR164Q 


He 13DAA1 
ns. 10000 1 




0.95 


1.68 


4001 en 
led I Ov 


AttHOOOO/ 


He 284235 


ESTs 


1.59 


4.98 


100/70 


a Acoqycq 


He 135056 

no ■ iojuju 


eirvtfi RPS-850F9 on cflmmasoma 20 

uiUJiv iw t/vwwt-v yu wiiiwiiiujyiiro 


1.19 


1.64 


I ZOO/ I 


AHQU093Q 


He 112619 
ns. 1 1 to 1 a 


ESTs, Weakly similar to PQ0109 Purkinje


1.03 


1.14 


1 £00Z3 


A A R9 OR 07 


He 119909 
no. 1 1 uuo 




1.39 


22 


ioaoor 

I14UU0 


r»R0309 


He MaQ77 
no. luoor / 


ESTs 


1 


4.85 


17ATWQ 


r 100/0 


He QQ76Q 


ESTs 


1.49 


8.62 


194QR0 
ItHvOv 


T15386 


He 194766 
no. 1 3*+/ 00 


CpVyifm related oane 6 /moiies\-Hkfi 

vOttlilv ICTBIwy Mdlv V ^iiiuuooy^mw 


0.76 


0.77 


10C01A 
llOzlO 


\AT70Rft1 
W/0001 


He 1100.94 




1.33 


1.77 


125453 


RQ6041 


He.18048 


"Melanoma antigen, family A, 10"


0.8 


142 




AAJOCC07 

/vvuOOO/ 


He fl.9996 


Glycoprotein (transmembrane) nmb
"ESTs, Highly similar to unnamed protein


1.52 


2.26 


125972 


AA434562 


Hs.35406 


t05 


248 


10CQQA. 


H55782 


He 570700 
naxlU/ Oo 


EST 


1 


1.95 


19R0QC 
l&OJoO 


N7Q192 


Hs 978956 


Hvnnthpttral nrnfaln PI .112929 


1 


1.35 


IID040 




He R1R35 
no.u 1 OOO 


STEAP1 (Homo sapiens BAC clone RG041D11


\ 


2.23 


127221 


Al 354332 


Hs.72365 


ESTs 


0.73 


3.27 


107/70 


AA0 10/ £1 


He 170790 
ns. \lvf£3 


OOuogoJlt lypo A, aipnd I ^ow^^uu inciofni 


0.51 


1.94 


19R1Q9 


A! 204246 




KIAA10fl5 orotein 


t8 


116 


I Zoo i u 


L0OOU0 


He 10947 
no. iuzi/ 




0.89 


0.97 


170777 
no/ / f 




He 10696 
no* iv«j£v 


Cysteine and glycine-rich protein 2


1 


1 


10QOO4 

Il09£*» 


A A004QR9 


He 9RW7 
ns»zDoo/ 


Plakophilin 3


1.3 


297 


19Q0A1 

ldt»WI 


H58873 


Hs 169902 
no. 1 ooov£ 


"Solute carrier family 2 (facilitated gl
"ATP-binding cassette, sub-family C (CFT


o'B4 


204 


129099 


H50398 


Hs.108660


0.87


1.04 


129404 


AA172055 


Hs.111128 


ESTs 


1 


1 


129466 


L42583 




'Genbank Homo sapiens keratin 6 isoform 


0.72 


1167 


129605 


S72493 


Hs.115947 


Keratin 16 (focal non-epidermolytic palm


0.92 


1.S 


129628 


U26727 


Hs.1174 


"Cyclin-dependent kinase inhibitor 2A (m


0.85 


1.93 


130023 


X13461 


Hs^39600 


Calmodulin-like 3


0.84 


1.22 


130080 


X14850 


Hs.147097 


"H2A histone family, member X" 


0.98 


1.96 


130385


AA126474


Hs.155223


stanniocalcin 2


1


1 






WO 02/086443 



PCT/US02/12476 






10WIU 


VUI3I4 


He 155491 
nS. 1 iWr* 1 




fl CO 

U.DO 


0.63 




U35835 


ue 3013R7 
ri3>wv iwvf 


"Human DNA-PK mRNA, partial cds"


1.15 


3.65 




L32866 


Hs.1578 


Baculoviral IAP repeat-containing 5 (sur


1 


1.88 


130553 


AA430032 


u q 952587 


Pituitary tumor-transforming 1


0.92 


1.95 


130577 


M35410 


Hs.162 


Insulin-like growth factor binding prote


1.17 


4.7 


■f3nco7 

\0\JULl 


L23S0R 


Hs.1695 


Matrix metalloproteinase 12 (macrophage


0.69 


405 


130800 


AA223386 


Hs. 19574 


ESTs; Weakly similar to katanin p80 subu 


1.13 


141 


130939 


AA598689 


Hs.21400 


ESTs 


0.8 


0.89 


131046 


X02530 


Hs.2248 


INTERFERON-GAMMA INDUCED PROTEIN PRECURS 0.8 


1,15 


131244 


D38076 


Hs.24763 


RAN binding protein 1


1 17 

1.10 


1.85 


131877 


J04088


Hs.156346


Topoisomerase (DNA) II alpha (170kD)


4 

1 


1 


131927 


AA461549 


Hs.34780 


"Doublecortex; lissencephaly, X-linked (


n 01 
U.ol 


0.62 


131965 


W90146 


Hs 35982 


ESTs 


n Til 
0.74 


3.27 


131978 




no .M) cot. 


KIAA0166 gene product


1 




132354 


L05187 


He 911Q11 
no. a 1 i?u 


Small proline-rich protein 1A


0.69 


1.43 


I 04340 


A/UH71C9 


ns.o lu 1 


ESTs; Highly similar to protein regulat


0.79 


A97 

4.4/ 




KKQ7RA 
N39/D4 


He 53QR 
Do. OO 30 


guanine monophosphate synthetase


1 


1.08 




ini9M 


no, o*ft o 1


"laminin gamma2 chain gene (LAMC2), exon
"Low density lipoprotein receptor-relate
"Serine (or cysteine) proteinase inhibit


1 




JO ZOO 3 


775100 
4/3I9U 


. no.3440 1 


0.89 


0.89 


132710 


W93726 


ns.334/ 0 


0.64 


441 


132758 


W52432 


He 5fi1Q5 
no. 3o ivo 


ESTs, Weakly similar to WDNM_RAT WDNM1


1.33 


2.03 


132767 


LU3I00 


He 931 R99 
ns.40 I044 


Small proline-rich protein 2B


0.83 


1 6fi 
1.00 


132816 


M74542 


Hs.675 


Aldehyde dehydrogenase 3 


0.55 


0.55 


132990 


AA45R7R1 


Hs.18387


transcription factor AP-2 alpha (activat


1 


3.53 


133070 


U69S11 


Hs.64311


"A disintegrin and metalloproteinase dom


1.10 


2 


133282 




He 9RR145 
ns.40D 143 


"SRB7 (suppressor of RNA polymerase B, y 


1 


2.7 


133317 


AA9159QQ 


Hq 7HRM 
ns.f uoov 


U6 snRNA-associated Sm-like protein LSm7


0.95 


1.42 




AA1RRRQ7 

rv\l3O09/ 


He791T7 
no.* c 13/ 


Homo sapiens mRNA: cDNA DKFZp564l1922 


1.12


2.55 


10003 1 


YJT7G7Q 
A3/3/9 


He 797 
rlo.i Zf 


H.sapiens activin beta-A subunit (exon 2 
estrogen-responsive B box protein (EBBP)
Serine (or cysteine) proteinase inhibit 


1.65 


1.76 


133832 


H03387 


He 941 ^ 


1.02 


1.39 


134032 


Z81326 


He 7R<;RQ 
no. f 0009 


1 




1041 DO 




Ue 1R1R74 
HS.10 1004 


Homo sapiens cDNA: FLJ23502 fis, clone


0.95 


1 <tt 

1. 30 


Io4<£10 


Mn44/40U 


ns.ouzuo 


Pim-2 oncogene 


1.36 


9 AR 

4.40 


134405 


R67275 


Hs.82772


"collagen, type XI, alpha 1"


0.76 


2.86 


104430 


A (WOO 


TO. 00404 


SRY (sex determining region Y)-box 4


1.89 


17R 
o./o 


134470 


X54942 


He R37RA 
no.oo f 30 


CDC28 protein kinase 2 


1.82 


4.11 


134645 


U87459 


He 1R7T7Q 
no. 10/ 0/9


"Cancer/testis antigen (NY-ESO-1, CTAG1,


0.62 


0 83 

U.00 


104/01 


flrlif 100 


He AQR9R 

ns.oyozo 


Parathyroid hormone-like hormone


1 


1 


135002 


U19147 


Hs.272484 


G antigen 6 


1 


1 


10 0040 


M97935 




AFFX control: STAT1 


0.92 


1.25 


101201 


L22524 


Hs.2256 


matrix metalloproteinase 7 (matrilysin;


2.92 


8.5 


101664 


ntOU/34 


He 191ft17 


H2A histone family; member A 


1 


4 


102025 


U03911 


He 7RQ14 


mutS (E. coli) homolog 2 (colon cancer,


0.8 


1.61 


102031 




Hs.2156 


RAR-related orphan receptor A 


1 




1(19991 

1U444 1 


U443/0 




LIM domain only 4


1 




102270 


uouzoo 


Hs.75888


phosphogluconate dehydrogenase


1.08 


1.40 


1U4009 


UO/UZz 


Ue 0CC77 
na.sOO/ / 


cyclin-dependent kinase 4


0.88 


1.04 


1U4091 


1 1A1 CCR 
U41DD0 


Ue 77/Qj4 
TO./ /434 


deoxyguanosine kinase 


1.07 


1.30 


IU3UUU 


A3 I SOD 


Ue lAfiqfifl 
no. iW9w 


enolase 2; (gamma; neuronal)


0.91 


1.49 


luooyo 




Ue HQCTft 


methionine-tRNA synthetase


0.89 


1.04 


1U3030 


Hft401333 


He 9fU1R 


Homo sapiens mRNA for for histone H2B;c 


0.91 


1 95 
1.40 


IUO/40 


AA9O097R 
Mft4p4040 


Ue Q7«14 


activating transcription factor 5


0.94 


1 AR 
1.40 


114841 


rv\434/ 44 


Hs.55408 


ESTs; Moderately similar to CALCIUM-DEPE 


0.78 


1 56 


1 O4U0 


AA9R9AQ1 


He 1RRR79 
ITS. I0D3/4 


ESTs 


1 


1 

1 


1 1 corns 


/vvWDOlO 


He R93H9 

ns.ozouz 


ESTs 


0.74 


9 **9 
4.34 


119132 


R49046 


He 1 fY7Q1 1 
no. iu/9 1 1 


ATP-binding cassette; sub-family B (MDR/ 


1.1 


1 51 


l<*JlOO 


nouDjy 


Ue 1RQR3R 
ns. 109000 


ESTs 


1 


1 


126487 


AA4R95ft i i 


He 1R4R01 
no. ivtuvi 


solute carrier family 7 (cationic amino


1.01 


1.46 


1971A1 

!£/ 141 




He 7RA7R 
ns.f 34/0 


KIAA0956 protein


0.85 


1 4 


IZOU04 


/WjUOf 04 


Ue 7C1ftt 


tyrosine 3-monooxygenase/tryptophan 5-mo


1 


1 1R 
(. IO 




AA97A3RR 
HM434003 


He 1(19^*% 
nS. IUZ430 


survival of motor neuron protein interne 


1 


1 5 


19RRW 
140093 


R37753 


Hs inRQRR 

no. IU09O3 


ESTs 


1.7 


2 


I Owl 99 


7ARR7Q 
4403/9 


He 179H9R 

ns. 1 / cvia 


a disintegrin and metalloprotease domain 


1 




I0U0Z4 


1 IROQOC 

uoyyya 


Ue 1 QQO'iA 
nS. 139104 


forkhead box E1 


1 


1 

1 


1Mrtn.fl 
IOOUUU 


U£4 1 34 


He R9A09 


p21/Cdc42/Rac1 -activated kinase 1 (yeast 


1 




1 00000 


N14D/00 


Ue 7KA7R 
nS./04cO 


secretogranin II (chromogranin C)


1 


1 


100U4/ 


AA40U4DD 


He Q3CQ7 


ESTs 


1 


1 

1 


mnnc» 

lUUUOO 


M27830 




AFFX control: 28S ribosomal RNA


0.88 


1 « 

1.30 


innni 

1UU114 


uuuooo 


He R9QR? 

ns. 04904 


UljllUUJflOiB OJ IIUK3U100 


068 


1 RR 


IUU140 




Ue R11R3 
(75. D 1 100 


proteasome (prosome; macropain) 26S subu 


i!a 


9 ni 
4.00 


1 (Villi 


1/I4D3/ 


He R1RQ9 
ns.o 1 094 


KIAA0101 gene product


0.71 


4.26 


mniRi 

lUUIDl 


niARQA 

LM4094 


He 7739Q 
ns./ /049 


phosphatidylserine synthase 1


1.02 


1.30 


IUUIOO 




He ^04 
ns.094 


adrenomedullin


0.46 


1 17 
1. 1 / 


100187 


D17793 


Ue7R1R9 

ns.r 0 too 


aldo-keto reductase family 1; member C3
minichromosome maintenance deficient (S.


1 


4 


1001 88 


D21053 


He C71M 
no. 3/ wi 


0.97 


1.4 


100217 


D26600 


He RQAd 1 ; 


proteasome (prosome; macropain) subunit; 


1.13 


1.9 


lUUZZU 


n7n7fi/ 
UZOOO*t 




"Human mRNA for annexin II, 5'UTR (seq


1.11 


1 

1.30 


100287 


D43950 


Hs.1600 


chaperonin containing TCP1; subunit 5 (e


1.13 


2.09 


100297 


D49489 


Hs.182429 


protein disulfide isomerase-related pro) 


0.92 


1.78 


100330 


D55716 


Hs.77152 


minichromosome maintenance deficient (S.


1.07 


1.61 


100355 


D78129 




"Homo sapiens mRNA for squalene epoxid


0.96 


1.87 


100364 


D78586 


Hs.154868


carbamoyl-phosphate synthetase 2; aspart


1.49 


2.46 


100368 


D79987 


Hs.153479 


extra spindle poles; S. cerevisiae; homo


0.59 


1.32 


100398 


D84557


Hs.155462


minichromosome maintenance deficient (mi


1.08 


1.9 


100438 


D87448 


Hs.91417 


topoisomerase (DNA) II binding protein


1 


2.15 






10JU.W 


D87953 


Hs.75789


N-myc downstream regulated


0.91 


148 


\WHV 1 


HG1153-HT1153 


Nucleoside Diphosphate Kinase Nm23-H2s 


0.99 


141 


i UUv) IO 


HG174-HT174 




Desmoplakin 1


1^8 


117 


100528 


HG182mi857


"Nexin, Glia-Derived"


0.68 


19 


100661 


HG2874-HT3018 


Ribosomal Protein L39 Homolog
""Eotean Alt SoBce 11*" 


1.1 


5.44 


100667 


HG2981-HT3127 


0.8 


197 


100830 


HG4074-HT4344 


Rad2 


1.01 


Z12 


101061 


K03515 


Hs.944 


glucose phosphate isomerase


0.91 


1.79 


101131 


L10838 


Hs.167460


splicing factor; arginine/serine-rich 3


1.23 


187 


101162 


L14595 


Hs.174203


solute carrier family 1 (glutamate/neutr 


1.35 


2.73 


101181 


L19686 


Hs.73798 


macrophage migration inhibitory factor ( 


1.03 


1.78 


101183 


L19779 


Hs.795 


H2A histone family; member O


0.57 


13 


101216 


L25876 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK


0.7 


2.2 


101228 


L27706 


Hs.82916 


chaperonin containing TCP1; subunit 6A (


0.99 


199 


101233 


129003 


Hs.878 


sorbitol dehydrogenase


0.82 


Z11 


101247 


L33801 


Hs.78802


glycogen synthase kinase 3 beta 


1.2 


191 


101332 


L47276




"Homo sapiens (cell line HL-6) alpha t


0.69 


Z78 


101342 


L76191 


Hs.182018 


interleukin-1 receptor-associated kinase


1.04 


184 


101396 


M15796 


Hs.78996 


proliferating cell nuclear antigen


0.95 


3.55 


101423 


M18391


Hs.89839 


EphA1 


1 


15 


101445 


M21259 


Hs.1066 


small nuclear ribonucleoprotein polypept


1.21 


196 


IU IOUO 


M27396 


Hs.75692 


asparagine synthetase


0.93 


16 


101525 


M29536 


Hs.12163 


eukaryotic translation initiation factor


l!l9 


193 


101535 


M30448 


Hs.251669 


casein kinase 2; beta polypeptide


0^96 


1.42 


101607 


M38690 


Hs.1244 


PTVO -mfinon f n941 


1.11 


125 


101624 


M55998 




iman atnha-1 rolianpn fvna t nana "\ 


1.17 


1.98 


101758 


M77836 


Hs.79217 


pyrroline-5-carboxylate reductase 1


1.77 


3.45 


101839 


M9303S 


Hs.692 


membrane component; chromosome 4; surfa


0J1 


145 


101853 


M94362 


Hs.76084 


lamin B2


o!fl4 


119 


101977 


S83364 




""nutafivfi RabS-inleracfina orate/in fcl 


0.89 


1.9 


101992 


U01038 


Hs.77597 


polo (Drosophila)-like kinase


0.66 


146 




U02680 


Hs.82643 


protein tyrosine kinase 9


1.23 


3.35 


102012 


U03057 


Hs.118400


singed (Drosophila)-like (sea urchin fas


0.85 


188 


102039 


U05861 


Hs.201967 


alrifvitptn rpritirtacp family 1* mpmhpr OA 


0.93 


2.32 


10.919*1 


U1451B 


Hs.1594 


centromere protein A (17kD)




4.28 


in.9i?n 


U15009 


Hs.1575 


pmafl nt iHoar rihnniir^fpnnrntpln m nnK/n 

ofiJaJJ JJUl/ttJaJ liUUIIUuicUpiUIBUI fnJlJfj 


0.89 


142 




U16954 


Hs.75823 


A) 1 Uiienrt nflnp from rhromftRnmft 1n 


0.8 


Z95 


102210 


U23028 


Hs.2437 


eukaryotic translation initiation factor


i!ot 


134 


102220 


U24389 


Hs.65436 


lysyl oxidase-like 1


1J5 


2.34 


102260 


U2B386 


Hs.159557


karyopherin alpha 2 (RAG cohort 1; impor


1.14 


2.69 


102330 


U35451 


Hs.77254 


chromobox homolog 1 (Drosophila HP1 beta


1.05 


17 


102423 


U44754 


Hs.179312


small nuclear RNA activating complex; po


1.14 


2.99 


102455 


U48705 


Hs.75562 


discoidin domain receptor family; member


1.05 


2.01 


102499 


U51478 


Hs.76941 


ATPase; Na+/K+ transporting; beta 3 poly 


127 


192 


102522 


U53347 


Hs.1 83556 


solute carrier family 1 (neutral amino a 
Homo sapiens enterocyto differentiati 


0.84 


131 


102590 


U62136 




1.11 


1.6 


102676 


1/72514 


Hs. 12045 


niitafivA nrnfANi 


1.04 


2.17 


102687 


U73379 


Hs.93002 


ubiquitin carrier protein E2-C


0^86 


2.28 


102704 


U76638 


Hs.54089 


BRCA1 associated RING domain 1


112 


163 


M.97A1 


U83843 




nuiiidJi ru v* i lira iijiBrsbUiiy piuicin \ 


0.9 


139 


102784 


U85658 


Hs.61796 


transcription factor AP-2 gamma (activat


0.98 


2^16 


102827 


U91327 


Hs.6456 


chaperonin containing TCP1; subunit 2 (b


0^96 


162 


102935 


X13482 


Hs.80506 


small nuclear ribonucleoprotein polypept


1.21 


4.2 


102972 


X16662 


Hs.87268 


annexin A8


1.25 


2.32 




X17620 


Hs.118638


non-metastatic cells 1; protein (NM23A)


103 


1.83 


103023 


X53793 


Hs.1 17950 


rmrlfifiinrftftnal nnhmpnltHA similar to S 


1.58 


5.44 


IUOUOD 


X54941 


Hs.77550 


CDC28 protein kinase 1


132 


3.79 


103075 


X59543 


Hs.2934 


ribonucleotide reductase M1 polypeptide


111 


2.58 


IUJIOO 


X68314 


Hs.2704 


glutathione peroxidase 2 (gastrointestin


0.75 


3.05 


103185 


X69910 


Hs.74368 


transmembrane protein (63kD); endoplasmi


101 


197 


103212 


X73874 


Hs.2393 


phosphorylase kinase; alpha 1 (muscle)


0.95 


172 


103223 


X74801 


Hs.1708 


rhanamnfn nantatninn TCP1* siibiinff 3 (a 

MiapoiuiiMi i/vutmmity ivr »i siwuhh »/ 


0.97 


1.77 




X7B416 


Hs.3155 


rswpin* alnha 

Uaorail, apiia 




1 


103262 


X78565 


Hs.204133 


hexabrachion (tenascin C; cytotactin)


123 


3.09 


103330 


X85373 


Hs.77496 


small nuclear ribonucleoprotein polypept


112 


2,25 




X90872 


Hs.75854 


SI II T1 f etiKritranefipracA 


2.85 


4.62 


IUJO/0 


X91868 


Hs.54416 


sine oculis homeobox (Drosophila) homolo




2.48 


IUJJ9 1 


X94453 


HS. 114366 


m/TTnfiftA-'wraf+inrvJjrffl m/ntriAtasa /ahit 




153 




X95586 


Hs.78596 


proteasome (prosome; macropain) subunit;


0.92 


1.53 


103437 


X98260 


Hs.82254 


hfWiha^A nhnsnhoDrotain 1 1 
ifi^jjiasv pnwwpiiwpi wimui i i 


032 


154 




X99133 


Hs.204238 


lipocalin 2 (oncogene 24p3)


0.55 


0.96 


IUOOVJO 


Z35402 


Hs.194657 


cadherin 1; E-cadherin (epithelial)


1.32 


2.51 


IUOO*rD 


Z68228 


Hs.2340 


junction plakoglobin


0.88 


1.28 


103658 


Z74615 


Hs.172928 


collagen; type I; alpha 1


1.06 


Z98 


103774 


AA092898 


Hs.92918 


PST«* Waaktv Rimitar tn R07G3 8 fC aiaoa 

LOlC, VVOOIUJ CjUIIIIOJ UJ lwl«*U i^w.wcyo 


1.88 


4.66 


104261 


AF008442 


Hs.5409 


RNA polymerase I subunit


o!87 


Z17 


104276 


C02193 


Hs.85222 


ESTs- Weaklv similar to R27090 2 fH^ani 


1.4 


249 


104289 


C16281 


Hs.75478 


KIAA0956 protein


115 


1.68 


104434 


L02870 


Hs.1640 


collagen; type VII; alpha 1 (epidermolys


1.04 


149 


104453 


M19169 


Hs.123114 


cystatin SN


0.38 


0.76 


104611 


R98280 


Hs.125845


ribulose-5-phosphate-3-epimerase


108 


2.25 


104758 


AA024661 


Hs.7010 


ESTs; Weakly similar to ACYL-COA DEHYDRO


1.14 


1.65 


105114 


M156532 


Hs.11801


adenosine A2b receptor pseudogene 


0.91 


1.38 


105132 


AA159501 


Hs.247280 


HBV associated factor 


1.08 


1.7 


105174 


AA186613 


Hs.34744 


ESTs 


0.95 


2.05 






105280 


AA232215 


Uc 1 A firm 


1U0044 


AA23030J 


Ue Rfi/R 
n5.«WD 


1ACC1& 


A A0C7071 




lujbZl 


AAZoUoOO 


nS.DO/D 


iftCCOQ 

lObbyo 


A A "7Q7101 


Ue 1 CWV) 

nS.102U2 


1AC7AC 
1057UO 


AA290767 


nS.1012o2 


1 05724 


A AOOOAOfl 

AA29209O 


Ue OOQ1A 
115.22904 


105782 


A A9CA01C 

AA350215 


U» 01 RQA 
rtS.21boO 


105799 


AA372U16 


H5*24/40 


105807 




Ue 1 caco 
his. lboby 


100031 


AA4UU/bo 


Ue OCCO 


105936 


A A ifUQM 

AA4U4ood 




106069 


AA417741 


Ue 9OQ0O 

K5.2V539 


106103 


AA421104 


Ue ioaqjI 
nS.12Uy4 


1AC1 Aft 

106140 


AA424624 


Ue AAWi 

nS.14312 


106149 


AA42400l 


Ue occoni 


106154 


AA426o04 


Ue CQOi 

ns.oay4 


1061 62 


A A/IOCGfbO 


Ue 1ABO 


106/20 


A AAOQCDO 

AA420O0Z 


Ue 011 Qft 

nS.o21?D 


10622B 




Ue 47710 


lOoolO 




Ue ORAC 

ns.youo 


1UW41 


A A/M170Q 

AA441/bB 


Ue mA'i 


106432 


A aaaqsca 


Ue 17190 

nS.lfl.Ja 


1064/4 


AA40U212 


Ue AOARA 

re.42404 


106400 


A AXC1C7C 


Ue onooo 

ris.ouzyy 


106599 


AA 457235 


Ue lOflXO 

nS. 12042 


106611 


A AACOAftJ 

AA4589D4 


Hs.2bzb/ 


106654 


A A ACFiA Ad 

AA460449 


Hs.3784 


107076 


A ACfVH AC • 

AA609145 


Lie OiiAl 

HS^1143 


107115 


a Acini no 
AAbjUlUo 


Ue 07CQ9 

nS.2/ttfo 


107129 


AA520553 


Hs.4756 


107159 


AA62io4U 


Hs.10600 


107444 


W2B391 


Hs.5181 


107481 


Woo 24 f 


Ue 07JI97 

nS.2/407 


107516 


X56597 


1 1- AQBCO 

ns.yyooo 


107529 


v4oncc 
Y120OO 


Ue caoo 


107531 


Y13936 


1 1- 47001 


107801 


A AJ14 0411 

AAQ1943J 


LL> 1704 Aft 

HS.17ol00 


107957 


A AA44QXQ 

AA031948 


Ue C7C/Q 


108565 


A Af\QCiA*i 

AA085342 


Hs.152b 


108780 


A A 4 OOCC4 

Ml 28561 


LU 417Q0Q 

HS.1 17938 


108828 


AA1 31584 


Hs.71435 


109060 


AA1 60879 


1 I- <1jI4CC4 

HS-24i5di 


109112 


AA1 69379 


Hs. 72865 


109344 


AA213696 


HS.B6559 


109412 


AA227145 


Hs^09473 


110780 


N23174 


HS.22091 


110958 


N50550 


Ll_ lied? 

Hs^4587 


111018 


N54067 


Hs.3528 


111337 


N79512 


Hs. 16607 


112305 


DE4Q01 


Ue ^ClitA* 


112401 


R61279 


Ue OT7MC 

nS.2J/6oo 


112853 


T02843 


Hs.4351 


112869 


T03313 


Hs.4747 


112992 


T23513 


Hs.7147 


113048 


T25895 


Ue IB/firiQ 

HS. ItWULRJ 


113063 


T32438 


Hs.5027 


113179 


T55182 


Hs. 152571 


113573 


T911K> 


Hs. 15990 


113811 


W44928 


Hs.4878 


114086 


Z3B266 


Hs. 12770 


114587 


AA07D827 


HS.1 80320 


114846 


AA234929 


Hs.44343 


114964 


AA243B73 


11, 01404 

Hs.82184 


115047 


AA252627 


Hs.22554 


115166 


AA258409 


LLn H 00007 

HS.1 96907 


115167 


AA258421 


IL, AU*)Q 

nS,4J72o 


115239 


A A17QCCA 

AA278550 


Hs.73291 


115278 


A A0707C7 

AA279757 


HS.674Q6 


115552 


A A JACMlfi 

AA405098 


Ue 1047D 


115875 


AA433943 


Ue A 'id AC 

KS.43946 


116004 


AA449122 


Ue 7COOC 

HS./oUob 


116121 


AA459254 


Hs.48855 . 


116129 


AA459956 


Ue XQ4C1 

HS.49163 


116190 


A A JtJinM 

AA464963 


Hs.67776 


116312 


AA490494 


1 1- ec/fto 
HS.65403 


116732 


F13779 


M. 4£cnfM% 

Hs. 165909 


11/002 




Ue /4CRC 


117950 


N51394 


Hs.75478 


117992 


N52000 


Hs.172089 


118785 


N75386 


Hs.1 11867 


119717 


W69134 


Hs.57987 


119814 


W74069 


Hs.58350 


120128 


Z38499 


Hs.91448 


120242 


Z98443 


Hs.86366 



ESTs 1

ESTs 0.72 

ESTs 1.35 

Homo sapiens mRNA; cDNA DKFZp564K0222 (f 1.23

ESTs; Weakly similar to oligodendrocyte- 0.98

Homo sapiens mRNA; cDNA DKFZp434B102 (fr 0.92

ESTs; Weakly similar to ZINC FINGER PROT 0.99

ESTs 1

ESTs 1.08

ESTs; Moderately similar to COLLAGEN ALP 0.95

ESTs; Weakly similar to tumor necrosis f 0.87

ESTs 1.14

ESTs; Weakly similar to ZINC FINGER PROT 1

ESTs 1.04 

KIAA0286 protein 1.23 

ESTs 0.83 

ESTs 0.77 

ESTs 0.74 

ESTs; Moderately similar to metargidin p 0.97

ESTs 0.99

pre-mRNA cleavage factor Im (25kD) 0.95

ESTs; Moderately similar to p!L2 hypothe 0.98

ESTs 0.95

Homo sapiens mRNA; cDNA DKFZp564C053 (fr 1

IGF-II mRNA-binding protein 2 1.4

ESTs; Moderately similar to non-function 1

ESTs; Weakly similar to brsinA [H.sapie 1.49

ESTs; Highly similar to phosphoserine am 1

ESTs; Weakly similar to fds39554 J [Rsa 1.11

ESTs; Highly similar to CGM 24 protein 1

flap structure-specific endonuclease 1 1.13

ESTs; Weakly similar to ORF YKR081C [S.c 1.05

proliferation-associated 2G4; 38kD 1.18

Homo sapiens kinesin superfamily motor K 0.99 

Marin 0.94 

nucleolar protein (KKE/D repeat) 1.05 

protein phosphatase 1G (formerly 2C); ma 1.06 

ESTs 1.03 

ESTs 0.95 

ATPase; Ca++ transporting; cardiac muscl 0.59

collagen; type XVII; alpha 1 1

DKFZP564O0463 protein 1.33

chloride channel; calcium activated; fam 0.67

ESTs 1.03

poly(A)-binding protein-like 1 0.97

ESTs; Weakly similar to REGULATOR OF MTT 0.76

solute carrier family 7 (cationic amino 0.9

signal transduction protein (SH3 contain 1.17

mitogen-activated protein kinase kinase 1.21

ESTs; Highly similar to Myosin heavy cha 1

ESTs 1

ESTs; Weakly similar to F25B5.3 [C.elega 1.24

EST 1.56

dyskeratosis congenita 1; dyskerin 1.03

ESTs 1

ESTs; Weakly similar to RNA-binding prot 1.37

ESTs 1

ESTs; Highly similar to IGF-II mRNA-bind 1.33

ESTs 0.76 

ESTs 0.79 

Homo sapiens PAC clone DJ0777O23 from 7p 0.9 

ESTs; Weakly similar to GOLGI 4-TRANSMEM 1.02

ESTs 1.32

ring finger protein 3 1.1

homeo box B5 1.01

myelin protein zero-like 1 1.05

hypothetical protein 1.52

ESTs; Weakly similar to similar to the b 0.7

ESTs; Weakly similar to BACN32G11J [D.m 1.14

ESTs 0.82

ESTs; Weakly similar to Weak similarity 1.2

ESTs; Highly similar to small zinc finge 0.96

ESTs 0.97

ESTs; Highly similar to putative ribonuc 1.08

ESTs 0.8

ESTs 1.37

ESTs 0.92

ESTs; Weakly similar to GOLIATH PROTEIN 1.15

KIAA0956 protein 1.04

Homo sapiens mRNA; cDNA DKFZp5B6B0222 (f 0.62

GLI-Kruppel family member GLI2 1

ESTs 1 

ESTs 0.78 

MKP-1 like protein tyrosine phosphatase 0.86 

ESTs 0.83 






1.4 
102 
3.56 
1.82 
1.28 
1.32 
1.41 
1 

1.78 
1.34 
2.25 
1.45 
1 

1.44 
2.11 
1.48 
2.05 
Z23 
1.99 
1.54 
109 
2.66 
1.93 
1 

2.29 

1.82 

2.78 

1.4 

1.49 

1.03 

3.63 

109 

1.9 

2.74 

1.77 

129 

1.62 

1.4 

1.46 

1.35 

7.63 

156 

1.42 

131 

1.55 

1.87 

0.95 

126 

1.85 

1.45 

1 

1.64 
1.96 
1.57 
1 

126 
1 

17 

1.47 

1.51 

1.34 

1.76 

136 

1.84 

136 

131 

152 

157 

112 

4.67 

1.98 

1.31 

1.55 

173 

1.57 

165 

1.8 

1.84 

136 

1.29 

1 

1.4 
1.77 
1.46 
101 








AA953QQ4 

AA/3/994 . 


Hs.1578


wwifne'ie Inhfhlfrtr L fciirvh/in\ 


0.74 


1.64 


1/11/34 


AA0300U4 


ns.9/ oo/ 


CCTe 


1.05 


1.93 


19119R 


AA/A/9/A 
rv\4U4/40 


no.9/uoi 


F?Te' Wpfllctv errndpr tn ^Itnflsr tn nhvtn 
co i o, f vcoiuj oiinittu u) ommgi to piiyw 


0.98 


13 


19117R 
1/10/0 


AA/RSROQ 

AA4U3039 


no. (ODcOZ 


PCTe- Mnrfpratplv ctrmlar h QrHllUXA. AND 

CO 1 5, MUQEHflclj 5U1BUD UJ QL/UJVJ(Vr AI1U 


0.91 


183 




AA41 1448 


ru>,tv03oo 


ESTs 


0^91 


159 


1917Rn 
l/l/OU 


AA/99flftfi 
AA4//D00 


ns. izhqou 


ESTs 


0,46 


0.55 


121781 


M422150 




cytochrome P450 family member predicted


1.07 


1.54 


121844 


AA425732 


Hs.98465 


gap junction protein; beta 2; 26kD (conn


0^94 


1.4 


122059 


AA431737


Hs.98749 


EST 


1.93 


233 


122338 


AA443311 


Ue QflQOfi 

ndi90990 


ESTs 


\ 


1 




AA/A1779 

AA440/ // 


M« 1RRR09 

no. I0003Z 


ESTs 


0.88 


1.39 


122591 


AA453265 


Hs.99311 


ESTs; Weakly similar to MRJ [H.sapiens]


Z28 


Z93 


122790 


AA460156 


Hs.99556


ESTs


0.88 


1.3 


123398 


AA5919AS 
fv\0£ 1 ££hJ 


(19. 1 w«M 1 H 


ESTs 


\ 


1.93 


123518 


AAfiflRSH 
aaouooo I 


R<; 170^1^ 
no. 1 1 uo io 


ESTs 


1 


1 


191R71 
1109/ 0 


AAOU3*ti I 


Hs.112712


ESTs 




1.15 


19/ftfifl 


D57317 


ns. / **00 1 




0.74 


. l!l2 


19/1fi7 
1/400/ 


IX/4UUO 


Hc QQIifi 

no^J9040 


rffeiaLloee hnmpn hnv ^ 
UIoUIHcoo Jim 1 lew UUA o 


0.67 


1.1 


19///7 


hl/RAOA 
NsoUUU 


He -MrtQvK 
nS.14U940 


Unrrm csmtonc mRMA' H1WA nKF7nWnl 1/1 /fr • 

no mo sapiens niruiA, cuiva ut\r cjjoool m i \u 


1.19 


1.7 


19C7RR. 
1/0/ 3D 


W/0430 


ns.oioo4 


A TO nmtKaea' Ux trafteftftffinn* m!fiv»Knf>rt 

A I r synOToSc, n+ ugnsponuiy, nuiwcnoiio 


0.93 


1.59 


1/3/03 


AL1R9Q79 
A10o/3f £ 


Hc R919R 


CT/ ftnrvtfatal (mnh/thlne) oWmTvmiofn 

014 oncutcuu uopnouiooi ^/wj^joujiii 


1.65 


6.76 


ROCOCO 


nv9/3U 


Hq 7fi/iW 
ns./ooov 


Hnrnn canto ne mRMA* nDWA DKT7n5S4Rl 964 /f 


0.72 


2.26 


l/39/*l 


AA59RR/Q 
AAOtOO^JJ 


He ft?inq 
rio.o^ iu9 


syndecan 1 


1.22 


225 


l/OUJ/ 


IWJOf // 


He fiflfiR 
nS.DvDO 


KIAA1119 nrntptn 


^35 


1.63 


126214 


N29455 


He 7A11R 
no./*fo id 




1.93 


3.55 


I/04 14 


M7R77fl 

rt/o/ fv 


He 99141Q 


ESTs 


1.21 


1.66 


126737 


AAAflJH'W 

/v¥*00 iOc 


He K9741 


ESTs 


1 


\ 


19fi7/1 
1/0/40 


AA17Q9*a 
AA I / 9/30 


He 1791R9 


pojy^A|-uinuLny prolan, cyiujjjdonin* i 


1.3 


2.16 


19C09R 
7/03/0 


AA170A4R 
MA J / 9040 


He A19 


CCT Q . Hinhh/ similar In IMTPi^RIM RPTA-fl 
CO 1 S, ruyiuy oUIUmu tu J Hi 1 CUTU/Y DC J A-0 


253 


28 


197/19 
I//40/ 


A A V1 17 til 
AADv 1 104 


He 17A111 
ns. 1 rUO 1 1 


hotornnononite niwiaar rihnnnHoAnmlnln 
ncttMOyciicDUo IIUOcoI lllAiliUwcujJJUUJiii 


1.57 


212 


1/0/ 10 


nu/oo/ 


Ua 00.1 QQ 
n3.99l09 


ESTsi Modoratsly similar to rccombmaSo 


1.24 


900 


1/03// 


M11R91 
MJ13/0 


He miDA7 

ns. iu iu4/ 


{mncfrintir>n farter 1 /C9A immiinnnlnhtri 

usnscnpiion laciof o \czj\ uiuiiunuyjuum 


1.08 


1.78 


1/0300 


ADUD/3 


He 9A7RRR 

ns.z4/ooo 


soenyiaiG Kuiase o 


1.23 


3.48 


19RCR/ 
1/0904 


MH/11 
Ml 1400 


He iniR5A 
no. 1U 100U 


icUllDHJHMiilly piOlcul i| Cellular 


0.87 


2^42 


128628 


■ C14037 


He 9 £ »1Q7R 
no.£0 19/0 


EST 


1.22 


1.9 


128691 


W97Q1Q 
VV//939 


He iniRIA 
no. iuooo4 


ESTs 


1.1 


1.73 


19071/ 
1/0/14 


v/nncoo 


He 17QRR1 

ns.i/9001 


Urttrift f nrvanc /Anna 9/7A1 Vkaia_filSiiBn mQ 
nOJTlQ Soptefio QOTlo tHf I/O DouMUOUHd [Tin 


0.92 


1.17 


128733 


A A 00000-3 
AA0/O393 


He 104^^0 
no. iv40oo 


ESTs 


1.34 


1.94 


19R7R1 


Vfli;i79 
AOOO// 


He inR/fit; 


email mtf^loar frHnniii^lonnrniaifi nAtunonl 

otTiall luictoaj noonucJBopTuiein poiypepi 


0.9 


1.34 


I/9U3/ 


AA/QR907 

AA490/3/ 


He 1R97Afl 


rihnenmnl nrntpln Q1 1 


2.59 


3.19 


I/9U93 


L l/03v 


He mRR91 
no. iUOOZO 


thrombospondin 2


1.04 


3.2 


129241 


AA435665 


no. iu9/ w 


CCTc WoHprsfph/ cWiilar fn HN1 fM mi/fiai 
coioi wiuueiausjj omiiucu uj nn i [mjtiuo**u 


0.95 


1.61 


129865 


M88458 


Hs11B778 

no. i i mi f u 


KDEL (Lys-Asp-Glu-Leu) endoplasmic retic


1.28 


263 


1907/17 

1/9/UO 


AM4U 1040 


He 17QQQQ 
no, 1/9999 


cots 


0.97 


1.63 


129720 


AA476582 


Re 191 ^9 

no. \L 104 


ESTs; Moderately similar to SIGNAL RECOG


1.09 


179 


190001 

1 /903U 


MOACQI 
(\/U330 


He <^fi(U<; 
nS.0Do4D 


GDP dissociation inhibitor 2


0.74 


168 


19QR0R 
1/909D 


AAnA1H91 
AAU40U/ 1 


He 1199A 


1 tnP-/^al'hai9(^pMAr tata 1-A- nabtrfnevtt 
UUr-OPi.UolatolCtiAC Oolo l t 4- gdwCtuoyU 


1 43 


4 19 
4. 19 


unnRQ 
louuoy 


AAU03030 


He 1 Aft/ 9ft 
no. 1404ZO 


lAjjidgBiii lyptt v, sipnd i 


1 17 

l.i/ 


198 


isn/nc 

IOlWUD 


noooo? 


He 1CC90R 
no. 100J90 


nti^lnar fanl/\f /an/hmtif_rfaH\iaH 9VJTV 

nuuear lacu/r { eiyt/ircw -uvi ivsq z/*un 


1 9fi 


1 70 

J./9 


13fK/1 
10U041 


AU30U0 


He 911^RA 
nS.Z I 1 304 


newoiUajneng ugni poiypepuoo ioojivj 


1 

1 




l old ay 


MslO/U 


ns.i/w/u 


ubk]uiOn camar proton 


1 07 
I.Uf 


1 fifl 
I.DD 


loUoo/ 


UU4U90 


u» once 


UDP glycosyltransferase 1


1 

1 


/ ft 

4.0 


loiuuy 


AA0C9C0R 
AAU00O9O 


nS.££l44 


C0Te» WanWtr eimHir h* WAnH_rVTA/*HPAlLIC 


n qi 

V.90 


1 rw 


lolU/o 


U/U/4U 


He 9997 
- noxZc/ 


CCAAT/enhancer binding protein (C/EBP);


\ 


1.23 


1imfl1 
101UOO 


1 IRRAR1 
UD00D1 


He 997P5 
ns.zz/oo 


yffliirna-&rTuiiuuuiyrK bqu \oadaj a i wepio 


1 I 


18 


131091 


T35341 


He 99Rfln 


FQTc Hinhtv eJmllflr tn WmnnttHvl npntt 


1.28 


198 


1111 M 
IOI 144 


1/1441/ 


He 91R9R 
no^OOZO 


PQTe* Htnhlv similar tn HfiPHfllR nratatn 
co i o, nry my ou joitii vj nor vaioo fjjuusii 


143 


206 


131148 


UWvJO 


Ue 91S7Q 
no .zoo/ a 


ESTs 


0.88 


3,38 


101104 


TUUOUO 


He lR99fift 


KBiaDn 19 


1.19 


277 


101103 


M/3/30 


He 91QRfl 

ns^090U 


eyefin B1 


0.86 


3.84 


11191Q 
101/19 


P(V\47R 
UUU4/D 


He 9A1Q<i 


cmplt tnHitPtWo f*utaVlno eiirrfamilu R (foi 
oil loll inouciuw cy toAine ouujdi imy o 


0.66 


296 


101404 


AA455896 


Hc 9600 
nS.£099 


giyptcan i 


0.99 


154 


1*31 CQ7 
101007 


LllUOO 


Hc inRO 

ns.ouoy 


ha at chnob 7nifH nmtoln OR /mftrtat1r\-7\ 

neai snocK /uku proiBin go \nvjntaut-c) 


1 
i 


1 1R 

1. 10 


111RRQ 
101009 


AA333000 


He IfiRQR 


trarve^rinfinn faMnrJIVo 'x fhaelr hplhf 
UanSCnpuon idHOi-Ulve 0 \UdalC lieiiA 


\ 


195 


10 103/ 


U0U914 


He IDTIfi 

ns.ou/oo 


KlAAni9/ nrntpin 


1.55 


239 


101/00 


AA10O3O4 


Ue 191 9C 
nS.0£l£3 


CCTe 
C9I8 


1 


1.00 


lolo^J 


AA1QCQQO 

AA 190090 


He 1RADR9 
nS.104UO£ 


PQTc MivlorfltAhi ctmtlar tn mitat'nm Rah 
CO 1 5, MOueloleiy SuTUwi ID pUtoUvo r\dO 


nRi 

u.oo 


163 


lOlOOU 


U02082 


He 11/ 

ns.oo4 


uncugene iiwi 




22 


101004 


n9Ul/4 


He 1/R1 

ns.0400 


rihnenma) nmtptn C91 

nuosuniai pujiein ozo 


1.23 


124 


loibuo 


AA/R1791 
AA40 1 / ZO 


He 1/1R 
nSrt>400 


□BieCBO In Oiin cancel ^illUUoe, iiuiimmy^ 


U.9I 


118 


01Q/C 


iVK}/009 


He 1«»19n 


mnDrafinn fartni* P. /pfHvatnr 1\ / f17 




28 


111QRR 
101300 


AAU90990 


He 1RRR 

ns.oDoo 


PQTc Uinhlv eimilar tn nhnsnhftrvlafiori 
co is, nigniy sinuiaj id piwopiiuiyiaiiuii 


n R7 

u.o/ 


1.36 


1110R/ 
107304 


W4Z0UO 


He 1TO1 


CO IS 


1 

1 


1.25 


119/101 
10/uul 


inn 977 


He I7nni 
ns.o/uuo 


u.Uu9e Uannu rat niwnffla utral nrtnfwio 
v* no-few ndJ »tjy roi ooiwuiia vntu iMMAjye 


1.12 


143 


132040 




He179flQZ 
no. i f £on 


RH1 tntprnrfinn rfrrmain death affflnlst 




1.55 


132065 


D82226 


He 911^04 
nouC 1 1 39f 


nrntosenmPi /nmenmft* IMMPHaSnl 2fiS etthil 


0.89 


1.27 


132109 


AA0390U 1 


He /nnQR 


ESTs 




105 


132112 


AA150661 


Hs.40154 


jumonji (mouse) homolog


0.99 


144 


132123 


AA447123 


Hs^50705 


ESTs 


1.06 


246 


132162 


H89551 


Hs.41241 


ESTs 


1.08 


246 


132180 


AA405569 


Hs.418 


fibroblast activation protein; alpha; se 


1.02 


4.56 


132309 


AA460917 


Hs^780 


jun D proto-oncogene


1.16 


18 


132371 


AA235448 


Hs.46677 


ESTs 


as 


126 


132618 


AA253330 


Hs.5344 


adaptor-related protein complex 1; gamma 


0,5 


1.49 


132736 


U68019 


Hs^11578 


MAD (mothers against decapentaplegic; Dr


1.21 


1.81 






WO 02/086443 PCT/US02/12476 



132771 




Up CC/fY7 

nS,w>4U/ 


phosphoserine phosphatase


1 


4 1 
l.o 


132833 




Hs.57783


eukaryotic translation initiation factor


Ul91 


1.43 


132922 


T23641 


Hs.6058 


KIAA1112 protein


1.16 


1.53 


132959 


AAUZolUo 


Hs.61472


ESTs; Weakly similar to unknown [S.cerev 


1.02 


1.88 


132994 


AA505133 


Hs.7594 


solute carrier family 2 (facilitated glu


0.72 


2.97 


133005 


C21400 


Hs.103329


KIAA0970 protein


0.88 


1.34 


133065 




Hs.172690


diacylglycerol kinase; alpha (80 kD)


0.93 


1.23 


133083 


M7ftCOO 


Hs.6455 


chaperonin containing TCP1; subunit 2 (b


1.14 


1.76 


133086 


L17131 


Hs.139800


high-mobility group (nonhistone chromoso


0.97 


1.43 


133134 


T89703 


HS.65o4o 


RNA binding motif protein 8


1.1 


1.8 


133195 


AA350744 


a r\4 4f\f\ 

.HS.1B1409 


KlAAi0rj7 protein 


2.29 


2.69 


133313 


AAi49447 


HS.7U704 


ESTs 


1.07 


1.68 


133331 


T62039 


Hs.158675


ribosomal protein L14


0.85 


1.18 


407X79. 


ni oo7n 




ArhA nuclease {nuiJoruncuonai una repa) 


ft Q1 

0.91 


1.45 




TDOOftO 

i yyjuo 


Up 707Q7 


guanine nucleotide binding protein (G pr


ft OA 

0.94 


4 CO 

1.68 


400x00 


YCOXOC 


Uk 7Xft7A 

MS. /4U7U 


keratin 13 


0.85 


1.14 


133492 


L40397 


Hs.74137 


transmembrane trafficking protein 


1.1 


1.69 






Uf 7VI04C 

MS.74J10 


desmoplakin (DPI; DPII)


ft 7 

0.7 


C 04 

6.21 


133517 


X52947 


Hs.74471


gap junction protein; alpha 1; 43kD (con


0.95 


1.3 


40oc/ft 
lod540 


D78151 


Hs.74619 


proteasome (prosome; macropain) 26S subu


ft 04 
0.91 


1.25 


133594 


L07758 


Un 4 71C OO 


nuclear phosphoprotein similar to S. cer 


0.84 


1.29 


400CO7 


1 tftOCQ7 

UU9007 


MS.7WBU 


glycyl-tRNA synthetase


1.09 


1.99 


133671 


T25747 


Hs.75471 


zinc finger protein 146 


1.02 


1.5 


133859 


U86762 


Hs.178761


26S proteasome-associated pad1 homolog


1.11 


3.33 


133865 


FD9315 


Hs.170290 


discs; large (Drosophila) homolog 5


1.84 


6.7 


133913 


W84712 


Hs.7753 


calumenin


1.15 


1.86 


133963 


L34537 


Hs.184633


transcription elongation factor B (SIII)


1.3 


1.91 


133982 


U47621 


Hs.207251 


nucleolar autoantigen (55kD) similar to


1.3 


1.99 


134100 


L07540 


Hs.171075 


replication factor C (activator 1) 5 (36


0.72 


1.65 


134110 


U41060 


Hs.79136 


UV-1 protein; estrogen regulated 


1.04 


1.62 


134156 


U15174 


Hs.79428 


BCL2/adenovirus E1B 19kD-interacting pro


1 


1.55 


134161 


U97188


Hs.79440


IGF-II mRNA-binding protein 3


0.82 


1.95 


134193 


F09570 


HS.79B0 


ESTs 


0.98 


1.48 


134367 


X54199 


Hs.82285 


phosphoribosylglycinamide formyltransfer


1 


Z8 




1 IOC4CC 


ns.82712 


fragile X mental retardation; autosomal 


1.26 


2 


134457 


D86963 


Hs.174044 


dishevelled 3 (homologous to Drosophila 


1 


1.47 


134469 


X17567


Hs.83753 


small nuclear ribonucleoprotein polypept 


0.94 


1.57 


134498 


t Jen OA 

M53180 


Hs.84131 


threonyl-tRNA synthetase


1.2 


2.64 


1345U1 


tatQ>IQ7ft 

Wo4o70 


Hs*2115oB 


eukaryotic translation initiation factor


0.84 


1.36 


134507 


M63488 


Hs.84318 


replication protein A1 (70kD) 


1.7 


2.93 


134548 


U41515 


Hs.85215 


Deleted in split-hand/split-foot 1 regio


1.46 


2.73 


134599 


X99226 


Hs.86297 


Fanconi anemia; complementation group A 


1.35 


2.22 


1346S2 


R73567 


Hs.8850 


a disintegrin and metalloproteinase doma 


0.77 


1.64 


134693 


N70361 


Hs.8854 


ESTs 


1.09 


1.82 


I OTOUO 




ns.osr 10 


spermine synthase 


n Q& 

o.yo 


1.3b 


134821 


Z34974 


Hs.198382 


plakophilin 1 (ectodermal dysplasia/skin


0.99 


1.4 


134864 


Y08999 


Hs.90370 


actin related protein 2/3 complex; subun


0.95 


1.42 


134914 


U29615 


Hs.91093 


chitinase 1 (chitotriosidase)


1.16 


1.29 


134953 


L10678 


Hs£1747 


profilin 2


0.95 


1.76 


134993 


AA282343 


Hs.9242 


purine-rich element binding protein B


0.98 


1.73 


135051 


C15324


Hs.93668 


ESTs 


1.35 


2.11 


135158 


U51711 




Human desmocollin-2 mRNA; 3' UTR


0.86 


1.16 



Table 1B shows the accession numbers for those Pkeys in Table 1A lacking UnigeneIDs. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
Accession column.

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: Genbank accession numbers 

Accessions 

BE623001 L0509S AA383604 AW96641 6 N53295 AA46021 3 AW571 51 9 AA603555 

105424 X56794 S66400X55150 W60071 AW351820 X55938 M83326 BE005289 BE070059 M83324 BE005248 BE069717 BE181648 BE069700 
AW506203 BE069721 AW382138 AW803775 BE463954 BE005334 8E005274 T27386 AA932714 AA972695 AW377728 A1632506T29056 
A1783934AW377727BE163715ALJ047291 AA279047AA523003BE008048 BE440141 W23614 BE090519 BE092193 N29181 N20358 N44153 
BE546944 T69231 AW377441 AA907406 H50799 AW051416AI420712 BE620922 AI2791 61 AA992549 W47198 BE005241 AI342696 H50700 
AI969974 A1863855 AA374490 AW1 30675 Al 950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 A167306B A1887890 AW804171 
A1675961 AW804172 AA77B841 AL048050 Al 127757 A1095558 AW204955 AW468978 W3189B AI052595 A127B771 BE464018 A1081503AI824196 
AA51321 1 AA411062 AW084376 N48752 AA703209 N35580 AW05991B AA054563 AI280942 T27619 BE621435 N66010 AW589527 AI160414 
AA283090 AA962535 H82726 W52115 W45432 W60433 AA577548 AA146714 BE150994 AA054615 AW796025 AW382768 BE555671 C00444 
AA054555 

L05424 X56794 S56400 X55150 W60071 AW351820 X55938 M83325 BE005289 BE070059 M83324 BE005248 BE069717 BE181648 BE069700 
AW60B203 BE059721 AW3B2138 AW803776 BE463954 BE005334 BE005274 T27386 AA932714 AA972695 AW37772B AI632505T29066 
AI783934 AW377727 BE163715 AL047291 AA279047 AA523003 BE008048 BE440141 W23614 BE090519 BE092193 K29181 N20358 N44153 
BE546944T69231 AW377441 AA907406 H50799 AVVK1416AM20712BE620922AJ279161M992549 W47198BE005241 AJ342696 H50700 
A1969974 A1863855 AA374490 AW130B75 A1950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 A1673068 A1867890 AW804171 
Ai675961 AW804172AA77B841 AL048050 AI127757 AJ095568 AW204965 AW468978 W31898 AI052595 AI278771 BE464018 AI081503AI824196 
AA51321 1 AA41 1062 AW084376 N48752 AA703209 N35580 AW05991 8 AA054563 AI280942 T2761 9 BE621 435 N6601 0 AW589527 AI160414 
AA283090 AA96253B H82726 W521 15 W45432 W60433 AA57754B AA14671 4 BE1 50994 AA05461 5 AW796025 AW38276B BE565671 C00444 
AA054555 

J04088NM.001067AF071747AJ011741 N85424AI042407AA218572BE296748BE083981 AL040877AW499918AW675045H17813 
BE081283 AA670403 AW504327 BE094229 AA104024 AI471482 AI970337 AA73761 6 A1827444 AW003286 AI742333 A1344044 AI765634 



Pkey CAT 

100661 23182J 
100667 26401.3 



100668 26401_3 



101332 25130.1 






A1948838 AW235336 AW172B27 AA095289 8E046383 AI734240 W1 6699 AJ660329 AI289433 M933778 AW469242 AA468838 AA806983 
AA625873 W78031 BE206307 AA550803 AI743147 A1990075 AA948274 AA129533 AI635399 M605313 AI624669 AW594319 A1221834 AI337434 
AA307706 BE550282 AI760457 AI630636 AI221 521 AW674314 AW078889 AI933732 AI686969 A) 1 86928 AW074595 AI1 27486 AL079644 
A191081 5 H1 7814 AA310903 AW137854 T19279 AA026682 AA305035 AW383390 AW383389 AW383422 AW383427 AW383395 H09977 
AA306247 AA352501 AW403639 F05421 AA224473 AA305321 H93904 AA089612AW391543AW402915AW173382AW402701 AW403113 
R94438 N731 26 H93466 AA090928 M095051 T29025 AW951071 U7277 L47276 AI375913 BE384156 W24652 M746288 AA568223 BE090591 
H93033 N57027 AA504348 AA327653 AW959913 N53767 AAB43715 AJ453437 AW263710 AI076594 AA583483 AW8731 94 AW5751 66 Al 128799 
AI803319 AL042775 AW074313 AI887722 A10322B4 AA447521 A1123B85 N29334 AI35491 1 AW090687 AA236763 AA435535 AA235910 
AA047124 AA236734AW514610 H93467 AA9S2007 AI446783 AA127259 AI61 3495 AI686720 AI587374 AA93S731 AA702453 AI859757 
AA21 6786 A1251 819 AI469227 AA806022 AI092324 N71868 AA958782 AA23891 9 AA809450 AA227220 AA765284 Al 192007 AA768810 
AA805794 AA729280 AA805238 AW768817 N71879 A1050686 AA505822 AA66B974 AI688160 BE045915 AW466315 AA731 314 AA649558 
AA834316 AW591901 AW063876 AW294770 A1300266 AI336094 AI56038Q AA72175S H09978 D20305 D291 55 AW821790 BE150864 F01675 
AI457474 AW466316 AA550969 AA630788 

100780 458 127 BE561958 BE561728 BE397612 BE514391 BE269037 BE514207 BE562381 BE514256 BE514403 BE514250 BE397832 BE269598 BE553865 
BE396881 BE560031 BE514199 BE560037 BE560454 

100630 40Q2J AC004770 W05005 AA356068 AA094281 H29358 T56781 AW875313 L37374 BE312466 BE311755 BE207106 BE293320 BE01B115 AW239090 
BE548830 AW247547 AA776062 BE397382 AA48671 3 T101 1 1 TD9340 AW498981 BE547280 AA356003 AW581 520 AW875331 AA580720 
AW875335 BE276B73 BE408229 AW188148 BE255166 BE253761 AW793727 AW373141 AW581548 AA471223 AA305950 BE263976 AA626820 
BE257409 AW360962 AA09Q655 C00312 BE312741 BE407213 AA209352 AW298199 AW248553 AW297794 AW731722 BE300586 AW731972 
AW615446BE301599AW615520AA486714AW440257AA196516AA554630AA618079AW1 92592 AW474985 AA604580AI627461 AA765440 
A1680394 All 35548 AI6B3224 AJ581 1 26 AW245096 AW1941 54 H29274 N70363 AA629758 AA580602 AA862006 A1863841 AI097667 A1928583 
AI358774 BE2434B7 AA620553 AA653297 AA292690 T101 10 Z38906 AA908544 AA340930 A1185438 T03328 T28844 AI68701 0 A1864965 
AIB72575 BE388740 T56780 AW3731 38 BE258717 AA699671 

100906 4312L1 AU076916 BE298110AW239395AW672700MUI03875U1 0850 AW651 755 B 

AA446421 AW881866 A1469428 BE548103 T96204 R94457 N7B225 AI564549 AW004984 AW780423 AW675448 AW087B90 AA971454 AA305698 
AA879433 AA535069 AI394371 AA928053 AI37B367 N59764 A1364000 AI431 285 T81090 AW674657 AW674987 AA897396 AW67341 2 BE063175 
AW674408A1202011 R00723 AI753769 AI460161 AW079585AW275744A1873729 D25791 BE537646 T81139 R00722 

100930 16865 1 J04129 Nty_002571 AA293088AM77016AA404631 T28299 AA476904AA433965AA430486AA495907AI151391 AA291495AA402723 W25651 
AA706816 AI826712 AW296294 AA293479 A1276581 AW044154 AI080180 AI4179B5 AI274188 AI474212 AA495908 AA635664 A10921 14 
A1804952 AA479874 AI597661 AI42051 1 AA479738 AA421417 AA421247 AA436220 AL047797 M34046 N42277 AA22B076 W02698 A1420297 
AA434011 AJ369971 AA479731 AI865541 AI418020 AA421246 AA452764AL048051 

102221 3861 1 NM.006769 U24576AW161961 AW1 60473 AW160465AW1 60472 AW161069 AI824831 AW1 62635 AI990356AW1 62477 AW1 62571 AI520836 
AW162352 AW162351 AW162752AI962216 AI537346 AA853902 H17667 BE045346 BE559802 BE255391 AA985217 AA235051 AI129757 
AW366451 T34489 D56106 D56351 A1936579 AW023219 AW889335 AW889120 AW889232 AWB89175 BE093702 AW889349 AA147546 
AI952998 AA912579A1143356 AW902211 R64717AW157236 AI815242 D45274AW263991 AA442920 AA1 299B5 AL035713 A1923255 AJ949082 
AI142826 A1584160 AI701987 AI67B954 AI827349 BE463635 AW628092 AW302281 AA493203 BE348856 BE536419 AW193969 AW673561 
AW592609 AI224044 H43943 AA091 912 R49632 R48353 A1568409 R48256 AM 98046 H27986 H43899 AI678759 A16B031 0 AI624220 H17052 
AA156410 N56062 AI699430 AA664S29 T09406 T10459 AA627506 AI379584 N83831 N88633 AW022651 AA971281 AA248036AI039197 
A1914689 AA973825 AL047305 AA129966 AI798369 AW264348 AI445879 AI658759 N67924 AI933507 AI216121 AI333174 T10972 AI375028 
AI186756 A1273778 AA610487 AI797946 AA853903 AA903939 AJ3385B7 AI278494 AW627595 AA904019 

101809 32963.1 MB6849 AA3152BQ NMJKW004 AA315269 BE1 42653 AA461 400 AW802)42 BE152893 AW383155 AA490688 AW1 17930 AW384563 AW384544 
AW384556 AW378307 AW378323 AW839085 AA257102 AW37B317 AW276060 AW271245 AW37B298 AW384497 AI5981 1 4 AW264544 AI018136 
AW021810 AA961504 AW086214 AW771489 AW192483 AI290266 AW19248B AW384490 AW007451 AW890835 AA554460 AA613715 
AW020066 AI783695 AI589498 AI917637 AW264471 AW384491 AI816732 AW368530 AW368521 AW35B463 AA461087 AI341438 AI970613 
AI040737 A1418400 AA947181 AA962716 AI280695 AW769275 AW023591 AI1 60977 AA055400 N71 882 AA490466 AW243772 AW316636 
A1076554 AW51 1702 N69323 H88912 AA257017 AI952506 H88913 A1912481 AA600714 BE465701 N54149 C00523 N64240 AA677120 

102590 15932.1 R61573 BE005029 X98091 AA297307 BE537267 BE566138 BE566139 F11561 BE564795 BE568776 AW064005 BE566479 BE380035 BE567012 
BE568634 BE566568 AA298060 BE566043 BE568813 BE568618 AA283070 BE565414 BE566738 BE568585 BE565667 BE5661 16 BE566433 
U62136 AF049140 BE567057 BE567297 BE567403 BE564316 BE567400 BE568854 BE566588 AA448772 AA071353 AW732642 BE564996 
AA297763 AA278550 AA421 083 AA2981 B4 AA091 007 AA384577 AA20591 6 N28759 AL031291 C15757C15761 H02728BE566410AA1 29335 
AA419499N87741 BE379669BE004824BE379611 D25874 AA148454 AA323654 AW95031 1 AA44B795 AW749423 AA773386 AA773B43 
AWD203Z7 BE348580 BE504258 BE549990 BE220200 AI573334 A/202679 AA975515 D61421 AI168683 AA102843 AW246621 A/276203 
AI074054 A1633824 A1962927 A) 148926 N50969 AI30891 1 AA41 0994 AW373025 AA148455 H02620 AA688293 AI24631 8 K22220 Ai 91 7777 
AI050943 AI0972B6 AA663794 AW368662 AW627826 AW078734 A1253060 AA749154 AAB32236 A11 92358 AW024676 AA448676 AA764891 
BE439467 AA661534 AA258061 Al 090546 AA995157 A1051 01 1 AA584421 AI026032 AW591 338 AW589563 AA776914 AW0246B4 AA421 002 
F09219 BE454500 A1383595 AA954244 AA601583 AA737304 AA1 95549 MB05778 A1055876 AA1 64942 AW0 13961 A1672608AW514211 D59441 
AW582574 AA160935 BE566501 BE564612 BE565353 BE568195 BE555447 BE568302 BE566097 BE565470 BE564249 AL036217 AW749424 
BE567494 AA102842 AA314761 AV661237 C1421 1 AA651866 AW798997 AA470605 

101977 29073 1 AF112213AL050318T24804AW248136 BE386341 BE263177W16677BE250224BE563669BE267405BE546577AV651354AV651292 

AI346903 AI539128 AI189171 S83364 AW073849 AI816760 AW073309 A1422690 AA296692 A1860301 AI805446 N77735 A1340328 BE092530 
AW028742 BE088442 AA657742 AA742438AW170086AI038920 AM32379 N36073AI936194 AA868655 AA983612A1077505 BE0B0433 
A/375014 A1126547 AI348244 AI346077 A/748952 N26915 A/753574 A/093341 AI278762 BE092517 N74204 H06158 T58149 A1129303 N58366 
AA524456 BE122661 AA542925 AI246120 A1735203 AA706829 AA877544 Al 082289 AA926687 N92840 AW249798 AA934763 AW998363 
A/128832 N25202A1240209 AW118892 N80744 R35655 AI342321 A1340141 AW878792 AI857321 H09610 W04601 AW006650AA126006 
AA553675 A1052791 AW059835 A1041 906 AA81 4658 AW002059 AA729483AI609301 AA994633 AA903651 AI459183 T95072 AW088630 
AA126112AI800091 AI5S1215 H17502 AW47OT72 AI819003 AI6B3272 AI262701 AW793140 T81787R99586AI275160AI310420AI698929 
AA1591 74 A/827958 F30305 F3O309AA806662 AI091923 AW878722 AA583430 AW571913 A/674584 AA292533AJ079471 AAS42325 M71 9050 
AW7931 72 AA305476 AW1 03745 T23459 N79525 AI784438 AA534551 AW193751 AI074360 BE281214 T32229 W25065 W01205 T63086 
AW795348 AI361287 AW795353 AW795349 AA594759 AJ400295 D1 1489 AI3706B9 AA482356 M485295 W40151 AA564661 AW300745 
AI346938 A/374975 A/423782 AW1 93899 AA612604 All 83409 AA996156 AW366963 AW386977 A/284860 AA846503 A1985064 AA844576 
M737921 AA873274 BE241 546 BE241540 AA4B4058 AW468970 AA127876 AA159120 AW001 568 AW79521 3 AW795258 AW795330 BE250589 
BE3B7572 AA910895 AA161217 BE250380 W31500 T95167 AI719306 AI359224 

102781 20812.1 BE258778 BE281230 BE410044 T33723 AW672694 AW410439 NM.006429 AF026292 T35505 BE542333 T08940 AU076737 AW247471 

BE393215 AW328640 BE542408 T32170 BE302544 T31955 BE206B98 BE275738 T32570 BE386426 BE298746 BE389937 BE293991 BE31 5289 
BE389578 R34739 R15312 BE279365 BE277756 AL036019 T33725 8E277779 8E302962 AUM7294 BE276505T09070T33673 BE312580 
AW3B7774BE257175AW674367BE253331 BE270344 BE299831 BE273576 T32062 AI751831 BE61B381 AA304B99 BE252268 U46364 
BE256790 BE207199 BE256209 BE251 941 BE250791 BE313955 BE269806 6E543623 BE279212 BE252289 T31699 BE262220 T31659 
AA315781 AA192212 N84547BE292737BE259631 AA232179 Al 1331 44 T31 292 AA31 5945 BE407301 BE2S1184 BE40900B AI880158 A1904003 
AI9041 14 AW651768 AW651763 R58247 BE271897 UB3843 C05298 BE261609 BE255973 AA351650 N84631 BE263637 AW452910 AA328465 
AA324549 AW579525 BE252296 BE257551 AL048332 8E208630AA359336 AW327897M151742 AA305816 BE076862 BE076796 BE263161 
AA3237B5 AA676588 AA626565 AA078917 W87657 R09002 R94021 AA312032 BE276665 AA295608 AW407162 AA329374 AW877912 N27885 
AA369256 AA360968 BE250476 N85427 BE265569AI278639AI81 6576 A1691 037 AW328583 AI567949 A1983455 A1927732 AI81 1297 A1571 508 
AW073674 BE296039 BE467326 A1828796 AI816578 AW511604 AI921213 AW152427 AI795787 AI801618 AW168866 AI628144 AIB90339 
AW173690AW511540BE535620AA383014BE301164A1866596AW514909M A1093488 AW57540BAW675443 

AW615636 AW732207 AW377638 AA321784 AA641629AA633105 AA527640 AW129146 AW615672 BE394607 AA483902 AW475032 BE378532 






119221 102947.1 



125831 1522905J 
128192 45743.3 
113195 1786B8J 



119861 
112973 



129402 
105936 



238266 1 
4868.1 



47367J 
260931J 



129466 2094.50 



AA872808 AI4693B8AW105268BE047301 AW591B43AW410066AW517153A1950495 M746641 AI914878 AA873185 AI696911 AA548625 
AA911505 AA148762AW674535 AI587329 BE328328 AW270348 AA158225 AW1 17705 AW474997 AW51 9193 AA6 14757 AW664383 AI082647 
AW590973 A1476711 AA192213 N88741 BE464552 AWD72679 A1453708 AA1 52166 AA805924 A15B1 078 Al 1 25768 AW173484 AI961 980 
BE300766 A11 99698 AI636792AW247333 AW272B61 AA078818 AA150012 AA551232AA678821 AW873869AW76B266AJ 66031 5 AA31 9210 
M814551AA157994AA318886AI582962AW089224A1356098AI343694^ . 
AW248898 AI751830 AA907816 R08898 AW087989 AI828300 AA148596 A1269577 T33426 AA213571 AI973201 AA666279 R49612 AI573183 
AW799762 AW410068 AW769666 A1962097 A1475204 D57490 AW517531 BE245270 AW470008 T33427 AW005731 AI795795 T23753 AW272981 
T15747 AA552875 T23644 AW361289 AI758558 BE207435 AA876958 T03361 AA883569 F37533 M582321 AW082524 R4221 2 AA973847 
T189D0 AA08S202 AJ559867 AI302418 AA9486S7 AA745670 T08939 T33724 T33722 BE621568 D57489 D25906 BE621151 F16510 C05966 
T35127 AA630427 AI933481 AA309426 A1918440 BE561854 BE618866 BE394675 BE296173 AW951687 BE383739 BE616141 BE312730 
BE535351 AW080575 BE313330 BE61 6664 AJ354390 AA847315 BE544509 BE515212 BE297833 BEZ78808 BE544844 AW090178 AI890664 
BE546708 AW189943 BE274412 BE382399 BE266392 BE254949 BE280696 BE383237 BE261756 BE257721 BE312683 BE275476 BE514880 
BE545314 BE313587 BE384537 BE386691 BE264813 AW592575A]336332AI?78541 AI795791 BE222662AW249316AA314361 AL036012 
AW402923 BE266845 AA075945 AA314436 BE384640 AW731769 AW957077 AA552234 AA573560 AW367038 AA31 3399 A1983873 BE410159 
BE263803 BE514339 BE409073 BE281296 BE543396 BE395387 BE0B8360 BE546946 BE546570 BE390626 AA074638 AA301821 AW845230 
AW582379A1949222AW029572AA515843AW272394BE250234 

C14322 W74050AI074232AA595624BE048955 A1148417 A1583145 AI473460 AIB01688 AW573593 AI950741 AI628140AW467921 R98105 
AJ149258 AJ247584 AI078378 AJ 139850 AA4B941 1 W24744 R981M AI033826 AA699589 AI033120 N55544 W8B9B4 AW970771 AA7033B2 
M099138 M706792AA046150 H98981AI916674M953018A1^^ 
AW235415 R70377 AA099236 F20703 AA524436 R69484 
H04043 D60988 D60337 
A1204246A/204250A1194050 

H83265 T63524 AA304359 AW960551 AI672874 AI749427 AA227777 AW027055 AA971834 T49644 T54122 AI983239 AI808233 T91264 T96544 
AI350945 AI7091 14 R72382 T4B788 R4B726 AW38541 8 AI095484 T49645 AA928653 AA570082 AW007545 T5717BAA516413 AA913118T57112 
AA564433 AA774503 AA367571 T59757 

W78816 A1720806 A1633854 AI632086 AI668663 N70894 AW57 1809 AI383592 A1201 348 W80715 N91880 AW963101 M339011 
AB033023 BE391905 BE275965 BE277872 BE003882 AA313774 BE019159 BE298024 BE299727 BE300011 BE390277 BE394764 N87550 
BE409419 BE408652BE408197AL119332M622427AI816265AA610118Tt)7318AA019839M634430BE205^ 
AI652711AI917645M630(M5AW191969AI817882T17271A1803663A)095533 H46019AW592438 
M639614AI925762AW088153T17455AA018640AW51475BE300241 A!816255BE391981AW40^^ 

AW875926 AW875645 AW875647 AW938037 AL138042 AW892619 BE243018 AW995454 BE2463B1 BE0090B2 BE278921 AW967842 AA262454 
H30121 

W72062 AF088057 W76255 A182721 9 AI631461 AW449295AI354957 AI913803 T62772AI222040 T62921 T63781 
Table 2A shows 504 genes down-regulated in lung tumors relative to normal lung and chronically diseased lung. Chronically diseased lung samples represent chronic
non-malignant lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 59650 probesets on the Eos/Affymetrix Hu03 GeneChip array. Gene
expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

Unigene Title: Unigene gene title



R1: 90th percentile of AI for normal lung samples divided by the 80th percentile of AI for adenocarcinoma and squamous cell carcinoma lung tumor
samples.
R2: median of AI for normal lung samples divided by the 90th percentile of AI for adenocarcinoma and squamous cell carcinoma lung tumor samples.
R3: median of AI for normal lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by
the 90th percentile of AI for adenocarcinoma and squamous cell carcinoma lung tumor samples minus the 15th percentile of AI for all normal
lung, chronically diseased lung and tumor samples.
R4: average of AI for normal lung samples divided by average AI for squamous cell carcinoma and adenocarcinoma lung tumors.
R5: median of AI for normal lung samples divided by the 90th percentile of AI for adenocarcinomas.
R6: median of AI for normal lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by the 90th
percentile of AI for adenocarcinomas minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples.
R7: average of AI for normal lung samples divided by the 90th percentile of AI for squamous cell carcinomas.
R8: median of AI for normal lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples, divided by the 90th
percentile of AI for squamous cell carcinomas minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples.
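
The ratio definitions R1 through R8 above can be illustrated with a short Python sketch. This is only an illustration under stated assumptions, not the procedure actually used to build Table 2A: the function names, the linear-interpolation percentile convention, and the pooling of adenocarcinoma and squamous cell carcinoma samples into one tumor group are hypothetical choices, since the text does not specify them.

```python
from statistics import mean, median


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    The interpolation rule is an assumption; the source does not say
    which percentile convention was used.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("cannot take a percentile of an empty sample")
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (pos - lower)


def down_regulation_ratios(normal, diseased, adeno, squamous):
    """Compute R1-R8 for one probeset from per-group lists of AI values."""
    # Assumption: both tumor types are pooled where the text names them together.
    tumor = list(adeno) + list(squamous)
    everything = list(normal) + list(diseased) + tumor
    # 15th percentile of AI over all samples, used as a background level.
    background = percentile(everything, 15)
    # Background-corrected median of the normal lung samples.
    signal = median(normal) - background
    return {
        "R1": percentile(normal, 90) / percentile(tumor, 80),
        "R2": median(normal) / percentile(tumor, 90),
        "R3": signal / (percentile(tumor, 90) - background),
        "R4": mean(normal) / mean(tumor),
        "R5": median(normal) / percentile(adeno, 90),
        "R6": signal / (percentile(adeno, 90) - background),
        "R7": mean(normal) / percentile(squamous, 90),
        "R8": signal / (percentile(squamous, 90) - background),
    }
```

A large ratio indicates higher expression in normal lung than in tumor. The background-corrected ratios (R3, R6, R8) are undefined when the tumor 90th percentile equals the 15th percentile of all samples, so a real analysis would need to guard against a zero denominator.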



Pkey 


ExAccn 


UnigeneID


100095 


Z97171 


Hs.78454 


100115 


NM_002084


Hs.336920 


100138 


U8350B 


Hs.2463 


100299 


D49493 


Hs.2171 


100306 


U86749 


Hs.80598 


100447 


NM_014767


Hs.74583 


100458 


S74019 


Hs.247979


100862 


AA005247 


Hs^85754 


100959 


AA359129 


Hs.118127 


101032 


BE206854 


Hs.46039 


101081 


AF047347 


Hs.4880 


101088 


X70697 


Hs.553 


101125 


AJ250562 


Hs.82749 


101180 


U11B74 


Hs.846 


101308 


L41390 




101330 


L43821 


Hs.80261 


101345 


NM_005795


Hs.152175


101346 


AI738616 


Hs.77348 


101397 


M26380 


Hs.180878 


101414 


NM_000065 


Hs.38069 


101435 


NM_001100


Hs.1288 


101507 


X16896 


Hs.82112 


101530 


M29874 


Hs.1360 


101537 


AI469059


Hs.184915 


101542 


NM_000102


Hs.1363 


101545 


BE246154 


Hs.154210 


101554 


BE207611 


Hs.123078


101560 


AWS58272 


Hs.83733 


101574 


M34182 


Hs.158029 


101605 


M37964 


Hs.116845 


101621 


8091804 


Hs.62661 


101660 


AA299330 


Hs.1042 


101829 


AW452398 


Hs.129763


101842 


M93221 


Hs.75182 


101961 


AW004056 


Hs.168357 


101994 


T92248 


Hs.2240 


102020 


AU077315 


Hs.154970 


102091 


BE280901 


Hs.83155 


102112 


AW025430 


Hs.155591 


102190 


AA723157 


Hs.73769 


102202 


NM_000507 


Hs.574 


102241 


NM_007351


Hs.268107 


102310 


U33839 




102397 


U41898 




102571 


U60115 


Hs.239069 


102620 


AA976427 


Hs.121513 


102636 


U67092 




102667 


U70867 


Hs.83974 


102675 


U72512 


Hs.7771 


102698 


M18667 


Hs.1867 


102727 


U79251 


Hs.99902 


102852 


V00571 


Hs.75294 


103026 


X54162 


Hs.79386 


103028 


X54380 


Hs.74094


103098 


M86361 




103117 


X6357B 


Hs^95449 


103241 


X76223 




103280 


U84722 


Hs.76206 


103360 


Y16791 


Hs.73082 



myocilin; trabecular meshwork inducible
glutathione peroxidase 3 (plasma)
angiopoietin 1
growth differentiation factor 10
transcription elongation factor A (SII);
KIAA0275 gene product
Vpre-B
Hepatocyte Growth Factor Receptor
actin; alpha; cardiac muscle
phosphoglycerate mutase 2 (muscle)
amyloid beta (A4) precursor protein-bind
solute carrier family 6 (neurotransmitte
transmembrane 4 superfamily member 2
interleukin 8 receptor; beta
'Homo sapiens core 2 beta-1,6-N-acetylgl
enhancer of filamentation 1 (cas-like do
Calcitonin receptor-like
hydroxyprostaglandin dehydrogenase 15-(N
lipoprotein lipase
complement component 8; beta polypeptide
actin; alpha 1; skeletal muscle
interleukin 1 receptor; type I
cytochrome P450; subfamily IIB (phenobar
zinc finger protein; Y-linked
cytochrome P450; subfamily XVII (steroid
EDG1; endothelial differentiation, sphin
thyroid stimulating hormone receptor
Intercellular adhesion molecule 2, exon
protein kinase; cAMP-dependent; catalyti
troponin C; slow
guanylate binding protein 1; interferon-
Sjogren syndrome antigen A1 (52kD; ribon
solute carrier family 8 (sodium/calcium
mannose receptor; C type 1
'Hs-TBX2=T-box gene {T-box region} [huma
uteroglobin
transcription factor CP2
aldehyde dehydrogenase 7
forkhead box F1
folate receptor 1 (adult)
fructose-bisphosphatase 1
Multimerin
Accession not listed in Genbank
"Human sodium cotransporter RKST1 mRNA,
"Homo sapiens skeletal muscle LIM-protei
Human clone W2-6 mRNA from chromosome X
"Human ataxia-telangiectasia locus prote
solute carrier family 21 (prostaglandin
"Human B-cell receptor associated protei
progastricsin (pepsinogen C
opioid-binding protein/cell adhesion mol
corticotropin releasing hormone
thyroid and eye muscle autoantigen D1 (6
pregnancy-zone protein
Human mRNA for T cell receptor; clone IG
parvalbumin
H.sapiens MAL gene exon 4
Cadherin 5, VE-cadherin (vascular epithe
keratin; hair; acidic; 5



R1 

40.20 



42.40



36.40 



33.20 



R2 R3 



R5 



11.00 



2.30



2.29



125.60 

34.60 
193.20 

54.86 

36.40 

70.55 



34.60 
37.60 



R6 



3.06 



4.13 



3.10 



R7 



3.46 



3.16 



3.61 



2.54 



39.40 



30.20 



5.50 
13.00 



38.20 



4.37 



3.37 



2.75 



54.60 



29.40 



7.00 



2.32 



2.45



2.32



2.40 
3.15 



3.07 



3.55 



37.40 
28.80 



6.00 



12.00 
13.00 
10.00 



3.54 



4.25 



3.38 
3.80 



6.85 

6.75 

3.98 
3.62 



3.75 



4.51 



2.47



2.16
108041 AW204712 Hs.61957
108338 AA070773
108550 AA084867

108604 AA934589 Hs.49696 

108625 AW972330 Hs.283022 

108629 AA102425 

108655 AA099960

108756 AA127221 Hs.117037

108864 AI733852 Hs.195957

108895 AL138272 Hs.62713

108921 AI568801 Hs.71721

108967 AA142989 Hs.71730



flavin containing monooxygenase 2
"H.sapiens DAT1 gene, partial, VNTR"
contactin 1

glycoprotein A repetitions predominant
"H.sapiens isoform 1 gene for L-type cal
H.sapiens XG mRNA (clone PEP6)
CGH00 protein

Hypothetical protein MGC10895; sim to SR 

ESTs 

ESTs 

dual-specificity tyrosine-(Y)-phosphoryl 
ESTs; Weakly similar to putative p150 [H 
ESTs 

ESTs; Moderately similar to TGF-BETA REC 

ESTs 

ESTs 

ESTs; Weakly similar to SliM protein[ 

ESTs 

ESTs 

ESTs; Beta-1 -adrenergic receptor 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA: FLJ21042 fis, clone C

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamily member (tetr
ESTs

homeodomain-interacting protein kinase 3

arrestin; beta 2

synaptopodin 

ESTs 

calcitonin receptor-like receptor activi

ESTs 

ESTs 

KIAA0554 protein 
ESTs 

ADP-ribosylation factor-like 2
ESTs
2.40
Homo sapiens mRNA; cDNA DKFZp564B076 (fr

novel SH2-containing protein 3

sperm associated antigen 6 

serum deprivation response (phosphatidyl 

ESTs 

ESTs 

ESTs 

ESTs 

EST 

EST 

solute carrier family 17 (sodium phospha 

latent transforming growth factor beta b 

ESTs 

ESTs 

EST 

DKFZP434K151 protein 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to M! ALU SUBFAMI 
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ESTs; Moderately similar to CaM-KII iriW 
ribosoma) protein L41 

ESTs 34.60 
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2.61 



46.80 
31.40 



39.80 
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Z25 
2.92 



3.58 



9.00 



15.00 
6.67 



2.65 



17.00 



2.63 
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3.07 
2.48 

Z5Q 



8.00 



29.00 
32J0 



35.80 

39.60 
28.80 



12.00 



9.00 



7.33 



10.00 



Z36 



3.09 



32.20 
41.80 



82.80 

32.40 
70.60 

29.40 

30.80 

37.60 

34.20 

33.60 
29.80 

34.40 



29.40 
3750 

38.40 



34.40 



41.60 
87.00 



19.00 



3.60 
3.14 



3.37 



3.95 



2.58 



13.00 



4.74 



3.14 



6.03 



10.00 



3.12 



2.79 



4.42 
2.58 

4.31 



Z66 



3.00 



11.00 



3.10 



3.17 



3.21 



4.02 
3.76 

3.68 
131545 AL137432
131821 AA017247 Hs.164577
131861 AL095858
132015 AI418006
132070 BE622641
132242 AA332697
132334 AW080704 Hs.45033
132476 AL119844 Hs.49476
132490 NM_001290 Hs.4980
132533 AI922988 Hs.172510
132598 X80031
132619 H28855
132652 N41739
132726 N52298
133028 R51604
133071 BE384932
133120 NM_003278 Hs.65424
133129 AA428580 Hs.65551
133147 AA026533 Hs.66
133151 NM_014051 Hs.94896
133213 AA903424 Hs.6786
133276 AW978439 Hs.69504
133377 AJ131245
133407 AF017987
133535 AL134030
133537 U41518
133656 BE149455
133689 NM_001872 Hs.75572
133779 T58486 Hs.222556
133978 AF035718 Hs.78061
133985 L34657 Hs.78146
134000 AW175787 Hs.334841
134111 AI372588 Hs.8022
134185 AA285136 Hs.301914
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Hs.246216 

Hs.22824 

Hs.23735 

Hs.184482 

Hs.23912 



Hs.25227 
Hs.4 

Hs.25274 
Hs.25956 



Hs.27373 

Hs.263395 

Hs.28564 

Hs.323092 

Hs.30089 

Hs.30509 

Hs.30514

Hs.30941 

Hs.3107 

Hs.31595 



Hs.33010 

Hs.184245 

Hs.3731 

Hs.38489 

Hs.42721 



Hs.530 

Hs.53447 

Hs.61250 

Hs.55606 

Hs.300842 

Hs.64313 



Hs.7239 

Hs.7308 

Hs.284180 

Hs.74602 

Hs.75415 



KIAA1102 protein
interleukin 7 receptor                    2.29
ESTs
Human cytochrome P450-IIB (hIIB3) mRNA;
ESTs; Highly similar to CGI-38 protein [
claudin 5 (transmembrane protein deleted  2.93
vasoactive intestinal peptide receptor 1 1 60. 80
oviductal glycoprotein 1; 120kD           10.00
ESTs; Weakly similar to collagen alpha 1
EST                                       9.00
decay accelerating factor for complement  87.80
annexing                                  44.60
defensin; alpha 1; myeloid-related seque  2.72
UDP-Gal:betaGlcNAc beta 1;3-galactosyltr  42.20
retinitis pigmentosa GTPase regulator     2.54
MAD (mothers against decapentaplegic; Dr  51.60
DKFZP586G1219 protein
D component of complement (adipsin)
ESTs
ESTs                                      6.00
adenomatous polyposis coli like
KIAA0758 protein
KIAA1102 protein                          20.00
aquaporM                                  41.20
ESTs; Moderately similar to KIAA0273 [H.  31.40
ESTs                                      29.60
ESTs; Weakly similar to zinc finger prot  9.00
ESTs; Weakly similar to p160 myb-binding
potassium voltage-gated channel; subfami
DKFZP586D0624 protein
ESTs
superoxide dismutase 3; extracellular     2.98
ESTs                                      3.15
ESTs                                      32.20
alcohol dehydrogenase 3 (class I); gamma
ESTs; Moderately similar to putative sev
DKFZP564D206 protein                      8.00
ESTs                                      10.00
butyrate response factor 2 (EGF-response  28.80
Homo sapiens mRNA; cDNA DKFZp564O1763 (f
ESTs; Highly similar to semaphorin VIa [  39.00
ESTs                                      11.00
ESTs; Weakly similar to dual specificity  10.00
ESTs                                      2.47
ESTs
ESTs                                      45.30
calcium channel; voltage-dependent beta 2.28
CD97 antigen
ESTs
ESTs; Moderately similar to CaM-KII inhi
ESTs £87
KIAA0633 protein
KIAA0929 protein Msx2 interacting nuclea  54.00
ESTs
ESTs
ESTs                                      2.68
lacrimal proline rich protein             4.66
Homo sapiens clone TUA8 Cri-du-chat regi  34.20
LIM binding domain 2                      2.66
ESTs                                      13.00
collagen; type IV; alpha 3 (Goodpasture
ESTs; Moderately similar to kinesin ligh
ESTs
ESTs; Weakly similar to cDNA EST yk484g1  11.43
ESTs                                      2.37
ESTs                                      2.27
tetranectin (plasminogen-binding protein  2.63
ESTs
interleukin 1 receptor-like 1             6.20
ESTs
ESTs                                      31.40
ESTs                                      9.00
SEC24 (S. cerevisiae) related gene famil  41.20
secreted frizzled-related protein 1       50.20
protocadherin 2 (cadherin-like 2)
aquaporin 1 (channel-forming integral pr
Accession not listed in Genbank           2.65
carboxypeptidase B2 (plasma)              90.80
ESTs
transcription factor 21                   2.92
platelet/endothelial cell adhesion molec
selenium binding protein 1
TU3A protein                              4.49
Homo sapiens mRNA; cDNA DKFZp586K1220 (f
ESTs; Weakly similar to CGI-69 protein [  40.80
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3.30 



3.40 



4.05 
4.09 



3.16 
4.77 
3.54 



3.14 
3.80 



6.40 



4.11 
3.50 

3.86 
3.67 

3.44 



4.03 



3.06 



40.60 



49.20 
34.80 



30.60 



3.78 



3.48 



4.02 
3.18 



3.69 



3.72 



3.05 



5.49 



3.35 



3.45 
4.05 



3.27 
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Hs.89640 
Hs.197764 
Hs.333363 
Hs.92927 



Hs.94367 

Hs.95011 

Hs.269386 

Hs.96901 

Hs.97393 



protein tyrosine phosphatase; non-recept
ESTs
angiotensin receptor 1B
carbonic anhydrase IV
angiopoietin 1 receptor; TEK tyrosine ki
thyroid transcription factor 1
ficolin (collagen/fibrinogen domain-cont
ESTs
ESTs
RNA binding motif protein 6
ESTs
syntrophin; beta 1 (dystrophin-associate
ESTs
ESTs
Human mRNA for KIAA0328 gene; partial cd
phospholipase A2; group IB (pancreas)
potassium voltage-gated channel; shaker-
elastase 2; neutrophil
EST
dopamine receptor D4
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3.76 



15.00 



3.05 
2.52 



32.20 
57.80 
31.60 



2B.80 



43.00 



37.20 
38.80 



8.00 



3.82 
4.15 
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4.31 



4.21 



3.73 



4.24 



6.42 



TABLE 2B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 2A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey    CAT number               Accessions

108447  43452_-7                 AA079126
108550  120073_1                 AA084857 AA084996
108655  127522_1                 AA099960 AA113013
102397  44371_-1                 U41898
126303  1525933_1                D78841 D78880
125810  1554054_1                H00083 R81062
103627  2615_2                   Z48513 Z48512
121366  280401_1                 AI743515 AA405617 AW276706
114609  116777_1                 AA079505 AA079537
115272  172113_1                 AW015947 AA211890 AA279425
108338  112186_1                 AA070773 AA070774
108434  114012_1                 AA078899 AA078782 AA075788
123802  genbank_AA620448         AA620448
102310  NOT_FOUND_entrez_U33839  U33839
102636  entrez_U67092            U67092
104776  genbank_AA026349         AA026349
120504  genbank_AA256837         AA256837
113502  genbank_T89130           T89130
108499  genbank_AA083103         AA083103
101308  entrez_L41390            L41390
108629  genbank_AA102425         AA102425
103098  221_215                  M86361 Z26593 X02850 D13070 AE000559 M17649 M87869 M87871 X61077 M16286 AF018169 X61079 S59351 X60142 AF043169
103241  entrez_X76223            X76223
103508  entrez_Y10141            Y10141
103575  entrez_Z26256            Z26256
9514    NOT_FOUND_entrez_W37937  W37937
1082    genbank_AA398722         AA398722
128634  AA464918_at              AA464918
105817  genbank_AA397825         AA397825
121518  genbank_AA412155         AA412155
114449  genbank_AA020736         AA020736
114648  genbank_AA101056         AA101056
121950  genbank_AA429515         AA429515
107723  genbank_AA015967         AA015967
Table 3A shows 452 genes up-regulated in chronically diseased lung relative to normal lung. Chronically diseased lung samples represent chronic non-malignant lung diseases
such as fibrosis, emphysema, and bronchitis. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each
probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: 80th percentile of AI for chronically diseased lung samples divided by the 90th percentile of AI for normal lung samples.
R2: 80th percentile of AI for chronically diseased lung samples divided by the 90th percentile of normal lung samples, squamous cell carcinomas and
adenocarcinomas
R3: 70th percentile of AI for chronically diseased lung samples minus the 15th percentile of AI for all normal lung, chronically diseased lung and tumor samples
divided by the 90th percentile of normal lung samples, squamous cell carcinomas and adenocarcinomas minus the 15th percentile of AI for all normal lung,
chronically diseased lung and tumor samples
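
The R1 to R3 definitions above reduce to percentile arithmetic on the AI values of a single probeset. The following Python sketch illustrates that arithmetic only; it is not the procedure actually used to build Table 3A. The `percentile` helper (linear interpolation between order statistics), the function name `table_3a_ratios`, its argument names, and the sample values are all assumptions introduced for illustration, since the text does not state how the percentiles were computed.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Assumption: linear interpolation between order statistics; the source
    text does not specify a percentile convention.
    """
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def table_3a_ratios(normal, diseased, tumor):
    """Compute (R1, R2, R3) for one probeset from lists of AI values.

    normal   -- AI values for normal lung samples
    diseased -- AI values for chronically diseased lung samples
    tumor    -- AI values for squamous cell carcinomas and adenocarcinomas
    """
    # 15th percentile over every sample, used as a background level in R3.
    background = percentile(normal + diseased + tumor, 15)
    normal_and_tumor_p90 = percentile(normal + tumor, 90)
    r1 = percentile(diseased, 80) / percentile(normal, 90)
    r2 = percentile(diseased, 80) / normal_and_tumor_p90
    r3 = (percentile(diseased, 70) - background) / (normal_and_tumor_p90 - background)
    return r1, r2, r3


# Made-up AI values for one hypothetical probeset.
r1, r2, r3 = table_3a_ratios(
    normal=[10, 10, 10, 10, 10],
    diseased=[100, 100, 100, 100, 100],
    tumor=[20, 20, 20, 20, 20],
)
print(r1, r2, r3)
```

Subtracting the 15th-percentile background in R3 keeps probesets with uniformly low signal from producing large ratios; a real implementation would also need to guard against a zero or negative denominator.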



Pkey    ExAccn     UnigeneID  Unigene Title                             R1      R2      R3

135423  U50531     Hs.138751  Human BRCA2 region, mRNA sequence CG030  12.40
135378  AW951818   Hs.24379   MUM2 protein                                              2.13
135346  NM_000928  Hs.992     phospholipase A2, group IB (pancreas)
135235  AW298244   Hs.293507  ESTs                                      12.40
135057  U90268     Hs.93810   cerebral cavernous malformations 1        11.67
134951  BE305081   Hs.169358  hypothetical protein                              8.00
134799  M36821     Hs.89690   GRO3 oncogene                                     8.20
134786  T29618     Hs.89640   TEK tyrosine kinase, endothelial (venous
134772  NM_000829  Hs.163697  glutamate receptor, ionotrophic, AMPA 4   29.80
134752  BE246762   Hs.89499   arachidonate 5-lipoxygenase                               1.93
134749  T28499     Hs.89485   carbonic anhydrase IV                                     2.07
134596  BE326276   Hs.8861    ESTs
134636  NM_005582  Hs.87205   lymphocyte antigen 64 (mouse) homolog, r  13.60
134627  AI018768   Hs.12482   glyceronephosphate O-acyltransferase                      1.92
134622  AW975159   Hs.293097  ESTs, Weakly similar to A55380 faciogeni                  1.92
134570  U66615     Hs.172280  SWI/SNF related, matrix associated, acti  13.20
134561  U76421     Hs.85302   adenosine deaminase, RNA-specific, B1 (h                  1.78
134468  NM_001772  Hs.83731   CD33 antigen (gp67)                               6.20
134417  NM_006416  Hs.82921   solute carrier family 35 (CMP-sialic aci
134343  D50683     Hs.82028   transforming growth factor, beta recepto
134323  BE170551   Hs.8700    deleted in liver cancer 1
134300  NM_001430  Hs.8136    endothelial PAS domain protein 1
134299  AW580939   Hs.97199   complement component C1q receptor
134253  X52075     Hs.80738   sialophorin (gpL115, leukosialin, CD43)   20.60
134162  D52059     Hs.7972    KIAA0871 protein                          11.20
133985  L34657     Hs.78146   platelet/endothelial cell adhesion molec
133978  AF035718   Hs.78061   transcription factor 21
133835  AI677897   Hs.76640   RGC32 protein
133551  AI301740   Hs.173381  dihydropyrimidinase-like 2
133633  D21262     Hs.75337   nucleolar and coiled-body phosphoprotein  15.20
133565  AW955776   Hs.313500  ESTs, Moderately similar to ALU7_HUMAN A
133548  AW946384   Hs.178112  DNA segment, single copy probe LNS-CAI/L                  1.77
133488  AA335295   Hs.74120   adipose specific 2
133478  X83703     Hs.31432   cardiac ankyrin repeat protein                            2.08
133337  AF085983   Hs.293676  ESTs                                              9.60
133200  AB037715   Hs.183639  hypothetical protein FLJ10210                             1.77
133153  AF070592   Hs.66170   HSKM-B protein                            30.60
133130  AI128606   Hs.6557    zinc finger protein 161                   22.60
133120  NM_003278  Hs.65424   tetranectin (plasminogen-binding protein
13828   AW168082   Hs.169449  protein kinase C, alpha                   11.80
132836  AB023177   Hs.29900   KIAA0960 protein
132799  W73311     Hs.169407  SAC2 (suppressor of actin mutations 2,    41.60
132742  AA025480   Hs.292812  ESTs, Weakly similar to T33468 hypotheti  40.40
132548  X12830     Hs.193400  interleukin 6 receptor                            7.20
132476  AL119844   Hs.49476   Homo sapiens clone TUA8 Cri-du-chat regi          4.76
132439  AK001942   Hs.4863    hypothetical protein DKFZp566A1524                        1.88
132240  AB018324   Hs.42676   KIAA0781 protein                          21.20
132210  NM_007203  Hs.42322   A kinase (PRKA) anchor protein 2                          1.99
132199  AL041299   Hs.165084  ESTs                                      15.20
131751  T96555     Hs.31562   ESTs                                                      1.76
131745  AI828559   Hs^1447    ESTs, Moderately similar to A46010 X-li   27.30
131694  NM_000246  Hs.3076    MHC class II transactivator                       4.00
131686  NM_012296  Hs.30687   GRB2-associated binding protein 2
131676  AI126821   Hs.30514   ESTs                                              6.20
131629  Z45794     Hs.238809  ESTs                                      21.40
131589  C18825     Hs.29191   epithelial membrane protein 2
131536  AA019201   Hs.269210  ESTs                                              9.40
131517  AB037789   Hs.263395  sema domain, transmembrane domain (TM),           3.59
131355  R52804     Hs.25956   DKFZP564D206 protein                              4.48
131253  R71802     Hs.24853   ESTs                                      15.00
131207  AF104266   Hs.24212   latrophilin                                               1.75
131156  AI472209   Hs.323117  ESTs                                                      1.84
131066  AW169287   Hs.22588   ESTs                                              3.54
131061  N64328     Hs.268744  KIAA1796 protein
131053  AA348541   Hs.296261  guanine nucleotide binding protein (G pr                  1.93
130895  AA641767   Hs.21015   hypothetical protein DKFZp564L0854 simil  16.60
130762  D84371     Hs.1898    paraoxonase 1                             12.00
TABLE 3B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 3A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

genbank_N22401
genbanxJUl 28654
genbank_R97018
tigr_HT2245 M69181 M81105 U51039
Table 4A shows 202 genes up-regulated in samples from patients treated with chemotherapy or radiotherapy. These genes were selected from 59680 probesets on the
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting
the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
Homo sapiens cDNA FLJ20653 fis, clone KA  20.20
baculoviral IAP repeat-containing 5 (sur  22.40
glutamine-fructose-6-phosphate transamin  19.60
ESTs                                      19.40
cadherin 13, H-cadherin (heart)           21.40
UDP glycosyltransferase 1 family, polype  110.00
CCAAT/enhancer binding protein (C/EBP),   25.20
chromogranin B (secretogranin 1)          40.60
E1A binding protein p300                  24.60
KIAA0548 protein                          21.00
Rho guanine nucleotide exchange factor (  33.40
replication factor C (activator 1) 4 (37  60.80
Homo sapiens cDNA: FLJ22373 fis, clone H  20.40
karyopherin alpha 3 (importin alpha 4)    29.40
ESTs                                      32.40
cydtnlania-6a                             27.40
bone morphogenetic protein 7 (osteogenic  75.60
carbonic anhydrase XII                    31.36
guanine monphosphate synthetase           32.40
Cdc42 guanine exchange factor (GEF) 9     23.40
ESTs, Weakly similar to T33468 hypotheti  61.20
phosphoserine phosphatase                 22.33
a disintegrin and metalloproteinase doma  23.50
HSKM-B protein                            30.00
twist (Drosophila) homolog (acrocephalos  23.80
SRB7 (suppressor of RNA polymerase B, ye  51.60
hypothetical protein FLJ10074             33.00
general transcription factor IIIA         82.00
secretogranin II (chromogranin C)
discs, large (Drosophila) homolog 5       69.33
serine (or cysteine) proteinase inhibito  33.20
KIAA0203 gene product                     31.60
BCL2/adenovirus E1B 19kD-interacting pro  30.60
ESTs, Moderately similar to A46010 X-li   23.40
phosphoribosylglycinamide formyltransfer  49.20
SWI/SNF related, matrix associated, acti  20.20
dual-specificity tyrosine-(Y)-phosphoryl  20.80
G antigen 7B                              37.60
hydroxysteroid (17-beta) dehydrogenase    53.40
cyclin-dependent kinase 5, regulatory su  31.60
neurotrophin 3                            28.80



TABLE 4B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 4A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey    CAT number        Accessions

123619  371681_1          AA602964 AA609200
126433  127143_1          AA325606 AA099517 N89423
126872  142696_1          AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359
                          BE011367 BE011368 BE011362 BE011215 BE011365 BE011363
106851  322947_1          AI458623 AA639708 AA485409 R22065 AA485570
118720  genbank_N73515    N73515
120515  genbank_AA258356  AA258356
117099  321871_1          H93699 H97976 H80036
101447  entrez_M21305     M21305
123130  genbank_AA487200  AA487200
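
Rows in the Pkey / CAT number / Accessions layout of Table 4B can be read into a small lookup structure, which is convenient when resolving a probeset without a UnigeneID to its Genbank accessions. The Python sketch below is only an illustration: `ClusterRow`, `parse_cluster_row` and `index_by_pkey` are hypothetical names, the two rows are typed in by hand from Table 4B, and whitespace-separated fields are an assumption about how the rows would be stored.

```python
from typing import Dict, List, NamedTuple, Tuple


class ClusterRow(NamedTuple):
    pkey: str                    # unique Eos probeset identifier
    cat_number: str              # gene cluster number
    accessions: Tuple[str, ...]  # Genbank accessions comprising the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split one whitespace-separated row into its three logical columns."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and accessions: {line!r}")
    return ClusterRow(fields[0], fields[1], tuple(fields[2:]))


def index_by_pkey(lines: List[str]) -> Dict[str, ClusterRow]:
    """Build a Pkey -> row dictionary from a list of table rows."""
    return {row.pkey: row for row in map(parse_cluster_row, lines)}


rows = [
    "118720 genbank_N73515 N73515",
    "126433 127143_1 AA325606 AA099517 N89423",
]
lookup = index_by_pkey(rows)
print(lookup["126433"].accessions)
```

A row that continues onto a second line of accessions, as the entry for Pkey 126872 does, would have to be joined into a single line before being parsed this way.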
Table 5A shows 680 genes up-regulated in squamous cell carcinoma or adenocarcinoma lung tumors relative to normal lung and chronically diseased lung. These genes were
selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average
intensity (AI), a normalized value reflecting the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: 70th percentile of AI for squamous cell carcinoma and adenocarcinoma lung tumor samples divided by the 90th percentile of AI for normal and chronically
diseased lung samples.
R2: 80th percentile of AI adenocarcinoma lung tumor samples ...
R3: 80th percentile of AI squamous cell carcinoma lung tumor samples divided by the 90th percentile of AI for normal and chronically diseased lung samples.
R4: 80th percentile of AI adenocarcinoma lung tumor samples divided by the 80th percentile of AI for squamous cell carcinoma lung tumor samples.
R5: 70th percentile of AI for squamous cell carcinoma and adenocarcinoma lung tumor samples minus the 15th percentile of AI for all normal lung, chronically
diseased lung and tumor samples divided by 90th percentile of AI for normal and chronically diseased lung samples minus the 15th percentile of AI for all
normal lung, chronically diseased lung and tumor samples
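
As a worked illustration of the tumor-versus-reference ratios defined above, the Python sketch below computes R1, R3 and R4 for one probeset. It is a sketch under stated assumptions, not the authors' implementation: the `percentile` helper uses linear interpolation between order statistics (the text does not specify a convention), and `table_5a_ratios`, its argument names, and the sample values are hypothetical.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) of values.

    Assumption: linear interpolation between order statistics.
    """
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (position - lower)


def table_5a_ratios(normal_and_diseased, squamous, adeno):
    """Compute R1, R3 and R4 for one probeset from lists of AI values.

    normal_and_diseased -- AI values for normal and chronically diseased lung
    squamous            -- AI values for squamous cell carcinoma samples
    adeno               -- AI values for adenocarcinoma samples
    """
    reference_p90 = percentile(normal_and_diseased, 90)
    squamous_p80 = percentile(squamous, 80)
    return {
        # R1: both tumor types pooled, against the non-tumor reference.
        "R1": percentile(squamous + adeno, 70) / reference_p90,
        # R3: squamous cell carcinoma alone, against the same reference.
        "R3": squamous_p80 / reference_p90,
        # R4: adenocarcinoma relative to squamous cell carcinoma.
        "R4": percentile(adeno, 80) / squamous_p80,
    }


# Made-up AI values for one hypothetical probeset.
ratios = table_5a_ratios(
    normal_and_diseased=[5, 5, 5, 5],
    squamous=[50, 50, 50, 50],
    adeno=[25, 25, 25, 25],
)
print(ratios)
```

Read this way, a large R3 combined with an R4 well below 1 would point to a probeset elevated mainly in squamous cell carcinoma, while an R4 well above 1 would point to adenocarcinoma.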



Pkey    ExAccn     UnigeneID  Unigene Title

100035                        AFFX control: GAPDH
100036                        AFFX control: GAPDH
100037                        AFFX control: GAPDH
100071  A28102                Human GABAa receptor alpha-3 subunit
100114  X02308     Hs.82952   thymidylate synthetase
100154  H60720     Hs.81892   KIAA0101 gene product
100187  D17793     Hs.78183   aldo-keto reductase family 1, member C3
100188  AW2470S0   Hs.57101   minichromosome maintenance deficient (S.
100202  BE294407   Hs.99910   phosphofructokinase, platelet
100216  AA489908   Hs.1390    proteasome (prosome, macropain) subunit,
100269  NM_001949  Hs.1189    E2F transcription factor 3
100287  AU076657   Hs.1600    chaperonin containing TCP1, subunit 5 (e
100297  AU077258   Hs.182429  protein disulfide isomerase-related prot
100330  AW410976   Hs.77152   minichromosome maintenance deficient (S.
100335  AW247529   Hs.6793    platelet-activating factor acetylhydrola
100360  W70171     Hs.75939   uridine monophosphate kinase
100372  NM_014791  Hs.184339  KIAA0175 gene product
100474  NM_000699  Hs.300280  amylase, alpha 2A; pancreatic
100486  T19006     Hs.10842   RAN, member RAS oncogene family
100491  D56165     Hs.275163  non-metastatic cells 2, protein (NM23B)
100516  D90278     Hs.11      carcinoembryonic antigen-related cell ad
100522  X51501     Hs.99949   prolactin-induced protein
100559  NM_000094  Hs.1640    collagen, type VII, alpha 1 (epidermolys
100576  X00355     Hs.37058   calcitonin/calcitonin-related polypeptid
100629  AA015693   Hs.21291   mitogen-activated protein kinase kinase
100661  BE623001   Hs.132748  Homo sapiens ribosomal protein u$s mRNA,
100877  AA353686   Hs.57813   zinc ribbon domain containing, 1
100696  D14887     Hs.121686  general transcription factor IIA, 1 (37k
100709  N26539     Hs.100469  myeloid/lymphoid or mixed-lineage leukem
100761  BE208491   Hs.295112  KlAAuoiQ gene product
100830  AC004770   Hs.4756    flap structure-specific endonuclease 1
100867  U14622                gb:Human transketolase-like protein gene
100902  M16029     Hs.287270  ret proto-oncogene (multiple endocrine n
100906  AU076916   Hs.5398    guanine monphosphate synthetase
100960  J00124     Hs.117729  keratin 14 (epidermolysis bullosa simple
101045  J05614                gb:Human proliferating cell nuclear anti
101061  NM_000175  Hs.180532  glucose phosphate isomerase
101071  L02840     Hs.84244   potassium voltage-gated channel, Shab-re
101124  L10343     Hs.112341  protease inhibitor 3, skin-derived (SKAL
        UtncDr 1              melanoma antigen, family A, 2
101181  BE262621   Hs.73798   macrophage migration inhibitory factor (
101204  L24203     Hs.82237   ataxia-telangiectasia group D-associated
101210  L29301     Hs.2353    opioid receptor, mu 1
101216  AA284165   Hs.84113   cyclin-dependent kinase inhibitor 3 (CDK
101228  AA333387   Hs.62916   chaperonin containing TCP1, subunit 6A (
101233  AI135173   Hs.678     sorbitol dehydrogenase
101273  Z11933     Hs.182505  POU domain, class 3, transcription facto
101342  U52112     Hs.182018  interleukin-1 receptor-associated kinase
101346  AI738616   Hs.77348   hydroxyprostaglandin dehydrogenase 15-(N
101369  NM_000892  Hs.1901    kallikrein B, plasma (Fletcher factor) 1
101396  BE267931   Hs.78996   proliferating cell nuclear antigen
101431  BE165269   Hs.1076    small proline-rich protein 1B (cornifin)
101448  NM_000424  Hs.195850  keratin 5 (epidermolysis bullosa simplex
101462  AL035668   Hs.73853   bone morphogenetic protein 2
101466  BE2626S0   Hs.170197  glutamic-oxaloacetic transaminase 2, mit
101484  AA053486   Hs.20315   interferon-induced protein with tetratri
101502  M26958                gb:Human parathyroid hormone-related pro
101505  AA307680   Hs.75692   asparagine synthetase
101526  NM_002197  Hs.154721  aconitase 1, soluble
101535  X57152     Hs.99853   fibrillarin
101577  M34353     Hs.1041    v-ros avian UR2 sarcoma virus oncogene h
101649  AW959908   Hs.1690    heparin-binding growth factor binding pr
101563  NM_003528  Hs.2178    H2B histone family, member Q
101664  AA436989   Hs.121017  H2A histone family, member A
101669  L24498     Hs.80409   growth arrest and DNA-damage-inducible,

R1 R2 R3 
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3.33 
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5.07 



3.10 
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7.20 



8.60 



24.80 
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3.50 

4.08 

2.53 

8.50 

3.24 
8.31 

10.50 
4.02 



54.00 

5.59 

7.00 



7.60 
10.20 

aoo 



1Z91 



6.40 



15.65 



14.20 

9.30 
20.60 



10.00 



R5 

6.76 
5.77 
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5.71 



4.52 
5.49 
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3.81 
4.50 

4.82 
3.79 

5.49 
4.17 



21.89 
12.80 



38.80 
12.00 



7.99 



5.16 

4.69 
4.19 



5.69 



7.90 
4.45 

4.17 



7.90 

4.01 

4.46 
4.65 



7.60 
chymase 1, mast cell                      4.79
bullous pemphigoid antigen 1 (230/240kD)  15.21
desmoglein 3 (pemphigus vulgaris antigen  55.50
solute carrier family 7 (cationic amino
myeloid cell nuclear differentiation ant
TTK protein kinase                        4.50
gap junction protein, beta 2, 26kD (conn  140.00
collagen, type XVII, alpha 1              2.56
mannose receptor, C type 1
midkine (neurite growth-promoting factor
nucleotide binding protein 1 (E.coli Min  7.60
aldo-keto reductase family 1, member C1
zinc finger protein 131 (clone pHZ-10)    7.40
interleukin enhancer binding factor 2, 4
sulfotransferase family, cytosolic, 2A,
centromere protein A (17kD)
laminin, beta 3 (nicein (125kD), kalinin
secreted phosphoprotein 1 (osteopontin,
JTV1 gene
proteasome (prosome, macropain) 26S subu
heterochromatin-like protein 1
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep
chromosome segregation 1 (yeast homolog)
chromobox homolog 1 (Drosophila HP1 beta
macrophage stimulating 1 (hepatocyte gro
aldehyde dehydrogenase 3 family, member
Bloom syndrome
a disintegrin and metalloproteinase doma
vascular endothelial growth factor C
RAB27A, member RAS oncogene family
enhancer of zeste (Drosophila) homolog 2
ubiquitin fusion degradation 1-like
preferentially expressed antigen in mela
melanoma antigen, family A, 9
G protein-coupled receptor
Human hbc647 mRNA sequence
CUG triplet repeat, RNA-binding protein
eyes absent (Drosophila) homolog 2
retinoblastoma-binding protein 8
ubiquitin carrier protein E2-C
forkhead box M1
gb:Homo sapiens clone 14.93 mRNA sequenc
chaperonin containing TCP1, subunit 7 (e
transcription factor AP-2 gamma (activat
Homo sapiens cDNA: FLJ21930 fis, clone H
neurotensin
ubiquitin carboxyl-terminal esterase L1
matrix metalloproteinase 3 (stromelysin
keratin 15
small nuclear ribonucleoprotein polypept
v-ski avian sarcoma viral oncogene homol
non-metastatic cells 1, protein (NM23A)
multifunctional polypeptide similar to S
matrix metalloproteinase 1 (interstitial
CDC28 protein kinase 1
matrix metalloproteinase 11 (stromelysin
NADH dehydrogenase (ubiquinone) Fe-S pro
cadherin 3, type 1, P-cadherin (placenta
glutathione peroxidase 2 (gastrointestin
transmembrane protein (63kD), endoplasmi
transforming growth factor, alpha
chaperonin containing TCP1, subunit 3 (g
alcohol dehydrogenase 7 (class IV), mu o  100.00
SMA5                                      9.80
sine oculis homeobox (Drosophila) homolo  9.71
coated vesicle membrane protein           14.00
proteasome (prosome, macropain) subunit
gb:Homo sapiens full length insert cDNA
ESTs                                      6.60
TABLE 5B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 5A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
Table 6A shows 99 genes up-regulated in nonsmokers with lung cancer relative to smokers with lung cancer. These genes were selected from 59680 probesets on the
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting
the relative level of mRNA expression.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: average of AI for samples from non-smokers with adenocarcinoma divided by the 90th percentile of AI for samples from smokers with adenocarcinoma
R2: average of AI for samples from non-smokers with squamous cell carcinoma divided by the 90th percentile of AI for samples from smokers with squamous cell
                   Hs.190150  ESTs                                      9.50
116830  H61037     Hs.70404   ESTs, Weakly similar to ALU2_HUMAN ALU    8.50
116970  AB023179   Hs.9059    KIAA0962 protein                          7.50
117178  H98675     Hs.269034  ESTs
117757  AF088019   Hs.46732   EST                                       7.50
118283  AA287747   Hs.173012  ESTs, Weakly similar to A46010 X-linked   16.50
118384  AF217525   Hs.49002   Down syndrome cell adhesion molecule
118657  AI822106   Hs.49902   ESTs
120328  AA923278   Hs.290905  ESTs, Weakly similar to protease [Rsap]
120404  AB023230   Hs.96427   KIAA1013 protein                          7.00
120524  AA261852   Hs.192905  ESTs                                      6.00
120688  AW207555   Hs.97093   Homo sapiens cDNA: FLJ23004 fis, clone L  17.92



R2 

3.64 

2.46 
12.00 
268 
2.11 



2.50 
3.94 
12.66 
2.17 

238 
295 

2.40 
5.00 



3.00 
279 
4.50 



3.82 
221 

2.65 

6.00 
4.63 
7.00 
6.00 
227 
9.00 



2.68 



250 
239 
3.50 
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121558 


A A /M1/tQ7 

AA41Z497 




gbzt95g1 2s1 Soares.testis.NHT Homo sap 




121676 


H56037 


Lie 1 arias 


ESTs 


10.00 


121936 


Al 024600 




ESTs 


15.00 


121938 


AA428559 


U, GQCIft 

HS-sOOiU 


ESTs 


14.00 


122177 


AA4357B9 


HS.90000 


EST 


8.93 


123442 


AA29S652 


HS.1 11436 


Homo sapiens cDNA FU11643 fis, done HE 


13.04 


123551 


AA608837 




gb:af03h12s1 Scares testis NHT Homo sap 


11.50 


123756 


AA609971 


HS.i iZfab 


EST 


11.00 


123861 


AA620840 




gb:afB9g01.s1 Soares.tes8s_NHT Homo sap 


6.50 


124371 


N24924 


HS.lDODi/1 


ESTs 


127477 


BE 328720 


rls./oubDi 


ESTs 




127591 


AJ 130540 


U» 4 '31000 


ESTs 




128252 


AA45t>924 


Up 400900 


ESTs 


7.00 


128426 


Ai 265784 


HS.i4oiy/ 


ESTs 




128925 


K07419 




Homo sapiens cDNA FU 1 2900 fis, done NT 




128945 




Hs.8077 


Homo sapiens mRNA; cDNA DKFZp547E184 (fr 


10.00 


129105 


At70r»4Cf\ 

AwDslOO 


US. lUOODl 


Homo sapiens brain tumor associated pro! 


15.50 


129235 


AIHRTT700D 

AW977Z3© 


u. 40A00VI 

HS.1^oUo4 


KIAA1055 protein




129506 


AB020684 


it- 4 4 047 

HS.1 MM 


KIAA0877 protein


6.50 


129595 


i irtficcn 
UOyoaO 




oviductal glycoprotein 1, 12RD (mucin 9 




130160 


A A OrtCCOQ 

AA305588 


HS.2b7o90 


UDP-Gal:betaGlcNAc beta 1,3-galactosyltr


20.00 


130340 


D82326 


Hs.239106 


solute carrier family 3 (cystine, dibasi


11.50 


131220 


AB023194 


HS.3UUB55 


KIAA0977 protein


17.50 


131430 


A I07T14 JO 

AI879148 


II. OC77f\ 

HS.ZD77U 


fatty acid binding protein 7, brain


6.10 


132114 


NMJJQ6152 


Hs.40202 


lymphoid-restricted membrane protein




132458 


AA935315 


HS.Aosba 


Homo sapiens cDNA FLJ21693 fis, clone C




132647 


NMJJ0o927 


Hs.54432 


sialyltransferase 4B (beta-galactosidase


7.50 


132655 


D49372 


Hs. 54460 


small inducible cytokine subfamily A (Cy




1326B2 


A1077500 


Hs.Myou 


serologically defined colon cancer antig




132747 


AAj4d*41 


Hs.ooyoti 


ESTs, Weakly similar to KIAA1330 protein 




132812 


R50333 


Hs 92186 

1 *4>« v^. 1 UW 


Leman coiled-coil protein




133337 


AF085983 


Hs.293676 


ESTs 




133876 


AL134906 


Hs.771 


phosphorylase, glycogen; liver (Hers dis




134119 


AW157837 


Hs.79226 


fasciculation and elongation protein zet




134464 


AA302983 


Hs.239720 


CCR4-NOT transcription complex, subunit




134542 


M14156 


Hs.85112 


insulin-like growth factor 1 (somatomedi 




135002 


AA448542 


Hs.251677 


G antigen 7B 


87.00 


135305 


AA203555 


HS.982B8 


Homo sapiens cDNA FLJ14903 fis, clone PL








Z95 



250 

4.33 
3.02 

208 
211 



4.25 
10.00 



6.15 
5.58 

253 
250 
283 
3.82 
5.00 
3.00 
206 
227 
11.50 

&50 



TABLE 6B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 6A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

108562 36375_1 AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274 

103439 35330_1 X98256 N41124 

123551 genbank_AA608837 AA608837 

123861 genbank_AA620840 AA620840 

102832 entrez_U92015 U92015 

101972 entrez_S82472 S82472 

121558 genbank_AA412497 AA412497 
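
The Pkey, CAT number and Accession columns of Table 6B lend themselves to a simple lookup keyed on Pkey. The following is a minimal sketch, assuming whitespace-separated rows with the Pkey first, the gene cluster (CAT) identifier second and the Genbank accessions after it. The function name `parse_cluster_rows` and the row layout are illustrative assumptions and are not part of the original analysis.

```python
def parse_cluster_rows(lines):
    """Map each Pkey to its CAT identifier and list of accessions.

    Assumes one row per line: Pkey, CAT identifier, then accessions,
    all separated by whitespace (an assumed layout for illustration).
    """
    table = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or incomplete rows
        pkey, cat, accessions = fields[0], fields[1], fields[2:]
        table[pkey] = {"cat": cat, "accessions": accessions}
    return table


# Two rows as they read in Table 6B once OCR damage is repaired.
rows = [
    "123551 genbank_AA608837 AA608837",
    "102832 entrez_U92015 U92015",
]
clusters = parse_cluster_rows(rows)
print(clusters["123551"]["accessions"])
```

A dictionary keyed on Pkey keeps the lookup direction the same as in the legend: start from a probeset that lacks a UnigeneID and retrieve the accessions of its cluster.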
Table 7A shows 98 genes down-regulated in non-smokers with lung cancer relative to smokers with lung cancer. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a normalized value reflecting 
the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: 90th percentile of AI for samples from smokers with adenocarcinoma divided by the average of AI for samples from non-smokers with adenocarcinoma 

R2: 90th percentile of AI for samples from smokers with squamous cell carcinoma divided by the average of AI for samples from non-smokers with squamous cell 
carcinoma 
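
The R1 and R2 definitions for Table 7A can be illustrated with a short calculation. This is only a sketch under stated assumptions: the percentile uses linear interpolation between ranked values (the text does not say which percentile convention was applied), the function names are hypothetical, and the AI values are invented for illustration.

```python
from statistics import mean


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    The interpolation rule is an assumption; the text does not say
    which percentile convention was used.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("at least one value is required")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def smoker_to_nonsmoker_ratio(smoker_ai, nonsmoker_ai):
    """90th percentile of smoker AI divided by mean non-smoker AI."""
    return percentile(smoker_ai, 90) / mean(nonsmoker_ai)


# Hypothetical AI values for one probeset in one tumor type.
smokers = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110]
non_smokers = [1, 2, 3]
ratio = smoker_to_nonsmoker_ratio(smokers, non_smokers)
print(ratio)
```

The same function would be applied once to the adenocarcinoma samples (R1) and once to the squamous cell carcinoma samples (R2).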



Pkey 


ExAccn 


UnigeneID 


Unigene Title 


R1 


R2 


100187 


D17793 


Hs.78183 


aldo-keto reductase family 1, member C3 




164.10 


100380 


082343 


Hs.18551 


neuroblastoma (nerve tissue) protein 




77.40 


100576 


X00356 


Hs.37056 


calcitonin/calcitonin-related polypeptid 


102.40 




100971 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


463.80 




101046 


K01160 




(NONE) 


672.00 




101066 


AW970254 


Hs.689 


Charcot-Leyden crystal protein 


66.00 




101175 


U82671 


Hs.36980 


melanoma antigen, family A, 2 




77.20 


101497 


W05150 


Hs.37034 


homeo box A5 


62.80 




101663 


NMJJQ3528 


Hs.2178 


H2B histone family, member Q 


78.00 




101677 


NM_000715 


Hs.1012 


complement component 4-binding protein, 


186.2) 




101745 


M88700 


Hs.150403 


dopa decarboxylase (aromatic L-amino aci 


80.08 




101941 


S77583 




gb:HERVK10/HUMMTV reverse transcriptase 


99.20 




102125 


NMJJ05456 


Hs.288215 


sialyltransferase 




103.10 


102242 


U27185 


Hs.82547 


retinoic acid receptor responder (tazaro 


67.00 




102340 


U37055 


Hs.278657 


macrophage stimulating 1 (hepatocyte gro 


71.60 




102369 


U39840 


Hs.299867 


hepatocyte nuclear factor 3, alpha 




69.70 


102457 


NM_001394 


Hs.2359 


dual specificity phosphatase 4 


153.00 




102669 


U71207 


Hs.29279 


eyes absent (Drosophila) homolog 2 




65.70 


102796 


AL079646 


Hs.107019 


symplekin; Huntingtin interacting protei 




58.80 


102829 


NM_006183 


Hs.80962 


neurotensin 




268.80 


103207 


X72790 




Qh'Huifirffl £n(tonfinoiift ro t mvi n m mRNA fnr 


70.00 




103242 


X76342 


Hs.389 


alcohol dehydrogenase 7 (class IV), mu o 




212.10 


103260 


X78416 


Hs.31K> 


casein, alpha 




130.70 


103351 


X89211 




gb:H.sapiens DNA for endogenous retrovir 


64.60 




104212 


AB 002298 


Hs.173035 


KIAA0300 protein 


66.80 




104252 


AF002246 


Hs.210863 


cell adhesion molecule with homology to 


63.60 




104258 


AF007216 


Hs.5462 


solute carrier family 4, sodium bicarbon 


94.40 




105024 


AA126311 


Hs.9879 


ESTs 


68.20 




106250 


AIG97144 


Hs.5250 


ESTs, Weakly similar to ALU INHUMAN ALU 5 




74.60 


106440 


AA449563 


Hs.151393 


glutamate-cysteine ligase, catalytic sub 




71.10 


106566 


BE298210 




ab-60111801 6F1 NIH MGC 17 Homo saniens c 


73.20 




106605 


AW772298 


Hs.21 103 


Homo sapiens mRNA; cDNA DKFZp564B076 (fr 


83.80 




106614 


AA648459 


Hs.335951 


hvDdthetical nmtein AF301 077 




62.30 


106654 


AW075485 


Hs.286049 


phosphosGnno dininotrctnsf&rsss 




202.40 


106999 


H93281 


Hs.10710 


hypothetical protein FLJ20417 




89.60 


108700 


AA1 21518 


Hs. 193540 


ESTs, Moderately similar to 2109260A B c 




66.40 


108810 


AW295647 


Hs7l331 


hypothetical protein MGC5350 




95.50 


108857 


AK001468 


Hs 62180 


anidin ( nmjyinWta Scram hnmrJiYil act 




63.40 


109597 


AA9B93S2 


Hs 293780 


ESTs 


85.00 




109691 


T65568 


Hs. 12860 


ESTs 




58.70 


109704 


AI743880 


Hs.12876 


ESTs 




§0*60 


110942 


R63503 


Hs.28419 


ESTs 


76.40 




111722 


R23924 


Hs.23596 


EST 


74.60 




112891 


T03927 


Hs^93147 


ESTs, Moderately similar to A46010 X-li 


64.80 




112992 


All 57425 


Hs.133315 


Homo sapiens mRNA; cDNA DKFZp761J1324 (f 




76.70 


113073 


N39342 


Hs. 103042 


microtubule-associated protein 1B 




120.20 


114251 


H15261 


Hs.21943 


ESTs 


127.20 




115230 


AA27B300 


Hs. 124292 


Homo sapiens cDNA: FLJ23123 fis, clone L 


174.00 




115291 


BE545072 


Hs.1 22579 


hypothetical protein FLJ10461 




91.00 


115815 


AW905328 


Hs. 180842 


nbosonofll protein l»t 3 


66.40 




115909 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP1_HUMAN DEATH 




226.60 


115965 


AA001732 


Hs.173233 


hypothetical protein FLJ10970 


82.80 




116107 


AL133916 


Hs.1 72572 


hypothetical protein FU20093 




361.60 


116552 


D20508 


Hs.164649 


hypothetical protein DKFZp434H247 


69.00 




116571 


D45652 




gb:HUMGS02848 Human adult lung J direct 


64.20 




118466 


N66741 




gb:yz33g08.s1 Morton Fetal Cochlea Homo 




63.50 


120484 


AA253170 


Hs.96473 


EST 


81.60 




120983 


AA398209 


Hs.97587 


EST 




81.10 


121034 


AL389951 


Hs.271623 


nucleoporin 50kD 




66.20 


121423 


AW973352 


Hs.290585 


ESTs 


64.40 




122553 


AA451884 


Hs.190121 


ESTs 




60.40 


122945 


AJ718702 


Hs.308026 


major histocompatibility complex, class 


186.60 




123130 


AA487200 




gfrab19f02*1 Stratagene lung (937210) H 




80.20 


124472 


N52517 


Hs.102670 


EST 


71.00 




124526 


N62096 


Hs.293165 


ESTs, Weakly similar to JC7328 amino aci 




104.90 


125489 


H49193 


Hs.124984 


ESTs, Moderately similar to ALU7_HUMAN A 




72.00 


125731 


R61771 


Hs.26912 


ESTs 




69.90 


125747 


NM_002884 


Hs.865 


RAP1A, member of RAS oncogene family 


69.00 




126020 


H79863 


Hs.114243 


ESTs 




62.40 


126547 


U47732 


Hs.84072 


transmembrane 4 superfamily member 3 




61.80 


126966 


R38438 


Hs.182575 


solute carrier family 15 (H+/peptide tra 




60.10 
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127472 


M761378 


U. 407ni7 


ESTs 


7ft Oft 




127610 


AA950Bo7 


nS.19U2ri 


ts I s, Htgniy srmnar to unnamed protein 


D4.UU 




127742 


AVV2so490 


tie irm^r 

US. 1 Owl 00 


CCTe 
CO IS 


oc on 




127987 


AJ022103 


HS.lz4oll 


ESTs 


SJb.oU 




126233 


AWBB9132 


II. 4 4 (MR 


ribokinasB 




7B Oft 


128420 


AA650274 


HS.4 1 £20 


fibronecbn leucine rich transmembrane p 




ins on 


17G7RC 

1*0/00 


AW10v4O* 


Wo OQAARfl 


craniofacial development protein 1 


fie on 




12SH314 


AIA/Q7C4 07 
AVVyODloY 


U. 17ft1R0 

nS.l/U10i 


imaa loo/ proiBin 




Rfl R9 
00.00 


129215 


ADft>lftQ7n 


Up itc/ipe 
nS. liouoo 


I/1AA1AQ7 nmlsln 

rwv\14yi protein 


ra on 

04 .iU 






ns/o/o 


ns. i Jtoau 


2inc linyoi pioisin oo ((sua i 0/ 


63.80 




17rt7Jic 
louooo 


AVVUO/OUU 


He 1^777 

ns. ioomj 


sianniocatcui & 




139.60 


17JV7W 


AVVOSUnOf 




caun&nii 10, n-vdunenii ^ncanj 




64.60 


Iglu40 




UefilRQ 

ns.o i a? 


Kl AA14R7 nmieln 


64.40 




1717A1 
101*41 


ppcni qi a 

DtOUlS 11 


He OAR cm 


Hnmn canto nQ rPlMA PI 11 1 RAH fie rlnno HP 


76.20 




17177K 


AQmARAfi 


He ^1 Q01 


IflAAftRAR nrnWn 


97.80 




1777 Aft 


AoUlOOZ** 


He A0R7R 


WlAAfT7ft1 nmlein 
fWvW/OI p/Dtcin 




71.00 


1770 EC 

lOlOOO 


mm nniAAP 


He 5R9R7 

ns.oooo/ 


grypican *» 




88.40 


132977 


A ftnO7770 


Llo Oftl/ftA 

MS.0U14U4 


DMA htnrfin/i mnfif nrntom 7. 

KriA Dinning rnoui protein o 


1» on 
lOO.AI 








nS.lUUlo 


soiuie camBr larrnry m (jjuospnaie Don 




59.30 


133818 


A11 10884 


Hs.7645 


fibrinogen, B beta polypeptide 


341.00 




134264 


AF149297 


Hs.8087 


NAG-5 protein 




64.30 


134265 


M83772 


Hs.80876 


flavin containing monooxygenase 3 




232.53 


134346 


X84002 


Hs.82037 


TATA box binding protein (TBP)-associate 


66.00 




134395 


AA456539 


Hs.8262 


lysosomal-associated membrane protein 2 




75.80 


135047 


AL134197 


Hs.93597 


cyclin-dependent kinase 5, regulatory su 




108.30 


135056 


N7576S 


Hs.93765 


lipoma HMGIC fusion partner 


71.40 




135309 


A1564123 


Hs.42500 


ADP-ribosylation factor-like 5 


70.40 





TABLE 7B shows the accession numbers for those primekeys lacking UnigeneIDs for Table 7A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

103207 30635_-4 X72790 

106566 120358_1 BE298210 AI672315 AW086489 BE298417 AA455921 AA902537 BE327124 R14963 M0&210 AW274273 A138^ 

AI885095 AI476470 AI267650 AI885299 AI985381 AW592624 AW34U136 AI266556 AA456390 AI310815 AA484951 

116571 genbank_D45652 D45652 

118466 genbank_N66741 N66741 

101046 entrez_K01160 K01160 

101941 entrez_S77583 S77583 

103351 entrez_X89211 X89211 

123130 genbank_AA487200 AA487200 
Table 8A shows 1720 genes either up- or down-regulated in lung tumors or chronically diseased lung relative to a broad collection of over 40 distinct normal body tissues. 
Chronically diseased lung samples represent chronic non-malignant lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 39494 
probesets on the Eos/Affymetrix Hu02 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (AI), a 
normalized value reflecting the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 

R1: 70th percentile of AI for lung tumors divided by 90th percentile of AI for normal lung 

R2: 70th percentile of AI for chronically diseased lung divided by 90th percentile of AI for normal lung 
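
The percentile-over-percentile ratios defined for Table 8A can be sketched with the Python standard library. The sketch below assumes the "inclusive" interpolation method of `statistics.quantiles`, since the text does not specify a percentile convention. The helper names and the AI values are hypothetical.

```python
from statistics import quantiles


def nth_percentile(values, n):
    """n-th percentile (1-99) via statistics.quantiles, inclusive method."""
    return quantiles(values, n=100, method="inclusive")[n - 1]


def expression_ratio(disease_ai, normal_ai):
    """70th percentile of disease AI over 90th percentile of normal-lung AI."""
    return nth_percentile(disease_ai, 70) / nth_percentile(normal_ai, 90)


# Hypothetical AI values for one probeset.
tumor_ai = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
normal_lung_ai = [0, 0, 0, 0, 0, 0, 0, 0, 1, 2, 3]
r1 = expression_ratio(tumor_ai, normal_lung_ai)
print(r1)
```

Calling `expression_ratio` with the chronically diseased lung samples in place of the tumor samples would give R2. Taking a lower percentile in the numerator than in the denominator makes the ratio conservative: a high value requires that a sizeable share of disease samples exceed nearly all normal samples.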



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


300097 


AI916973 


1 1- ft* AfiAA 

Hs.213o03 


ESTs 


C AC 


4.09 


300117 


A IAN omoT 

AW1 89787 


Hs,1 47474 


ESTs 


n 

U.00 


U.DD 


300197 


AI6B6661 


Li* 04 OOQC 

HS.2 18286 


ESTs 


A Oft 


C AA 
0.44 




AJOOOOUO 




rrMaQfVnfi * 1 MH CRAP RmOft kfamn canton 

gunayucuoju Nu„t»oAr_Hmzu nomo sapien 




0.83 


300225 


A1989963 


Hs.1 97505 


ESTS 


1 Aft 
1.00 


l.rO 


300247 


AW2746B2 


US. 101334 


ESTs 


l.VO 




300256 


A1469Q95 


H5.29o24t 


Transmembrane protease, serine 3 


U.00 


1 Oft 


300337 


AI707881 


Li., on on on 
Hs.202090 


colS 


C OA 

o.ou 


Q AQ 


300362 


Z42308 




gb:HSC0FB121 normalized ntfant brain cUN 


4.10 


iz.ro 


300374 


AI859947 


Hs.314158 


ESTs 




4.00 


onnoo7 


AWZrOloO 


ll_ OCJC4G 

nS./D40 1 0 


co is 


1 50 


2.53 


300440 


A1421541 


Hs.1 461 64 


ESTs 


0.30 


o.zo 


300441 


R10367 


Hs.307921 


EST, Weakly similar to Z232_HUMArv ZINC F 


iJ.JO 


c on 
o.ou 


300449 


AI362967 


Hs.1 32221 


hypothetical protein FLJ12401 


0.4o 




300469 


AW135B30 


Hs.233955 


hypothetical protein FLJ20401 


0.1O 


0.83 


300552 


X85711 


LI.. 14 OIQ 

Hs.21838 


u,__u,_i;__i BW i ' ri 14440.4 

nypotneocaj protein rumyi 


A 1ft 


Q75 


300627 


W27363 




gb:ao37Qu1.n Stratagene Hela cell s3 93 


4.0U 


1^.00 


300630 


AW1 16822 


Hs.1 28757 


ESTs 


2.91 


0.00 


300716 


AI216113 


Hs.1 26280 


hypothetical protein FLJ23393 


4 nn 
1.00 


ft QO 

o.yz 


300738 


A1623332 


Hs.1 30541 


KIAA1542 protein 


l.oZ 


4 74 


300777 


AA235361 


Hs, 96840 


KIAA1527 protein 


4.48 


0.22 


300790 


k i inn 

AI492471 


Hs.1 88270 


ESTs 




4 40 

l.lo 


300832 


A1688147 


Hs.220815 


ESTs, Weakly similar to T03829 transcrip 


5.51 


8.56 


300836 


Z44942 


Hs^2958 


calcium channel alpha2-delta3 subunit 


4.90 


6.34 


300838 


A1582897 


Hs.1 92570 


hypothetical protein FLJ22028 


1.70 


n Q4 
2.81 


300878 


AW449802 


Hs.285901 


Homo sapiens cDNA FLJ20428 fis, clone KA 


4.56 


7 04 

/.yi 


300897 


A1890356 


Hs.1 27804 


nT_ i Al -.11 S i T4 7AjA4 1 n 

ESTs, Weawy similar to T17233 hypothec 


Z23 


1.58 


300926 


AA504860 




gb:ab03a10.s1 Stratagene fetal retina 93 


2.13 


3.50 


300960 


AI041019 


Hs.1 52454 


ESTs 


2.74 


4.46 


300961 


AW204069 


Hs.312716 


ESTs, Weakly similar to unnamed protein 


1.00 


1.00 


300962 


AA593373 


Hs.293744 


ESTs 


1.46 


1.51 


300967 


AA565209 


Hs-269439 


ESTs 


0.39 


1.30 


300987 


AW450840 


Hs.1 48590 


111 ^t.l t 1- L. A rOAAQ Jk£ 4 fill AA 

ESTs, Weakly similar to AF208846 1 BM-00 


1.49 


1.08 


300988 


AI927208 


Hs.208952 


ESTs 


0.16 


0.37 


301050 


AW1 36973 


Hs.288516 


cSTs, weawy similar to oosoyu mitogen I 
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gb:ak72b06.s1 Barstead spleen HPLRB2 Horn 
gb3k84a08.s1 Barstead spleen HPLRB2 Horn 
ribosomal protein, large, PO 
gb:oh63h08^1 NCI_CGAPJCd5 Homo sapiens 
gb:nx21hOZs1 NCI_CGAP_GC3 Homo sapiens 
gb:am08b07^1 Soams_NFU_T.GBC_S1 Homosl.56 
ribosomal protein S6 Wnase, 90kD r polyp 
EST 

gb:olc03g03^1 Soares_NFU_T„GBC_S1 Homos 
TABLE 8B shows the accession numbers for those Pkeys in Table 8A lacking UnigeneIDs. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
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AW340926 AA249063 N86075 
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R93901AF075073 R93902 
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H69434AF0B5958H69846 
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306897 
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AI093967 


323155 979809.1 AL120701 AL135041 AL121524 

322527 38927.1 AF147359 T58511 T58560 

322585 473768.2 W88919 W89125 

300362 1574395.1 Z42308 H23514 

322635 82296.1 AA005129 AA679084 AA694399 

322664 85042.1 AA011522 M702841 AA011691 AA330797 

315454 360580.1 AI239464 AI239473 AA625812 AI208703 

322687 37372J AF074666 AI110759 AF090902 

314852 327472.1 AI903735 AA491283 AI694953 AW976903 AA761362 

307783 697809.1 AI347274 AW844024 

324072 269032.1 AA381722 AA381829 AW963906 AW963902 AA381242 

300627 221345.1 AA488472 W27363 AA317053 BE082689 AW967036 BE079872 

323505 196389.1 AW970512 AA280251 AI652287 BE486438 AI650725 AA551854 AA281574 AW571481 

315791 403558.1 AA678177 AA577034 

324303 233842.1 AL118754 AA333202 H38001 

316519 442885.1 AA847835 AA768376 

300926 333127.1 AA504860 AA504911 
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324580 
301862 
324804 
324889 
302597 
302711 
302742 
318499 
310624 
302847 
304122 
303598 
311409 
312094 
319312 
319407 
319425 
320007 
320018 
319484 
318865 
312220 
319546 
312389 
319611 
312437 



311896 
319834 
321102 
321158 
321199 

305528 
321270 
314126 
320714 
305442 
306446 
306458 
306510 
305557 
306572 
306582 
306656 
306686 
306751 
308011 
306892 
308106 
308154 
306956 
308958 
308213 
308216 
308219 



308643 
308673 
308697 
308778 
308808 
308875 
TABLE 8C shows the genomic position for those Pkeys in Table 8A lacking Unigene IDs and accession numbers. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 
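
The four columns described for Table 8C (Pkey, Ref, Strand, Nt_position) map naturally onto a small record type. The sketch below is illustrative only: it assumes the nucleotide positions are written as `start-end` and that the strand is labelled `Plus` or `Minus`, neither of which is confirmed by the legend, and the example values are invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PredictedExon:
    pkey: int    # Eos probeset identifier
    ref: str     # sequence source, e.g. a Genbank GI number
    strand: str  # "Plus" or "Minus" (assumed labels)
    start: int   # first nucleotide position of the predicted exon
    end: int     # last nucleotide position of the predicted exon

    @property
    def length(self) -> int:
        """Exon length in nucleotides, counting both end positions."""
        return abs(self.end - self.start) + 1


def parse_exon(pkey: str, ref: str, strand: str, nt_position: str) -> PredictedExon:
    """Build a record from one table row (assumed 'start-end' position format)."""
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand label: {strand!r}")
    start_text, end_text = nt_position.split("-")
    return PredictedExon(int(pkey), ref, strand, int(start_text), int(end_text))


# Hypothetical row; these values do not come from Table 8C.
exon = parse_exon("400001", "1234567", "Plus", "1000-1250")
print(exon.length)
```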
TABLE 9A: Potential Therapeutic, Diagnostic and Prognostic targets for Therapy of Lung Cancer 

Table 9A shows about 1312 genes upregulated in lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid 
tumors) relative to normal body tissues. These genes were selected from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. 

Table 9B shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 9A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 9C shows the genomic positioning for those Pkeys lacking Unigene IDs and accession numbers in Table 9A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of normal lung samples 

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 
Homo sapiens, Similar to RIKEN cDNA 2810 


11.33 


1.00 


434828 


D90070 


Hs.96 


phorbol-12-myristate-13-acetate-induced 


1.00 


1.00 


434876 


AF 160477 


Hs 61460 


In sunAifam&v mcpntor I NIR 


1.25 


1.29 


434891 


AA814309 


Hs 123583 


ESTs 


1.00 


6.00 


434928 


AW015595 


Hs.4267 


Homo sapiens clones 24714 and 24715 mRNA 


1.00 


1.00 


435013 


H91923 


Hs.1 10024 


Target CAT 


1.26 


1.10 


435066 


BE261750 


Hs.4747 


dyskeratosis congenita 1, dyskerin 


1.69 


1.37 


435087 


AW975241 


Hs.23567 


ESTs 


1.00 


1.00 


435099 


AC004770 


Hs.4756 


flflp stfnchjr&*sp6crfkr ondonuctessB 1 * 


290 


1.93 


435159 


AA6B8879 


Hs.1 16649 


ESTs 


1.00 


1.00 


435205 


X54136 


Hs.181125 


Immunoglobulin lambda locus 


1.02 


1.46 


435232 


NM_001262 


Hs.4854 


cyclin-dependent kinase inhibitor 2C (p1 


2.04 


2.70 


435304 


H10709 


Hs.269524 


ESTs 


27.58 


139.00 


435313 






ESTs 


1.00 


14.00 


435505 


AF200492 


Hs 21 1238 


interleukin-1 homolog 1 


1.00 


38.00 


435509 


A1458679 


Hs 181915 


ESTs 


1.00 


1.00 


435525 


Al 831 297 


Hs.123310 


ESTs 


1.00 


56.00 


435532 


AW291488 


Hs 117305 


Homo sapiens, clone IMAGE:3682908, mRNA 


1.00 


2.00 


435550 


AI224456 


Hs.324507 


H.sapiens polyA site DNA 


3.42 


3.92 


435602 


ftrc If 919 




t mf+ianvtnrbpft hnno marmw omtein RM03 


£95 


1.80 


435766 


R11673 


Hs 186498 


ESTs 


1.00 


28.00 


435793 


ARM 7 734 


Hs.4993 


KIAA1313 protein 


23.68 


42.00 


436069 


AJ056879 


Hs.263209 


ESTs 


1.00 


58.00 


436170 


AW450381 


Hs.1 4529 


ESTs 


1.00 


18.00 


436211 


AK001581 


Hs.334828 


hypothetical protein FLJ10719; KIAA1794 


5.84 


22.00 


436213 


AA325512 


Hs.71472 


hypothetical protein FLJ10774; KIAA1709 


1.42 


1^7 


436217 


T53925 


Hs.1 07 


fibrinogen-like 1 


57.97 


31.00 


436238 


AK002163 


Hs 301724 


hypothetical protein FLJ11301 


2.51 


1.71 


436251 


RF^I ^RS 

DUO 1 JUtW 


Hs 296585 


nirrJp.nlar nmtain fKKF/D rpoeaft 


2.33 


1.64 


436291 


BE5SB452 


Hs.344037 


protein regulator of cytokinesis 1 


108.99 


52.00 


436302 




He oa^n 


hvnnfhpfi«H nmtrin Ft .I?35fl8 


0.75 


2.81 


436306 




Hs 152713 


wingless-type MMTV integration site fami 


60.01 


1.00 


436414 




Hs 143638 


WT) rpnflai domain 4 


2.50 


2.19 


436419 


AI948626 


Hs 171356 


ESTs 


0.95 


1.33 


436443 


AW13B211 


Hs 128748 


ESTs 


1.12 


9.26 


436474 


AJ770fi93 

njcf UU«U 


Hs 199887 

no. issooi 


ESTs 


1.00 


1.00 


436481 


AA379597 


Hs.5199 


HSPC150 protein similar to ubiquitin-con 


3.28 


1.56 


www 


AA742221 


Hs. 120633 


ESTs 


1.00 


19.00 


436511 


AA721252 


Hs.291502 


ESTs 


16.76 


14.00 


436553 


X57809 


Hs 181125 


immunoglobulin lambda locus 


1.08 


1.74 


436557 


W15573 


Hs.5027 


ESTs, Weakly similar to A47582 B-cell gr 


19.20 


9.75 


436608 


AA628980 




down syndrome critical region protein DS 


33.92 


25.00 


436667 


AW025183 


Hs.1 27680 


ESTs 


0.89 


1.19 


436771 


AW975687 


Hs.292979 


ESTs 


1.00 


10.00 


436839 


AA401369 


Hs.190721 


ESTs 


1.00 


17.00 


436887 


AW953157 


Hs.193235 


hypothetical protein DKFZp5470155 


1.06 


1.15 


436944 


AW268614 


Hs.5840 


ESTs 


1.00 


1.00 


436961 


AW375974 


Hs.156704 


ESTs 


25.13 


25.00 


436972 


AA284679 


Hs.25540 


claudin 3 


1.59 


1.46 


437016 


AU076916 


Hs.5398 


guanine monophosphate synthetase 


2.35 


1.78 


437044 


AL035864 


Hs.69517 


cDNA for differentially expressed CO16 g 


1.34 


1.13 






437181 


Al 30 6615 


Hs.1 25343 


ESTs, Weakly similar to KIAA0758 protein 


1.00 


17.00 


437204 


AL1 10216 


Hs!22826 


ESTs, Weakly similar to 155214 salivary 


40.55 


8100 


437205 


ALt 10232 


Hs.279243 


Homo sapiens mRNA; cDNA DKFZp564D2071 (f 


1.00 


11100 


437259 


A1377755 


Hs.1 20695 


ESTs 


1.00 


205.00 


437270 


R18087 


H&323769 


Ho^sanierts mRWA* cDNADKFZD56fi0134 (ft 


1.56 


1.54 


437271 


All 37445 


Hs.28846 




113.25 


125.00 


437370 


AL359567 


Hs.1 61962 


Homo sapiens mRNA; cDNA DKFZp547D023 (fr 


1.82 


4.57 


437390 


All 25859 


Hs.1 12607 


ESTs 


1.35 


175 


437412 




Hs. 34744 


Homo sapiens mRNA; cDNA DKFZp547C136 (fr 


3.58 


3.20 


437435 


AJ306152 


Hs.27027 


hypothetical protein DKFZp762H131 1 


3.03 


108 


437444 


H46Q08 


Hs.31518 


ESTs 


1.00 


39.00 


437558 


Al 954 795 


Hs 156135 


ESTs 


1.00 


19.00 


437623 


053880 


Hs.5719 


chromosome condensation-related SMC-asso 


1.95 


157 


437789 


AJSJ1344 


Hs.1 27812 


ESTs, Weakly similar to T17330 hypotheti 


1.00 


3.00 


437814 


AJ088192 


Hs.1 35474 


ESTs, Weakly similar to DDX9_HUMAN ATP-D 


1.00 


45.00 


437840 


AA884836 


Hs.292014 


ESTs 


1.07 


1.78 


437852 


BE0D1836 

ULUv 1 UvU 


Hs.256897 


ESTs, Weakly similar to dJ36501Z1 (H.sa 


1.68 


3.26 


437879 


BE262082 


Hs.5394 


hypothetical protein FLJ10305 


1.87 


152 


437915 


Al 837993 




Homo saniflns dono N1 1 MTfira2D1 taratoca 


74.05 


35.00 


437916 




H&20999 


hypothetical protein FLJ23142 


23.15 


89.00 


437937 


A1917222 


Hs 121655 


ESTs 


1.00 


1.00 


437942 


A1888256 


Hs.307525 


ESTs 


12.28 


3100 


438091 


AW373062 




nuclear receptor subfamily 1, group I, m 


1.53 


10.85 


438113 


AJ467908 


Hs.8882 


ESTs 


1.80 


139 


438119 


AW983917 


Hs 901961 


ESTs, Moderately similar to AF116721 89 


22.67 


36.90 


438274 


Al 9 18906 


Hs.55080 


ESTs 


1.00 


100 


438378 


AW97M29 

A?»3f VKJtJ 


Hs 86434 


hvoofhafica) oroteln FU21B16 


38.92 


38.00 


438403 


AA806607 


Hs99220fi 


ESTs 


1.00 


1.00 


438494 


AA<MRK78 • 

rV>3UQO( O 


Hs 130183 
no. iju i do 


ESTs 


2.05 


80.00 


438546 


AW2979D4 


Hs.125811 


ESTs 


1.00 


13100 


438552 


AI945890 


Hs.6314 


h/rifi 1 tran tmpmhrarw rpmntnr fcp'm rrfi-r 


1.43 


1.45 


438702 


AIB79064 


Hsi>4618 


ESTs 


1.00 


34.00 


438724 


AWS12553 


Hs.1 14670 


Human DNA sequence from clone RP11-16L21 


1.33 


1.10 


438746 


A1BS5815 


Hs.1 84727 


Human melanoma-associated antigen p97 (m 


2.42 


1.59 


438779 




Hs.6414 




1.00 


18.00 




AA89R425 


Hs 19937^ 


ESTs 


2.03 


2.57 


438885 


AID0U390 


Hs.184987 


ESTs 


6.42 


88.00 


438898 


AA401369 


Hs.1 90721 


ESTs 


22.41 


17.00 


438915 


AA280174 

*WtOU 111 


Hs 985681 


Williams-Beuren syndrome chromosome regi 


1.00 


100 


438956 


W00847 


Hs.1 35056 


Human DNA seQuence from ctone RP5-B50E9 


2.20 


188 


439000 


AW979121 




gb:EST391231 MAGE resequences, MAGP Homo 


2.78 


4.81 


439023 


AA745978 


Hs 28273 


ESTs 


1.17 


1.31 


439024 


r\3uu ju 


HOS598 


ESTs 


1.00 


28.00 


439128 




Hs.153089 


ESTs 


100 


67.00 


439146 


AW13B909 


Hs.1 561 10 


immunoglobulin kappa constant 


138 


1.41 


439223 


AW238299 


Ue950618 


in 16 hindrno orotain 2 


1.93 


1.64 


439285 


All 33916 




hypothetical protein FU20093 


46.23 


139.00 


439318 


AW837046 


Hs.6527 


G protein-coupled receptor 56 


2.00 


22) 


439343 


AF0AR161 


Hs 11 46 11 


hypothetical protein FLJ11808 


6.10 


7.37 


439394 


AA401359 


Hs, 190721 


ESTs 


3.39 


17.00 


439410 


AA839019 


Hs 188746 


ESTs 


183 


3.07 


439451 


AF086270 


Hs 278554 


heterochromatin-like protein 1 


23.28 


5100 


439452 


AAS18317 


Hs.57987 


B-cell CLL/lymphoma 11B (zinc finger pro 


18.76 


12100 


H03HU0 


BE264974 




thyroid hormone receptor interactor 13 


2.78 


1.58 


439477 


W69813 


Hs.58042 


ESTs, Moderately similar to GFR3_HUMAN G 


1.22 


144 


439492 


AF08631D 


Hs 103159 


ESTs 


7.46 


39.00 


439523 


W72348 


Hs.1 85029 

Fid* I9vv4v 


ESTs 


1.00 


1.19 


439592 


AF0R6413 


Ha 58399 


ESTs 


1.00 


1.00 


439506 


W79123 


Hs *iRW1 


G protein-coupled receptor 87 


33.61 


1.00 


439670 


AFC8B076 


Hs.59507 


ESTs, Weakly similar to AC004858 3 U1 sm 


1.00 


1.00 


439702 




Hs 134182 


ESTs 


4.30 


10.00 


439706 


AW872597 


Hs 59761 


ESTs, Weakly similar to DAP1_HUMAN DEATH 


86.55 


11.00 


439738 


8E246502 


Hs.9598 


cpma Hnntain. Imrrninooiobuliri domain (\tA 


2.36 


1.88 


439750 


Al ^9nia 

Al-O 33uOj 


Hs S7R64 

ns.d/ Out 


Homo sapiens mRNA full length insert cDN 


2.02 


6.08 


439759 


AL359D55 


Hs.67709 


Homo sapiens mRNA full length insert cDN 


100 


21.00 


439780 


AL 109688 




gb:Homo sapiens mRNA full length insert 


7.27 


25.00 


439840 


AW449211 


Hs.1 05445 


GDNF family receptor alpha 1 


100 


1.00 


439926 


AW014875 


Hs.1 37007 


ESTs 


32.58 


7100 


*W330O 


AW947599 


no. 0/ 90 


nlstalet.flrfruatinn fopfnf arah/lhvHrnla 
pialclci oCUVoUliy loWUt ouoiyuijuiuw 


21.28 


9.55 


439979 




Hs.6823 


hypothetical protein FU 10430 


68.63 


61.00 


440006 




Hs.6844 


hypothetical protein FLJ20510 


1.83 


4.02 


440028 


AW473675 


Hs.125843 


ESTs, Weakly similar to T17227 hypotheti 


1.42 


154. 


440106 


AAB64968 


Vk 197fiQQ 

nS. I 033 


K1AA1R03 nmtpin 


1.00 


54.00 


440138 


ARQ33023 


Hs 318127 


hypothetical protein FLJ10201 


24.18 


52.00 


440273 




He 79R33S 


Homo sapiens cDNA: FLJ23523 fis, clone L 


3.21 


4.72 


440289 




hk 199071 
no. 1 l 


ESTs 


38.63 


113.00 


*WIU40 




ns.i ids 


a disintegrin and metalloproteinase doma 


62.88


147.00 


440492 


R39127 


Hs.21433 


hypothetical protein DKFZp547J036 


2.35 


3.62 


440527 


AV657117 


Hs.184164


ESTs, Moderately similar to S65657 alpha 


10.84 


57.00 


440659 


AF134160 


Hs.7327 


claudin 1


3.18 


137 


440704 


M69241 


Hs.1 62 


insulin-like growth factor binding prote


189 


109 


440943 


AW082298 


Hs.146161 


hypothetical protein MGC2408 


102 


141 


440994 


AI160011 


Hjl272068 


ESTs 


1.29 


114 


441020 


AA401369 


Hs.190721 


ESTs 


14199 


17.00 


441031 


A» 10684 


Hs.7645 


fibrinogen, B beta polypeptide


141 


99.00 
441128


AA570256 




ESTs, Weakly similar to T23273 hypotheti


4.13 


3.50 


441290 


W27501 


Hs.89605


cholinergic receptor, nicotinic, alpha p 


1.00 


1.00 


441362 


BE614410 


HSJZ3044 


RAD51 (S. cerevisiae) homolog (E coli Re


130.23 


43.00 


441377 


BE218239 


Hs.202656 


ESTs 


22.03 


1.00 


441390


AI692550 


Hs.131175 


ESTs 


3.65 


7.70 


441497 


R51064 


Hs^3172 


ESTs 


1.00 


1.00 


441525 


AW241867 


Hs.127728 


ESTs 


1.53 


1.42 


441553 


AA281219 


Hs.121296 


ESTs 


1.69 


1.57 


441607 


NM_005010


Hs.7912 


neuronal cell adhesion molecule 


1.47 


211 


441633 


AW958544 


Hs.112242 


normal mucosa of esophagus specific 1


216.22 


363.00 


441636 


AA081846 


Hs.7921 


Homo sapiens mRNA; cDNA DKFZp566E183 (fr 


231 


205 


441737 


X79449 


Hs.7957 


adenosine deaminase, RNA-specific


1.30 


1.49 


4417S0 


AA401359 


Hs.190721 


ESTs 


44.15 


17.00 


441801 


AW242799 


Hs^6366 


ESTs 


1.00 


1.00 


441919 


AI553802 


Hs.128121 


ESTs 


1.00 


12200 


441937 


R41782 


Hs.22279 


ESTs 


0.66 


1.37 


441954 


AI744935 


Hs.8047 


Fanconi anemia, complementation group G


1.48 


1.39 


442025 


AW887434 


Hs.11810 


COA11 protein 


1.00 


46.00 


442029 


AW956698 


Hs.14456


neural precursor cell expressed, develop 


9.92 


45.00 


442072 


AI740832 


Hs.12311 


Homo sapiens clone 23570 mRNA sequence 


25.05 


77.00 


442108 


AW452649 


Hs.166314


ESTs 


3.61 


3.14 


442117 


AW664964 


Hs.128899


ESTs 


3.00 


5.49


442137 


AA977235 


Hs.128830 


ESTs, Weakly similar to Z192_HUMAN ZINC


1.00 


1.00 


442159 


AW163390


Hs.278554 


heterochromatin-like protein 1


1.92 


1.66 


442179 


AA983842


Hs.333555 


chromosome 2 open reading frame 2 


27.22 


50.00 


442328 


AI952430


Hs.150614 


ESTs, Weakly similar to ALU4_HUMAN ALU S


5.00 


3.42 


442432 


BE093589 


Hs.38178 


hypothetical protein FLJ23468


181.59 


76.00 


442530 


AI580830


Hs.176508 


Homo sapiens cDNA FLJ14712 fis, clone NT


10.59 


144.00 


442547 


AA306997 


Hs.217484 


ESTs, Weakly similar to ALU1_HUMAN ALU S


109.23 


98.00 


442556 


AL137761 


Hs.8379 


Homo sapiens mRNA; cDNA DKFZp586L2424 (f 


1.00 


53.00 


442619 


AA447492 


Hs.20183 


ESTs, Weakly similar to AF164793 1 prote


29.02 


50.00 


442710 


AI015631


Hs.23210 


ESTs 


1.00 


19.00 


442717 


R88362 


Hs.180591 


ESTs, Weakly similar to T23976 hypotheti 


1.00 


5.00 


442875 


BE6230C3 


Hs.23625 


Homo sapiens clone TCCCTA00142 mRNA sequ


22.85 


50.00 


442914 


AW188551


Hs.99519 


hypothetical protein FLJ14007


25.33 


B200 


442932 


AA457211 


Hs.8858


bromodomain adjacent to zinc finger doma


3.18


4.41 


442942 


AW167087


Hs.131562 


ESTs 


8.45 


64.00 


443068 


AI188710 




ESTs 


1.00 


27.00 


443204 


AW205876 


Hs.29643 


Homo sapiens cDNA FLJ13103 fis, clone NT


1.00 


24.00 


443211 


AI128368 


Hs.143655


ESTs 


12.42 


200 


443247 


BE614367 


Hs.333893 


c-Myc target JP01 


128.64 


96.00 


443324 


R44013 


Hs.164225 


ESTs 


0.02 


4.59 


443383 


AI792453


Hs.166507


ESTs 


1.00 


47.00 


443400 


R28424 


Hs.250648 


ESTs 


18.52 


61.00 


443426 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


4.02 


1.75 


443572 


AA025610 


Hs.9605 


cleavage and polyadenylation specific fa


2.98 


257 


443575 


AI078022 


Hs.269636 


ESTs, Weakly similar to ALU1_HUMAN ALU S


1.00 


29.00 


443614 


AV555386 


Hs.7645 


fibrinogen, B beta polypeptide 


1.00 


16.00 


443633 


AL031290 


Hs.9654 


similar to pregnancy-associated plasma p


1.00 


39.00 


443648 


AI085377 


Hs.143610 


ESTs 


39.81 


70.00 


443715 


AI583187


Hs.9700 


cyclin E1


48.74 


7.00 


443723 


AI144442


Hs.157144 


syntaxin 6


1.29 


1.30 


443802 


AW504924 


Hs.9805 


KIAA1291 protein 


1.75 


1.61 


443859 


NM_013409


Hs.9914 


follistatin


1.35 


1.13 


443692 


AA401369 


Hs.190721 


ESTs 


1.00 


17.00 


443947 


W24187 




gb2b47!09j1 SoaresJetaUunft.NbHLl9W 


1.33 


1.64 


443991 


NM_002250 


Hs.10082


potassium intermediate/small conductance


5.71 


6.87 


444006 


BE395085 


Hs.10086


type 1 transmembrane protein Fn14 


1.47 


1.92 


444009 


AI380792


Hs.135104


ESTs 


1.00 


77.00 


444017 


U04840 


Hs.214 


neuro-oncological ventral antigen 1


1.00 


1.00 


444127 


N63620 


Hs.13281 


ESTs 


1.00 


29.00 


444129 


AW294292 


Hs.256212 


ESTs 


1.00 


1.00 


444279 


U62432 


Hs.89605 


cholinergic receptor, nicotinic, alpha p 


0.60 


7.80 


444371 


BE540274 


Hs.239 


forkhead box M1


291 


1.14 


444378 


R41339 


Hs.12569


ESTs 


1.00 


1.00 


444381 


BE387335 


Hs.283713 


ESTs, Weakly similar to S64054 hypotheti 


469.00 


556.00 


444461 


R53734 


Hs.25978 


ESTs, Weakly similar to 2109260A B cell


1288 


105.00 


444471 


AB020684 


Hs.11217


KIAA0877 protein


24.91 


90.00 


444489 


AI151010 


Hs.157774


ESTs 


1.00 


111.00 


444619 


BE538082 


Hs.8172 


ESTs, Moderately similar to A46010 X-lin


1.00 


70.00 


444665 


BE613126 


Hs.47783 


B aggressive lymphoma gene 


30.56 


139.00 


444707 


Ail 6861 3 


Hs.41690 


desmocollin 3


1.00 


1.00 


444735 


BE019923 


Hs.243122 


hypothetical protein FLJ13057 similar to


77.02 


90.00 


444761 


NM_014400


Hs.11950


GPI-anchored metastasis-associated prote


1.57 


1.31 


444783 


AK001468 


Hs.62180


anillin (Drosophila Scraps homolog), act


7755 


200 


445236 


AK001676 


Hs.12457


hypothetical protein FLJ10814
AL109688 R23665 R26576

AA570256 AW014761 AA573721 AI473237 AI022165 AA554071 AA127551 N90525 AW973623 AA447991 AA243852 BE328850 AI148171
AI359627 AI005088 AI356567 AA232991 AW016855 AA906902 AA233101 AA127550 BE512923
AI188710 AI032142 AW078833 N30308 AW675632 AI219028 AI341201 N22181 H95390
W24187 W24194 R17789

Y10043 NM_005342 L05085 AL034450 BE614226 AW749053 AA379173 AA248230 BE514634 AA334622 R70656 AA367593 AA214649
AA369318 AW957081 R05760 AA039903 AI886597 AW630122 AA906264 AA041527 R01145 AI0886B8 BE463637 AA398795 AI354883 
AI768938 AI569996 A1452952 A1168582 All 89869 AI086670 AW262560 AW613B54 AABS2839 AA435840 AA670197 AI024032 AI990659 
AI990089 N81095 AA847919 AW9601 50 AA21 1075 AA044704 AA367594 AW582587 AWB58854 AW81 8630 AW818281 AW818433 AW582595 
AA096002N83992 

AI471630 BE540637 BE265481 AW407710 BE513882 BE546739 AA053597 BE140503 BE218514 AW956702 AI656234 AI636283 A1567265 

AW340858 BE207794 AA0530B5 R69173 AA292343 AA454908 AA293504 A1659741 AI927478 AA399460 AI760441 AA346416 BE047245 

AA730380 AA394063 AA454833 A1982791 A1557Z70 A1813332 AI767858 AA427705 D20284 AI221458 BE048537 A1263048 AA346417 

AA911497 BE537702

AI638293 AW813561

AI761324 AW880941 AW880937

AW118072 AI631982 T15734 AA224195 A1701458 W20198 F26326 AA890570 N90552 AW071907 AJ671352 A1375892 T03517 R88265 
AI124088 AA224388A!084316A!354686T33652 AI140719AI720211 T03490AI372637T15415AW205B36 AA630384 T03515 T33230 
AA017131 AA443303 T33623 AI222556 T33511 T33786 AI419606 D55612 

W52854 AL117600 BE208116 BE208432 BE206239 BE082291 AW953423 AA351619 BE180648 BE140560 W60080 AAB65478 N90291 
AW450652AW449519 AA993634 AI806539AA351618AW449522AI827626AA904788AA380381 AA886045AA774409 BE003229 Z41756 
AL133619 AA468118 AA383064 AI476447 T09430 AI673758 AA524895 AI581345 A1300820 AW498812 AA256162 AI559724 A1685732 
AA602400 AA905453 AI204595 AW166541 AA157456 AA156269 AA383652 AA431072 AW592707 AI435410 AW272464 AI215594 AA622747 
R74039 N35031 AI80412BAW513621 AA868351 A1026826 A1493388 AA614641 W81604AI567080 AI214351 AA730140 AI125754 A120081 3 
AI269603 AI565082 A1807095 A1476629 AA505909 AI368449 A1685077 A1582930 AW085038 AA757863 AA7301 54 AI767072 AA468316 
AI734130 AI734138 AA426284 AA433997 A1741241 AW043563 AI732741 A1732734 AA437369 AA425820 AA664048 R74130 
BE144666 BE184942 AW238414 BE184946 
AW993247 AW861464
AA203682 R11958

BE550224 AA832519 N45402 AW885857 N29245 BE465409 W07677 AW970089 AI299731 AA482971 BE503548 H18151 W79223 AF086393 
AA461301 W74510 R34182 A1090689 N46003 BE071550 R28075 AW134982 A1240204 AI138906 AW026179 A1572316 BE466182 AI2)6395 
AI276154 A1273269 AI422817 A1371014 AI421274 AI188525 AA939164 BE549810 AW137865 AI694996 BE503B41 AA459718 BE327407 
BE467534 BE21B421 BE467767 AA989054 BE467063 AI797130 BE3277B1 



TABLE 9C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
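
The nucleotide-position column described above packs one or more predicted exons into a single comma-separated cell. A minimal Python sketch of how such a cell could be split into coordinate pairs follows. The function names are hypothetical, and the sketch assumes that each exon is written as `start-end` and that both coordinates are inclusive, which the legend does not state explicitly.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Parse a cell such as '13558-13721,13942-14090' into (start, end) pairs."""
    exons: List[Tuple[int, int]] = []
    for chunk in field.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue  # tolerate a trailing comma
        start_text, end_text = chunk.split("-", 1)
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start after end: {chunk!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, ASSUMING both coordinates are inclusive."""
    return sum(end - start + 1 for start, end in exons)


# Row taken from the table: Pkey 400664, Ref 8118496, Plus strand.
exons = parse_nt_position("13558-13721,13942-14090,14554-14679")
print(exons)
print(total_exon_length(exons))
```

Cells damaged in reproduction (stray punctuation in place of a hyphen or comma) raise a `ValueError` here rather than being silently guessed at, which seems the safer choice for coordinate data.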



Pkey 


Ref 


Strand 


Nt_position


400512 


9796593 


Minus 


1439-1615 


400517 


9796686 


Minus 


49996-50346 


400560 


9843598 


Plus 


94182-94323,97056-97243,101095-101236,102824-103005


400664 


8118496 


Plus 


13558-13721,13942-14090,14554-14679 


400665 


8118496 


Plus


16879-17023 


400666


8118496


Plus 


17982-18115,20297-20456 


400749 


7331445 


Minus 


9162-9293 


400763 


8131616 


Minus 


35537-35784 


401027 


7230983 


Minus 


70407-70554,71060-71160 


401093 


8516137 


Minus 


22335-23166 


401203 


9743387 


Minus 


172961-173056,173868-173928 


401212 


9858408 


Plus 


87839-88028


401411 


7799787 


Minus 


144144-144329 


401435 


8217934 


Minus 


54508-55233 


401464 


6682291 


Minus 


170688-170834 


401714 


6715702 


Plus


96484-96681


401747 


9789672 


Minus 


118596-118816,119119-1192^11960^119761,120422-120^ 








131932,132451-132575,133580-134011


401760 


9929699 


Plus 


83126-83250,8532^85540,94719-95287 


401780 


7249190 


Minus 


28397-28617,28920-29045^35-292963411-29567,2970^29787,30^4-30573 


401781 


7249190 


Minus 


83215^3435.83531-83656^3740^01,84237-84393,84955^37 1 6629(W6814 


401785 


7249190 


Minus 


165776-165996,168189-166314,166408-166569,167112-167268,167387-167469,168634-168942 


401797 


6730720 


Plus 


6973-7118 


401961 


4581193 


Minus 


124054-124209 


401985 


2580474 


Plus 


61542-61750 


401994 


4153858 


Minus 


4290443124,43211-43336.44607-44763,4519945281,46337-46732 


402075 


8117407 


Plus 


121907-122035,122804-12321,124019-124161.124455-124610,125672-126076 


402260 


3399665 


Minus 


113765-113910,115653-115765,116808-116940 


402265 


3287673 


Plus


21059-21168 


402297 


6598824 


Plus


3527^35405,3557^35559 


402408 


9796239 


Minus 


110326-110491 
402420


9796339 


Plus 


129750-129919 


402674 


6077108 


Minus 


J9ZSD-395UZ 


402802 


3287156 


Minus 


53242-53432 


402994 


2996643 


Minus 




403137 


9211494


Minus 


92349-92572,92955-93084,90079-93712,93949-94072,94591-94748,95214-95337


403306 


8099945 


Plus 


127100-127251 


403329 


851612) 


Plus


95450-96598 


403381 


9438267 


Minus 


1C/W1 OC4-/6 

ZoOD9>2di7o 


403478 


9958258 


Plus 


116458-116564 


403485 


9965528 


Plus 


2888-3001,3198-3532,3655-4117


403827 


8569879 


Minus 


23868-24342 


403715 


72o35o9 


Plus


85128-85292 


404044 


9558573 


Minus 


225757-225939 


404076 


9931752 


Minus 


3848-3967 


404101 


8076925 


Minus 


125742-125997 


404140 


9843520 


Plus 


37761-38147 


404165 


9926489 


Minus 


69025-69128 


404185 


4572584 


Minus 


129171-129327 


404210 


5006246 


Plus 


169926-170121


404253 


9357202 


Minus


55675-56055 


404287 


2326514 


Plus


53134-53281 


404298 


9944263 


Minus


73591-73723 


404347 


9338195 


Plus 


74493-74829 


404440 


7528051 


Plus 


80430-81581 


404721 


9656648 


Minus 


173763-174294 


404794 


4826439 


Plus


101619-101898 


404854 


7143420 


Plus


14260-14537 


404877 


1519284 


Plus


1095-2107 


404927 


7342002 


Plus


68690-69563 


404996 


6007890 


Plus


37999-38145,38652-38998,39727-39872,40557-40674,42351-42450


405449 


7622497 


Plus 


42236-42570 


405566 


6006906 


Plus 


35912-36065 


405572 


3800891 


Plus 


85230-85938 


405646 




Plus


741-969 


405676 


4557087 


Plus


73195-73917 


405770 


2735037 


Plus


61057-62075 


405932 


7767812 


Minus 


123525-123713 


406137 


9166422 


Minus 


30487-31058 


406360 


9256107 


Minus 


7513-7673 


406399 


9256286 


Minus 


63448-63554 


406467 


9795551 


Plus


182212-182958 



TABLE 10A: Potential Therapeutic, Diagnostic and Prognostic Targets for Therapy of Lung Cancer and Non-malignant Lung Disease

Table 2A shows about 307 genes up-regulated in non-malignant lung disease relative to lung tumors and normal body tissues and/or down-regulated in lung tumors relative to
normal lung and non-malignant lung disease. These genes were selected from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array.

Table 10B shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 10A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 10C shows the genomic positioning for those Pkeys lacking Unigene IDs and accession numbers in Table 10A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the average of normal lung samples

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples
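
The two ratio columns defined above can be illustrated with a short Python sketch. It assumes that "average" means the arithmetic mean of per-sample expression values for one probeset; the function name and the intensity values are hypothetical and are not taken from the tables.

```python
from statistics import mean
from typing import Sequence, Tuple


def expression_ratios(
    tumor: Sequence[float],
    disease: Sequence[float],
    normal: Sequence[float],
) -> Tuple[float, float]:
    """Return (R1, R2) for one probeset.

    R1 = mean(lung tumor samples) / mean(normal lung samples)
    R2 = mean(non-malignant lung disease samples) / mean(normal lung samples)
    ASSUMPTION: 'average' is the arithmetic mean of the sample values.
    """
    normal_avg = mean(normal)
    if normal_avg == 0:
        raise ValueError("normal lung average is zero; ratio undefined")
    return mean(tumor) / normal_avg, mean(disease) / normal_avg


# Hypothetical intensities for a single probeset (illustration only).
r1, r2 = expression_ratios(
    tumor=[10.0, 30.0],
    disease=[80.0, 120.0],
    normal=[40.0, 60.0],
)
print(f"R1={r1:.2f} R2={r2:.2f}")
```

With these made-up values the probeset would show R1 below 1 and R2 above 1, the pattern the table description associates with genes lower in tumors and higher in non-malignant lung disease than in normal lung.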



Pkey 


ExAccn 


UnigeneID


Unigene Title


R1 


R2 


404394 






ENSP00000241075:TRRAP PROTEIN. 


0.79 


3.10 


404916 






Target Exon 


100 


159.00 


405257 






Target Exon 


1.00 


422.00 


407228 


M25079 


Hs.155376 


hemoglobin, beta 


a47 


2.33 


407568 


AA740964 


Hs.62699 


ESTs 


1.00 


123.00 


408562 


AI436323 


Hs.31141


Homo sapiens mRNA for KIAA1568 protein,


1.00 


230.00 


409031 


AA376836


Hs.76728 


ESTs 


1.00 


128.00 


410434 


AF051152 


Hs.63668 


toll-like receptor 2


39.65 


149.00 


410467 


AF102546 


Hs.63931 


dachshund (Drosophila) homolog


1.00 


109.00 


410808 


T40326 


Hs.167793 


ESTs 


1.14 


U14 


412351 


AL135960 


Hs.73828 


T-cell acute lymphocytic leukemia 1


0.37 


2.27 


412372 


R65998 


Hs.285243 


hypothetical protein FLJ22029


1.00 


173.00 


413795 


AL040178 


Hs.142003 


ESTs 


0.10 


11.90 


414154 


AW205314 


Hs.323050


ESTs 


0.62 


2.09 


414214 


D49958 


Hs.75819 


glycoprotein M6A 


0.03 


4.55 


414998 


NM_002543


Hs.77729 


oxidised low density lipoprotein (lectin


0.64 


Z97 


415122 


D60708 


Hs.22245 


ESTs 


0.07 


8.97 


415765 


NM_005424


Hs.78824 


tyrosine kinase with immunoglobulin and 


0.67 


1.65 


415775 


H00747 


Hs.29792 


ESTs, Weakly similar to I38022 hypotheti


0.29 


Z64 


415910 


U20350 


Hs.78913


chemokine (C-X3-C) receptor 1


1.00 


145.00 
416319


AJ815501 


Hs.79197 


416402 


NM_000715 


Hs.1012 


417355 


D13168 


Hs.82002 


417421 


AL138201


Hs.82120 


417511 


AL049176 


Hs.82223 


418489 


U76421 


Hs.85302 


418726 


BE241812 


Hs.87860 


418741 


H83265 


Hs.8881 


418683 


BE387036 


Hs.1211 


419086 


NM_000216


Hs.89591 


419150 


T29618 


Hs.89640 


419235 


AW470411 


Hs.288433


419407 


AW410377 


Hs.41502 


420556 


AA276300 


Hs.124292 


420656 


AA279098 


Hs.187638 


420729 


AW964897 


Hs.290825 


421177 


AW070211 


Hs.102415 


422060 


R20893 


HU25823 


422426 


W79117 


Hs.56559 


422652 


AW967969 


Hs.118958


423099 


NM_002837


Hs.123641 


424433 


H04607 


Hs.9216 


424585 


AA464840 


Hs.131987


424711 


NM_005795


Hs.152175 


424973 


X92521 


Hs.154057


425023 


AW956889 


Hs.154210 


425664 


AJ006276 


Hs.159003 


425998 


AU076629 


Hs.165950 


426657 


NM_015855


Hs.171731


426753 


T89832 


Hs.170278


427558 


D49493 


Hs.2171 


427983 


M17706 


Hs.2233 


428467


AK002121 


Hs.184465 


428927 


AA441837 


Hs.90250 


429496 


AA453800 


Hs.192793 


430468 


NMJHM673 


Hs.241519 


431385 


BE178536 


Hs.11090


431728 


NMJM37351 


Hs.268107 


431848 


AI378857 


Hs.126758 


432128 


AA127221 


Hs.117037


432519 


AI221311 


Hs.130704 


433043 


W57554 


Hs.125019 


433803 


AI823593


Hs.27688 


434730 


AA644669 


Hs.193042


435472 


AW972330 


Hs.283022 


436532 


AA721522 




437119 


AI379921 


Hs.177043 


437140 


AA312799 


Hs.283689 


437211 


AA382207


H&5509 


437960 


AI669&6 


Hs.222194


438202 


AW169287 


Hs.22588 


438873 


AI302471 


Hs.124292 


438875 


AA827640 


Hs.189059 


441048 


AA913488 


Hs.192102 


441188 


AW292830 


Hs.255609 


441499 


AW298235 


Hs.101689 


444513 


AL120214 


Hs.7117 


444527 


NM_005408


Hs.11383


444561 


NM_004469


Hs.11392


445279 


R41900 


Hs.22245 


446017 


N98238 


Hsi5185 


446984 


AB020722 


Hs.16714 


446998 


N99013 


Hs.16762


447357 


AI375922


Hs.159367


448106 


AI800470


Hs.171941 


448253 


H25899 


Hs.201591 


449275 


AW450848 


Hs^05457 


450400 


AI594722 


HsJ79744 


450696 


AI654223 


Hs.16026 


450726 


AW204600 


Hs.250505 


451497 


H83294 


Hs.284122 


451533 


NM_004657


Hs^6530 


453636 


R67837 


Hs.169872


458332 


AI000341 


Hs.220491 


459580 
400269 


AA022888 


Hs.176065


403421 






407570 


Z19002


Hs.37096 


412295 


AW088826 


Hs.117176 


414517 


M24461 


Hs.78305 


417204 


N81037 


Hs.1074 


418307 


U70867 


Hs.83974 


418935 


T28499 


Hs.89485 


421502 


AF111856 


Hs.105039 


421799 


N74880 


Hs.29977 



CD83 antigen (activated B lymphocytes, i 

complement component 4-ttnding protein, 

endothelin receptor type B 

nuclear receptor subfamily 4, group A, m 

chordin-like

adenosine deaminase, RNA-specific, B1 (h

protein tyrosine phosphatase, non-recept 

ESTs, Weakly similar to S41044 chromosom

acid phosphatase 5, tartrate resistant 

Kallmann syndrome 1 sequence 

TEK tyrosine kinase, endothelial (venous 

neurotrimin 

hypothetical protein FLJ21276

Homo sapiens cDNA: FLJ23123 fis, clone L

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f
ESTs, Moderately similar to ALU5_HUMAN A
ESTs 

syntaxin 11 

protein tyrosine phosphatase, receptor t 

ESTs 

ESTs 

calcitonin receptor-like
matrix metalloproteinase 19
endothelial differentiation, sphingolipi
transient receptor potential channel 6
fibroblast growth factor receptor 4
solute carrier family 14 (urea transport 
ESTs 

growth differentiation factor 10 
colony stimulating factor 3 (granulocyte 
hypothetical protein FLJ11259
ESTs 
ESTs 

angiopoietin-like 1

membrane-spanning 4-domains, subfamily A
multimerin

ESTs, Highly similar to AF175283 1 zinc 
ESTs 

ESTs, Weakly similar to BCHUIA S-100 pro 
lymphoid nuclear protein (LAF-4) mRNA
ESTs 
ESTs 

triggering receptor expressed on myeloid 
gb3w54h12.r1 NCLCGAP.Ewl Homo sapiens 
ESTs 

activator of CREM in testis
ecotropic viral integration site 2B 
ESTs 
ESTs 

Homo sapiens cDNA: FLJ23123 fis, clone L

ESTs 

ESTs 

ESTs 

ESTs 

glutamate receptor, ionotropic, AMPA 1
small inducible cytokine subfamily A (Cy
c-fos induced growth factor (vascular en
ESTs 
ESTs 

Rho guanine exchange factor (GEF) 15 

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 

ESTs 

ESTs 

ESTs 



ESTs 

hypothetical protein FLJ23191

retinoic acid receptor, alpha

Wnt inhibitory factor-1 

serum deprivation response (phosphatidyl 

ESTs 

ESTs 

ESTs 

Eos Control 

NM_016369*:Homo sapiens claudin 18 (CLDN
zinc finger protein 145 (Kruppel-like, e
poly(A)-binding protein, nuclear 1
surfactant pulmonary-associated protein
surfactant pulmonary-associated protein
solute carrier family 21 (prostaglandin
carbonic anhydrase IV
solute carrier family 34 (sodium phospha
N-acylsphingosine amidohydrolase (acid c



15.32 


237.00 


0.64 


4.00 


0.01 


3.90 


3630 


357.00 


1.00


179.00 


0.02 


6.00 


1.00 


113.00 


0.44 


1.90 


0.96 


2.04 


0.62 


Z74 


0.03 


6.90 


1.48 


5.13 


37.55 


336.00 


0.80 


3.65 


1.65 


8.07 


2.99 


25.82 


0.46 


1.95 


1.00 


156.00 


0.03 


7.44 


0.14 


3.62 


0.01 


3.16 


0.75 


141.75 


1.00 


167.00 


0.43 


3.01 


0.37 


19.45 


0.14 


3.35 


1.00 


94.00 


0.68 


1.42 


0.03 


3.74 


1.00 


141.00 


1.00 


117.00 


0.75 


2.20 


0.76 


Z25 


0.01 


3.62 


1.00 


138.00 


1.00 


132.00 


1.00 


157.00 


1.00 


157.00 


0.34 


2.24 


0.00 


1.15 


0.01 


2.06 


1.00 


267.00 


1.00 


105.00 


1.05 


3.15 


0.83 


1.94 


1.00 


218.00 


1.00 


133.00 


0.67 


122.67 


1.00 


142.00 


1.00 


147.00 


1.00 


141.00 


0.71 


3.66 


23.32


370.00 


0.77 


8.50 


3.43 


16.36 


1.00 


167.00 


1.00 


151.00 


46.47 


153.00 


0.01 


3.08 


0.60 


141.00 


0.18 


139 


0.10 


2.16 


0.01 


2.53 


0.46 


2.64 


18.05 


296.00 


1.00 


141.00 


0.56 


1.38 


0.88 


4.33 


0.52 


208 


0.79 


2.01 


0.35 


2.03 


0.13 


2.25 


1.00 


116.00 


1.00 


192.00 


0.20 


2.98 


0.40 


240 


0.53 


1.77 


0.01 


3.18 


0.56 


1.74 


0.64 


1.50 


0.33 


1.16 


0.53 


1.55 


0.20 


1.28 


0.78 


1.90 


0.59 


1.54 
423354


AB011130 


Hs.127438 


calcium channel, voltage-dependent, alph


0.59 


1.55 


423738 


AB002134 


Hs.132195 


airway trypsin-like protease


10.14 


51.00 


425211 


M18667 


Hs.1667 


progastricsin (pepsinogen C)


0.35 


1.62 


425438 


T62216 


Hs.270840 


ESTs 


0.23 


9.45 


426828 


NM_000020


Hs.172670


activin A receptor type II-like 1


0.03 


1.71 


427019 


AA001732 


Hs.173233 


hypothetical protein FLJ10970


0.01 


1.49 


428043 


T92248 


Hs^240 


uteroglobin


0.42 


1.26 


430280 


AA361258 


Hs.237868 


interleukin 7 receptor


0.46 


2.43 


431433 


X65018 


Hs.253495 


surfactant, pulmonary-associated protein 


0.57 


1.59 


431723 


AW058350 


Hs.16762 


Homo sapiens mRNA; cDNA DKFZp564B2062 (f


0.29 


1.80


432985


T92363 


Hs.178703 


ESTs 


0.32 


2.27 


441835 


AB036432 


Hs.184 


advanced glycosylation end product-speci


0.31 


1.51 


442275 


AW449467 


Hs.54795 


ESTs 


0.55 


1.78


443709 


AI082692


Hs.134662


ESTs 


0.00 


3.02 


444325 


AW152618 


Hs.16757 


ESTs 


0.32 


149 


450954 


AI904740 


HsJ5691 


receptor (calcitonin) activity modifying 


0.46 


1.74 


451558 


NM_001089


Hs.26630 


ATP-binding cassette, sub-family A (ABC1


0.52 


1.87 


453310 


X70697 


Hs£53 


solute carrier family 6 (neurotransmitte


0.00 


3.30


456855 


AF035528 


Hs.153863


MAD (mothers against decapentaplegic, Dr


0.01 


2.31 


444342 


NM_014398


Hs.10887 


similar to lysosome-associated membrane


0.66 


120 


400754 






Target Exon 


1.00 


297.00 


401045 






C1 1 001883*:gi|6753278|reflNP_033938.1 1 c 


1.00 


109.00 


401083 






NM_016582*:Homo sapiens peptide transpor


0.89 


1.39 


402474 






NM_004079:Homo sapiens cathepsin S (CTSS


1.45 


4.47 


402808 






ENSP00000235229:SEMB.


1.00 


1.87 


403021 






C21000030:gi|9955960lreilNP.063957.1| AT 


1.00 


149.00 


403438 






NM_031419*:Homo sapiens molecule possess


1.06 


2.96 


403687 






NM_007037*:Homo sapiens a disintegrin-li


0.04 


4.89 


403764 






NM_005463:Homo sapiens heterogeneous nuc 


1.00 


225.00 


404277 






NM_019111*:Homo sapiens major histocompa


0.97 


1.93 


404288 






NM_002944*:Homo sapiens v-ros avian UR2 


1.00 


68.00 


404518 


A1815601 




CD83 antigen (activated B lymphocytes, i


0.02 


1.83 


405106 






CI l001537*:gil5032241|rBflNP_005732.1| z 


1.00 


235.00 


405381 






Target Exon 


1.00 


93.00 


406387 






Target Exon 


1.37 


6.02 


406646 


M33600 




major histocompatibility complex, class 


0.86 


146 


406714 


A1219304 


Hs.266959 


hemoglobin, gamma G


0.01 


3.19 


406753 


AA505665 


Hs.217493 


annexin A2 


1.00 


147.00 


406973 


M34996 


Hs.198253 


major histocompatibility complex, class 


1.03 


2.04 


407248 


U82275 


Hs.94498 


leukocyte immunoglobulin-like receptor,


1.00 


64.00 


407510 


U96191 




gb:Human trophoblast hypoxia-regulated f


1.00 


90.00 


407731 


NM_000066


Hs.38069 


complement component 8, beta polypeptide 


1.00 


67.00 


407830 


NM_001086


Hs.587 


arylacetamide deacetylase (esterase)


1.00 


10100 


408045 


AW138959


Hs.245123 


ESTs 


1.00 


70.00 


406074 


R20723 




ESTs 


1.00 


11100 


408374 


AW025430 


Hs.155591


forkhead box F1 


0.07 


10.17 


409064 


AA062954 


Hs.141883 


ESTs


0.39 


131 


409083 


AF050083 


Hs.673 


interleukin 12A (natural killer cell sti


1.00 


95.00 


409153 


W03754 


Hs.50813 


hypothetical protein FLJ20022


0.01 


4.55 


409203 


AA780473 


Hs.687 


cytochrome P450, subfamily IVB, polypept


0.01 


3.72 


409238 


AL049990 


Hs.51515 


Homo sapiens mRNA; cDNA DKFZp564G112 (fr


1.00 


79.00 


409389 


AB007979 


Hs.301281 


Homo sapiens mRNA, chromosome 1 specific 


0.14 


27.35 


409718 


D86640


Hs.56045 


src homology three (SH3) and cysteine ri 


1.00 


113.00 


410798 


BE178622 


Hs.16291 


gb:PM3-HT0605-270200-001-a02 HT0605 Homo 


0.64 


147 


411020 


NM_006770


Hs.67726 


macrophage receptor with collagenous str 


0.55 


140 


411667 


BE160198 




gbK3V1-HT0413-010200-05^rr03 HT0413 Homo 


1.00 


111.00 


412000 


AW576555 


Hs.15780


ATP-binding cassette, sub-family A (ABC1 


1.00 


95.00 


412358


BE047490 


Hs.24172 


ESTs 


1.00 


87.00 


412420 


AL035668 


Hs.73853 


bone morphogenetic protein 2


1.43 


8.07 


412S64 


X83703


Hs.31432 


cardiac ankyrin repeat protein


0.02 


3.07 


412869 


AA290712 


Hs.82407 


CXC chemokine ligand 16


0.93 


1.72 


412870 


N22788 


Hs.82407 


CXC chemokine ligand 16


0.97 


1.51 


413529 


U11674 


Hs.846 


interleukin 8 receptor, beta 


0.02 


142 


413533 


BE146973 




gb:OV4-HT 0222-01 1 199-019-e05 HT0222 Homo 


0.65 


1.50 


413689 


BE157286


Hs.20631


zinc finger protein, subfamily 1A, 5 (Pe 


20.87 


23100 


413724 


AA131466 


Hs.23767 


hypothetical protein FLJ12656


1.00 


80.00 


413800 


AI129238


Hs.192235


ESTs 


1.00 


85.00 


413802 


AW964490 


Hs.32241 


ESTs, Weakly similar to S65657 alpha-1 C- 


1.00 


213.00 


413829 


NM_001872


Hs.75572 


carboxypeptidase B2 (plasma)


0.02 


3.93 


414376 


BE393856 


Hs.66915 


ESTs, Weakly similar to 16.7Kd protein [


1.00 


115.00 


414577 


AI056548 


Hs.72116 


hypothetical protein FLJ20992 similar to


0.49 


1.94 


414700 


H63202 


Hs.38163 


ESTs 


0.03 


175 


415078 


AA311223 


Hs.283091 


found in Inflammatory zone 3 


0.86 


1.95 


415120 


N64464 


Hs.34950 


ESTs 


1.00 


120.00 


415323 


BE269352 


Hs.949 


neutrophil cytosolic factor 2 (65kD, chr 


0.60 


148 


415335 


AA847758 


Hs.111030


ESTs 


1.00 


95.00 


415582 


W92445 


Hs.165195 


Homo sapiens cDNA FLJ14237 fis, clone NT


1.00 


136.00 


416030 


H15261 


Hs.21948


ESTs 


0.02 


8.07 


416427 


BE244050 


Hs.79307 


Rac/Cdc42 guanine exchange factor (GEF) 


1.00 


73.00 


416464 


NM_000132


Hs.79345 


coagulation factor VIII, procoagulant co


0.70 


3.36 


416585


X54162 


Hs.79386 


leiomodin 1 (smooth muscle)


0.06


6.56 


416847 


L43821 


Hs.80261 


enhancer of filamentation 1 (cas-like do


0.70 


166 


417148 


AA359896 


Hs.293885 


hypothetical protein FLJ14902


1.00 


114.00 


417370 


T28651


Hs.82030 


tryptophanyl-tRNA synthetase


0.85 


1.30 


417673 


T87281 


Hs.16355 


ESTs 


0.16 


15.54 
418087


AI127958


Hs.83393 


cystatin E/M 


0.81 


1.74 


418298 


C01566 


Hs.86671 


ESTs 


1.00 


99.00 


418643 


J03798 


Hs.86948 


small nuclear ribonucleoprotein D1 polyp


1.00 


60.00 


418832 


X04011 


Hs.88974 


cytochrome b-245, beta polypeptide (chro 


2.40 


14.74 


418945 


BE246762 


Hs.89499 


arachidonate 5-lipoxygenase


0.67 


3.16


419261 


X07876


Hs.89791 


wingless-type MMTV integration site fami


1.00 


73.00 


419564 


U08989 


Hs.91139 


solute carrier family 1 (neuronal/epithe


1.00 


192.00 


419574 


AK001989 


Hs.91165 


hypothetical protein 


1.00 


94.00 


419968 


X04430 


Hs.93913 


interleukin 6 (interferon, beta 2)


61.16 


500.00 


420256 


U84722 


Hs.76206 


cadherin 5, type 2, VE-cadherin (vascula 


0.52 


1.70 


420285 


AA258124 


Hs.293878 


ESTs, Moderately similar to ZN91_HUMAN Z 


1.00 


172.00 


420577 


AA278436 


Hs.186649


ESTs 


1.00 


97.00 


421262 


AA286746 


Hs.9343 


Homo sapiens cDNA FLJ14265 fis, clone PL


1.00 


64.00 


421445 


AA913059 


Hs.104433


Homo sapiens, clone IMAGE:405486B, mRNA 


0.88 


1.51 


421470 


R27496 


Hs.1378


annexin A3


0.05 


11.26 


421476 


AI683243


Hs.97258 


ESTs, Moderately similar to S29539 ribos 


1.00 


73.00 


421563 


NM_006433


Hs.105806


granulysin


0.82 


2.42 


421566 


NM_000399


Hs.1395


early growth response 2 (Krox-20 (Drosop 


5.50 


3137 


421855 


F06504 


Hs.27384 


ESTs, Moderately similar b ALU4_HUMAN A 


1.00 


129.00 


421913 


AI934365 


Hs.1 09439 


osteogtycln (osteoinductive factor, mime 


1.00 


101.00 


421952 


AA300900 


Hs.98849 


ESTs, Moderately similar to AF161511 1 H 


0.60 


63.60 


422232 


D43945 


Hs.1 13274 


transcription factor EC 


1.00 


148.00 


422386 


AF105374 


Hs.1 15830 


heparan sulfate (glucosamine) 3-O-sulfot 


1.40 


3.98 


423168 


R34385 


Hs.1 24940 


GTP-binding protein 


0.34 


3.59 


423196 


AK001B66 

rV\Uv IUUU 


Hs.1 251 39 


hypothetical protein FU 11 004 


0.55 


2.00 


423387 


AJ012074 




vasoactive intestinal peptide receptor 1 


0.09 


2.13 


423424 


AF150241 


Hs.128433 


prostaglandin D2 synthase, hematopoietic 


1.00 


141.00 


423456 


AL110151 


Hs.1 28797 


DKF2P586D0824 protein 


1.00 


66.00 


423896 


292545 




Sushi domain (SCR repeat) containing 


0.73 


1.27 


424027 


AW337575 


Hs^01591 


ESTs 


0.54 


258 


424212 


MM 005814 


Hs.143131 


glycoprotein A33 (transmembrane) 


0.77 


247 


425087 


R62424 


Hs 126059 


ESTs 


1.00 


74.00 


425175 


AF020202 


Hs.1 55001 


UNCI 3 (C. elegansHike 


0.85 


1.96 


425771 


BE561776 


Hs 159494 


Bruton agammaglobulinemia tyrosine kinas 


1.18 


256 


426485 


BE1782B5 


Hs 170056 

no* 1 1 vuju 


Homo sapiens mRNA; cDNA DKFZp586B0220 (f 


1.00 


76.00 


427507 


AF240467 


Hs 179152 


toll-tike receptor 7 


1.00 


63.00 


427618 


NM_000760 


Hs.2175 


colony stimulating factor 3 receptor (gr 


0.60 


219 


427732 


NM_002980 


Hs.2199 


secretin receptor 


0.97 


1.42 


427952 


AA765368 


Hs.293941 


ESTs, Moderately similar to A53959 throm 


1.00 


105.00 


428709 


BE268717 


Hs.104916 


hypothetical protein FU21940 


1.00 


80.00 


428769 


AW207175 


H« 106771 


ESTs 


0.09 


255 


428780 


AI47B578 


Hs 50638 


ESTs 


1.00 


98.00 


428833 


AI928355 


Hs 185805 


ESTs 


1.00 


113.00 


429657 | | Hs.2465 | KIAA0001 gene product; putative G-protei | 1.00 | 52.00
430212 | AA469153 | | gb:mc67f04.s1 NCI_CGAP_Pr1 Homo sapiens | 1.00 | 132.00
430226 | RF94S5S? | Hs.2551 | arfrenefntn faseofaf sifrfeep | 0.11 | 15.60
430376 | | Hs.12532 | chromosome 1 open reading frame 21 | 1.00 | 103.00
430414 | AW365665 | Hs.120388 | ESTs | 0.S) | 6.96
430656 | AA482900 | Hs.162080 | ESTs | 1.00 | 70.00
430843 | AI734149 | Hs.119514 | ESTs | 1.00 | 90.00
430998 | AF128847 | Hs.204038 | indolethylamine N-methyltransferase | 0.29 | 1.84
431217 | NM_013427 | Hs.250830 | Rho GTPase activating protein 6 | 1.00 | 79.00
431921 | N46466 | Hs.58879 | ESTs | 0.91 | 1.67
432176 | AW090386 | Hs.112278 | arrestin, beta 1 | 0.66 | 2.63
432203 | AA305746 | Hs.49 | macrophage scavenger receptor 1 | 1.00 | 76.00
432231 | AA339977 | Hs.274127 | CLST 11240 protein | 0.46 | 1.46
432485 | N90866 | Hs.276770 | CDW52 antigen (CAMPATH-1 antigen) | 0.79 | 2.25
432522 | D11466 | Hs.51 | jJJlUO^/l lOUUjflU U/OLtUJ y IJ VOJ4j WKfcW \K° | 1.93 | 4.83
432596 | AJ224741 | Hs.278461 | matrilin 3 | 0.04 | 5.79
432850 | X87723 | Hs.3110 | angiotensin receptor 2 | 1.00 | 167.00
433138 | | Hs.59729 | | 0.04 | 9.16
433583 | AI732637 | Hs.277901 | ESTs | 1.00 | 91.00
433£Afl | HlUUOOl £ | Hs 1^53Sfi | ESTs | 120.16 | 315.00
434445 | AI340306 | Hs.11782 | ESTs | 0.60 | 1.84
435498 | AW840171 | Hs.265398 | ESTs, Weakly similar to transformation-r | 1.00 | 128.00
 | U29690 | Hs.37744 | Homo sapiens beta-1 adrenergic receptor | 1.00 | 108.00
436061 | AI248584 | Hs.190745 | Homo sapiens cDNA: FLJ21326 fis, clone C | 1.00 | 91.00
437157 | BE048860 | Hs.120655 | ESTs | 1.00 | 87.00
437207 | T27503 | Hs.15929 | hypothetical protein FLJ12910 | 1.00 | 105.00
437311 | AA370041 | Hs.9456 | SWI/SNF related, matrix associated, acti | 1.00 | 71.00
437439 | H29796 | Hs.269522 | ESTs | 1.00 | 115.00
438199 | AW016531 | Hs.122147 | ESTs | 1.00 | 60.00
439551 | W72062 | Hs.11112 | ESTs | 0.30 | 3.10
440515 | AJ131245 | Hs.7239 | SEC24 (S. cerevisiae) related gene famil | 1.00 | 77.00
440887 | AI799488 | Hs.135905 | ESTs | 1.00 | 85.00
441025 | AA913880 | Hs.176379 | ESTs | 1.00 | 82.00
441384 | AA447849 | Hs.288660 | Homo sapiens cDNA: FLJ22182 fis, clone H | 0.79 | 1.89
441735 | AI738675 | Hs.127346 | ESTs | 1.00 | 75.00
442200 | AW590572 | Hs.235768 | ESTs | 0.78 | 5.83
442832 | AW205560 | Hs.253569 | ESTs | 0.03 | 10.88
442957 | AI949952 | Hs.49397 | ESTs | 1.00 | 70.00
443282 | T47764 | Hs.132917 | ESTs | 1.00 | 197.00
443547 | AW271273 | Hs.23767 | hypothetical protein FLJ12666 | 1.00 | 253.00
443951 | F13272 | Hs.111334 | ferritin, light polypeptide | 0.55 | 2.09
444330 | AI597655 | Hs.49265 | ESTs | 1.00 | 90.00
444515 | AW204908 | Hs.169979 | ESTs | 1.00 | 04.UU
445769 | AI74U71 | Hs.23666 | ESTs | 0.02 | 
445908 | R13580 | Hs.13436 | Homo sapiens clone 24425 mRNA sequence | 1.00 | 97.00
446291 | BE397753 | Hs.14623 | interferon, gamma-inducible protein 30 | 0.93 | i.oy
446917 | AI347863 | Hs.156672 | ESTs | 1.00 | lUO.UU
447261 | NM_006691 | Hs.17917 | extracellular link domain-containing 1 | 0.40 | 47.00
447432 | AW958473 | Hs.301957 | nudix (nucleoside diphosphate linked moi | 1.00 | 100.00
447482 | AB033059 | Hs.18705 | KiAAuoJ protein | U.U!> | 
447997 | H00656 | Hs.29792 | ESTs, Weakly similar to 1 3BQZZ hypotheti | 0.02 | c A 1 )
448299 | AA497044 | HSxUoO/ | hypothetical protein riJiujs*; | 1.00 | 7q nn
448782 | AL050295 | | KiAAUrOo protein | | 1.00
450575 | NM_005859 | Hs.29117 | purine-rich element binding protein A | 0.17 | 1 I.J J
450584 | AAU4U4UJ | nS.tHM/i | ESTs | 1.00 | 04 nn
450693 | AW450461 | HS20o96o | ESTs | 1.00 | m nn
450715 | AI256484 | Hs.31570 | ESTs, Weakly similar to kjaai am protein | 1.00 | 159.00
451103 | R52804 | Hs.25956 | uKFZroMDZUo protein | 1.00 | OO.UU
451220 | AF124251 | Hs.26054 | novel SH2-containing protein i | U.OU | l.OU
451668 | Z43948 | Hs.326444 | cartilage acidic protein 1 | 0.54 | 1.31
452197 | AW023595 | Hs.232048 | ESTs | 1.00 | R7 nn
452331 | AA598509 | Hs.29117 | purine-rich element binding protein A | 4.53 | 11.07
452353 | C18825 | Hs.29191 | epithelial membrane protein 2 | 0.72 | 
453049 | BE537217 | Hs.30343 | ESTs | 1.00 | OO.UU
453107 | NM_016113 | Hs.279746 | vanilloid receptor-like protein 1 | 0.63 | l./U
453355 | AW295374 | Hs.31412 | Homo sapiens cDNA FLJ11422 fis, clone HE | 1.00 | 104.UU
453390 | AA862496 | Hs.28482 | ESTs | 1.00 | 72.00
453531 | AA417940 | | ESTs, Weakly similar to JC5795 CDEP prot | 1.00 | 68.00
454741 | BE154396 | | gb:CM2-HT0342-091299^5Ob05 HT0342 Homo | 0.57 | 1.89
456579 | AA287827 | Hs.284205 | up-regulated by BCG-CWS | 1.00 | 82.00
456672 | AK002016 | HsAWZT | Homo sapiens, clone MGC:16327, mRNA, com | 0.79 | 1.96
457400 | AF032906 | Hs.252549 | cathepsin Z | 1.03 | 3.25
457718 | F18572 | Hs.22978 | ESTs, Weakly similar to ALU4_HUMAN ALU S | 1.00 | 113.00
459696 | F03027 | | gb:HSC1KA072 normalized infant brain cDN | 1.00 | 544.00

TABLE 10B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession
408074 103584.1 R20723 AA263003 AA333976 AA334725 AA334151 AW955490 AA310513 AI810530 D31302 AW134597 AA830127 AA046953 AI668930
C06094 AW104534
411667 1253334.1 BE160198 AW935898 T11520 AW935930 AW856073 AW861034
413533 1375344.1 BE146973 BE146972 BE147042 BE147018 BE146783 BE147020 BE146781 BE147019 BE146766 BE147021 BE146952 BE146767 BE147044
BE146797 BE146776 BE146985 BE146793 BE146768 BE146771 BE146954 BE146760 BE147048 BE147025 BE147030
423387 22779.1 AJ012074 U11087 L13288 X75299 L20295 AW630780 H14880 T28037 AI872991 R72136 AW449839 T81622 T79697 T29519 R94105 T83923
R73300 AI797007 R73390 AA961010 H74168 AI689932 BE045543 AI808418 AI508912 AI806573 AW884084 AW872978 AW872985 AA565655
AI022915 R50647 R73210 H45098 R46451 AW166269 T71132 AI264547 R52146 AI304920 R73391 AW884059 AW884085 H73241 T60038
T79612 R73145 R50549 AI094557 AI668793 R72302 AI564366 W01956 AA418962 W32571 R72840 H45409 R72085 R46356 R46758
AA508805 AA418798 T83751 R94072 T16182 AA928785 AA903896
423696 23112.1 Z92546 AA330586 AI570568 AW341487 AI827050 AW298668 AI792189 AI016693 AI733599 AI572251 AI672488 AW193262 AI244716
AI864375 AI206100 AA912444 AI269365 AI640254 AW772466 AI867336 AA627604 H16914 AA358477 AA338009
430212 314437.1 AA469153 AI718503 AA469225
436532 421802_1 AA721522 AW975443 T93070
453531 97026_1 AA417940 AA036735 T07025
454741 1232559_1 BE154398 AW817959 BE154393



TABLE 10C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position
400754 7331445 Plus 144559-144684
401045 8117619 Plus 90044-90184,91111-91345
401083 3242744 Plus 33192-33360
402474 7547175 Minus 53526-53628,55755-55920,57530-57757
402808 6456148 Minus 114964-115136,115451-115585,115931-116047,117666-117771,118004-118102
403021 7547270 Plus 120799-120966
403421 9665041 Minus 126609-126773,139988-140205
403438 9719679 Plus 90792-90938
403687 7387384 Plus 9009-9534
403764 7717105 Minus 118692-118853
404277 1834458 Minus 91665-91946
404288 2769644 Plus 3512-3691
404394 3135305 Minus 37121-37205,37491-37762,41053-41140,41322-41593,41773-41919
404518 8151988 Plus 84494-840)3
404916 7341826 Plus 91057-91188
405106 8079395 Minus 80877-81418
405257 733310 Plus 73121-73273
405381 6006920 Minus 7635-8054
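
Each Nt_position entry in Table 10C is a comma-separated list of start-end pairs, one pair per predicted exon. A minimal Python sketch of reading such an entry follows. The function names are illustrative and not part of the specification, and treating both endpoints as inclusive is an assumption, since the table does not state a coordinate convention.

```python
def parse_nt_position(field):
    """Split an Nt_position entry into a list of (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons):
    # Assumption: both endpoints are inclusive (not stated in the table).
    return sum(end - start + 1 for start, end in exons)


# Nt_position entry listed for Pkey 401045 in Table 10C.
exons_401045 = parse_nt_position("90044-90184,91111-91345")
print(exons_401045)
print(total_exon_length(exons_401045))
```

Entries whose digits are not fully legible in the source would need manual review before being parsed this way.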
TABLE 11A: Genes Distinguishing Adenocarcinoma from Other Lung Diseases and Normal Lung

Table 11A shows about 84 genes upregulated in lung adenocarcinomas relative to other lung tumors, non-malignant lung disease, and normal lung. These genes were selected
from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array.

Table 11B shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 11A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 11C shows the genomic positioning for those Pkeys lacking Unigene IDs and accession numbers in Table 11A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the
average of normal lung samples

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples
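
The R1 and R2 columns defined above are both ratios of group averages. The following Python sketch shows one way such a ratio could be computed. The function name and the sample values are hypothetical and are not taken from the tables, and any normalization applied to the array data before averaging is not described here and is not modeled.

```python
from statistics import mean


def expression_ratio(sample_values, normal_values):
    """Average of a sample group divided by the average of the normal lung samples."""
    normal_average = mean(normal_values)
    if normal_average == 0:
        raise ValueError("average of normal lung samples is zero")
    return mean(sample_values) / normal_average


# Hypothetical expression values for a single probeset (illustration only).
tumor_values = [120.0, 80.0, 100.0]
disease_values = [15.0, 5.0]
normal_values = [10.0, 10.0]

r1 = expression_ratio(tumor_values, normal_values)    # lung tumors / normal lung
r2 = expression_ratio(disease_values, normal_values)  # non-malignant disease / normal lung
print(round(r1, 2), round(r2, 2))
```

With these made-up values the probeset would show a tumor average ten times the normal average and a non-malignant disease average equal to the normal average.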



Pkey | ExAccn | UnigeneID | Unigene Title | R1 | R2
403329 | | | Target Exon | 1.00 | 61.00
406399 | | | NM_003122*:Homo sapiens serine protease | 1.00 | 39.00
406690 | M29540 | Hs29fl5?fi | carcinoembryonic antigen-related cell ad | 226.37 | 350.00
407869 | AI837976 | | hypothetical protein Ft J13R19 | 0.77 | 1.18
407881 | | no.*tusoo | heparan sulfate (glucosamine) 3-O-sulfot | 1.00 | 10.00
HU09U0 | RF9Qfi297 | | Qprinpflhmnnrrifi Unp^P 1 1 | 7.76 | 1.00
409103 | AF251237 | Hs.112208 | XAGE-1 protein | 80.44 | 40.00
409187 | AT IDfOvJv | rio.OUoOO | carbamoyl-phosphate synthetase 1, mitoch | 1.00 | 1.00
409269 | A/V) i U339 | Hs.22972 | hypothetical protein FLJ13352 | 1.00 | 1.00
410076 | T05387 | Hs.7991 | ESTs | 1.12 | 1.50
410102 | fVt X CH09\K) | Hs.279727 | Homo sapiens cDNA FLJ14035 fis, clone HE | 9.89 | 1.00
410399 | RPflRRRRQ | | synuclein, gamma (breast cancer-specific | 0.92 | 1.06
411908 | L27943 | Hs.72924 | cytidine deaminase | 1.00 | 1.00
412612 | | Hs.74131 | arylsulfatase E (chondrodysplasia puncta | 1.02 | 1.03
414075 | U11862 | Hs.75741 | amiloride binding protein 1 (amine oxida | 0.84 | 1.07
 | AW2Q11RR | Hs.41295 | ESTs, Weakly similar to MUC2_HUMAN MUCIN | 3.67 | 1.00
417542 | J04129 | | progestagen-associated endometrial prote | 1.28 | 1.35
419183 | U60S69 | | cytochrome P450, subfamily XXIV (vitamin | 1.00 | 1.00
419502 | ai ro7fi7fu | | | 13.05 | 115.00
419631 | AW1RR117 | Hr ^1*14 | | 1.00 | 13.00
420931 | AFA441Q7 | He inn4.li | small inducible cytokine B subfamily (Cy | 1.00 | 8.00
 | UQ7B70 | He iny)fi7 | lysyl oxidase | 1.00 | 15.00
421190 | U95031 | Uo 1024*9 | mucin 5, subtype B, tracheobronchial | 1.17 | 1.55
 | U/OJO/ | ns. iiwDor | solute carrier family 1 (glutamate trans | 1.46 | 1.76
141010 | t i iojy | ns. iuoo9£ | Oair*MC oipriar&f O-MaJyUf aJ iMarooc 1 I | 1.00 | 3.00
 | | | trefoil factor 1 (breast cancer, estroge | 1.23 | 1.00
422026 | U80736 | no. 1 1 uotu | trinucleotide repeat containing 9 | 1.00 | 52.00
/i 90 note | AIflfiRR79 | | hypothetical protein PI I777/M | 4.37 | 2.34
422311 | rvvl JO 19 | Ue 11/Q4R | cytokine receptor-like factor 1 | 1.15 | 1.78
422867 | L32137 | Hs.1584 | cartilage oligomeric matrix protein (pse | 1.69 | 3.17
*W9*t7 i | | Ue 19Qfl57 | breast carcinoma amplified sequence 1 | 48.13 | 72.00
423554 | M90516 | Hs.1674 | yuumuiwy^n UHwaovyi mop* hup uoicthhihi | 1.00 | 50.00
424502 | Mr /moo | no. ihoocm | ienosin | 1.00 | 1.00
A') AC. A A | niOOiwU | Hs.150403 | dopa decarboxylase (aromatic L-amino aci | 1.00 | 59.00
424905 | NM_002497 | Hs.153704 | NIMA (never in mitosis gene a)-related k | 21.35 | 1.00
424950 | BE245380 | Hs.153952 | 5' nucleotidase (CD73) | 1.00 | 1.00
425523 | AB007948 | Hs.158244 | KIAA0479 protein | 1.00 | 35.00
426230 | AA367019 | Hs.241395 | protease, serine, 1 (trypsin 1) | 1.00 | 83.00
427701 | AA411101 | Hs.243886 | nuclear autoantigenic sperm protein (his | 7.41 | 34.00
428585 | AB007863 | Hs.185140 | KIAA0403 protein | 1.00 | 6.00
428758 | AA433988 | Hs.98502 | hypothetical protein FLJ14303 | 1.06 | 1.13
429170 | NM_001394 | Hs.2359 | dual specificity phosphatase 4 | 16.18 | 105.00
429263 | AA019004 | Hs.198396 | ATP-binding cassette, sub-family A (ABC1 | 1.07 | 1.00
429610 | AB024937 | Hs.211092 | LUNX protein; PLUNC (palate lung and nas | 1.59 | 1.69
430508 | AI015435 | Hs.104637 | ESTs | 4.75 | 7.27
430985 | AA490232 | Hs.27323 | ESTs, Weakly similar to I78885 serine/th | 0.94 | 1.26
431548 | AI834273 | Hs.9711 | novel protein | 5.66 | 15.00
431566 | AF176012 | Hs.260720 | J domain containing protein 1 | 49.75 | 37.00
431986 | AA536130 | Hs.149018 | Novel human gene mapping to chomosome 20 | 1.19 | 1.47
432375 | BE536069 | Hs.2962 | S100 calcium-binding protein P | 1.65 | 1.06
432677 | NM_004482 | Hs.278611 | UDP-N-acetyl-alpha-D-galactosamine:polyp | 1.00 | 48.00
433556 | W56321 | Hs.111460 | calcium/calmodulin-dependent protein kin | 1.00 | 19.00
433819 | AW511097 | Hs.112765 | ESTs | 3.71 | 8.00
434001 | AW950905 | Hs.3697 | serine (or cysteine) proteinase inhibito | 29.31 | 72.00
434424 | AJ811202 | Hs.325335 | Homo sapiens cDNA: FLJ23523 fis, clone I | 1.00 | 64.00
434792 | AA649253 | Hs.132458 | ESTs | 8.52 | 44.00
436217 | T53925 | Hs.107 | fibrinogen-like 1 | 57.97 | 31.00
436749 | AA584890 | Hs.5302 | lectin, galactoside-binding, soluble, 4 | 1.10 | 1.41
436972 | AA284679 | Hs.25640 | claudin 3 | 1.59 | 1.46
437866 | AA156781 | | metallothionein 1E (functional) | 3.62 | 101.00
437935 | AW939591 | Hs.5940 | mucin 13, epithelial transmembrane | 1.60 | 1.39
438915 | AA280174 | Hs.285681 | Williams-Beuren syndrome chromosome regi | 1.00 | 1.00
439451 | AF086270 | Hs.278554 | heterochromatin-like protein 1 | 23.28 | 52.00
wy/oy | ALOOtUOO | 67709 | Homo sapiens mRNA full length insert cDN | 1.00 | 21.00
441(101 | A1HU004 | | fibrinogen, B beta polypeptide | 1.41 | 99.00
4410/ i | DtiTlOiOy | | ESTs | 22.03 | 1.00
AA3MA | AvDOOOOO | Hs.7645 | | 1.00 | 16.00
4400lJ | AAoVOOf £ | nS.3 09O 1 | | 1.90 | 1.99
wossji | Mil fVW^fl | Hs.10082 | potassium intermediate/small conductance | 5.71 | 6.87
4440 /U | nooo/o | ns.oozsoo | hypothetical protein iwjuoo/u | 1.30 | 
444331 | AVOOZUOO | 115.' Oi 10 | general uanscnpuon racujj iiift | i no | ka nn
446109 | AW1fiR0£7 | | ESTs | 1.00 | 1.00
•WOIOO | AAUiDDOU | ns.£O£0& | Homo sapiens cDNA FLJ13603 fis, clone PL | 1.00 | 36.00
44A4RQ | | ns. o i to | homogentisate 1,2-dioxygenase (homogenti | 1.00 | 11.00
 | AAWRQflClA | Hs.76277 | Unmn cantons rlnnp Mf^P'WHi mRMA ravnn | 1.24 | 1.16
 | | Hs.18791 | hvnrrfhntir^ nrfliptn PI J2A6D7 | 1.23 | 1.63
446243 | AW369771 | Hs.52620 | integrin, beta 8 | 15.84 | 1.00
448844 | AI581519 | Hs.177164 | ESTs | 1.00 | 31.00
449444 | AW818436 | Hs.23590 | solute carrier family 16 (monocarboxylic | 1.00 | 83.00
451807 | W52854 | | hypothetical protein FLJ23293 similar to | 1.55 | 35.00
452689 | F33868 | Hs.284176 | transferrin | 1.54 | 1.44
453392 | U23762 | Hs.32964 | SRY (sex determining region Y)-box 11 | 1.00 | 16.00
453464 | AJ884911 | Hs.32989 | receptor (calcitonin) activity modifying | 1.55 | 2.45
453735 | AI066629 | Hs.125073 | ESTs | 1.01 | 1.30

TABLE 11B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession
410399 11995_1 BE068889 BE068862 AF044311 AF017256 NM_003087 AF037207 AF010126 AA633976 AA872836 BE298825 BE299889 AJ016464 AI684600
AI936527 AA804675 AA394097 AI139933 AA946605 BE171313 AA722407 AA293803 AI468480 AA056035 AA055968 AW796957 AI637713
AA410737 H49348 AA485472 AA411094 AA235594 AA402624 AA443633 AW452137 AA421708 AW265211 AI493266 AA365132 AW966044
419502 18535_1 AU076704 T74854 T74860 T72098 T73265 T73873 T69180 T74658 T58786 T60385 T73410 T68781 T67845 T67593 T73952 T67864 T60630
T68367 T68401 T53959 T72360 T72099 T60377 T58961 T71712 T72821 T64738 T74645 T72037 T68688 T72063 T73258 T72826 T64242
T68220 T74673 T71800 T68355 T61227 T62738 T69317 T53850 T64692 T73768 T73962 T73382 T68914 T70975 T73400 T60531 T73277
T73203 T70498 T61409 T58925 NM_000508 M64982 T88301 T73729 T69445 T60424 T67922 T67736 T68716 T67755 T74765 T73819 T58719
T74756 T60477 T74863 T61109 T68329 T58850 T71857 T73425 T53736 T68607 T58898 T64309 T72031 T72079 T64305 T71908 T68107
T71916 T73787 T56035 T64425 T71870 T60476 T61376 T67820 T71895 T41006 T69441 T68170 T74617 T71958 T69440 T61875 R05795
H48353 T71914 T53939 T64121 AA693996 T72525 T67779 T66078 AA011465 AA345378 AV654847 AV654272 AV656001 AI064740 T82897
N33594 AA344542 AW805054 AI207457 T61743 AA026737 H94389 AA382695 AA918409 T68044 S82092 T39959 AJ017721 AA312395
AA312919 T40155 H66239 AV652989 H38728 R98521 AV655200 R95750 W03250 W00913 AA344136 AV660126 R97923 AA343596
AW470774 AV651256 N54417 AA812862 AW182929 AI111192 H61463 H72060 AA344503 H38639 AI277511 AV661108 AI207525 T47810
AA235252 T27853 T47778 R95746 H70620 AA701463 AW827166 R98475 C20925 AV657267 T71959 T71313 T73920 T73333 T61618 T69293
T69283 H3931 T72178 T72456 AV645639 AV653476 T72957 T72300 T58906 T71457 T70494 T72956 T70495 T68267 T74407 T85778
AA344726 T27854 T74485 T74101 T73868 T71518 T72304 AA343853 T73909 T68070 T72065 H72149 T73493 T73495 AV645993 R02293
T70475 T64751 AA344441 AA343657 AA345732 AA344328 AI110639 AA344603 AF063513 T64696 T68516 T72223 T60507 T67633 R29500
T72517 R02292 T60599 T69206 T70452 T74677 R29366 T61277 T74914 T60352 R29675 T74843 AV645792 AA344408 T69197 T72057
T69368 T69358 T68258 AV650429 T73341 T61702 T74598 T40095 K02272 T40106 AA343045 AA341908 AA341907 AA342807 AA341954
T53747 T72042 T62764 AIO&«99 AA3^0^ T67832 T72440 T7177^ T69108 T72449 T69167 T71289 T63251 AV654844 T64375
AA345234 T67598 AA011414 T68035 H48262 AI207557 T68219 W86031 T69081 T64232 R93196 T62136 AV650539 H67459 T72978
AA344583 T60362 H58121 T95711 T72803 T68055 T71715 R29036 T72793 T69122 T64595 T62888 T69139 T68291 T64652 T67971 T46862
AA693592 AJ248502 R29454 T64764 T57001 T73052 ni429 T51176 T586tt AV6S^14 H904^^ T72512 T53835
T67837 T73317 T74273 T69420 T68245 T74380 T67862 T74474 T56068
421582 2041.1 AI910275 X00474 X52003 X05030 NM_003225 AA314326 AA308400 AA506787 AA314825 AI571948 AA507595 AA614579 AA587613 R83818
AA568312 AA614409 AA307578 AI925552 AW950155 AI910083 M12075 BE074052 AW004668 AA578674 AA582084 BE074053 BE074126
BE074140 AA514776 AA588034 BE074051 BE074068 AW009769 AW050690 AA858276 R55389 AI001051 AW050700 AW750216 AA614539
BE074045 A13^7 AW602S)3 BE073575 A12025£ BE076078 AI909749 R55292
437866 44433_2 AA156781 AW293839 U52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992
AA837481 AW468444 BE185091 AW468002 AA687333 AA811830 AA581806 AI866686 AI572124 AA043777 AA040926 D20160 AI536733
AA812489 AW874142 AJ471883 W84421 AA156850
451807 8865_1 W52854 AL117600 BE208116 BE208432 BE205239 BE082291 AW953423 AA351619 BE180648 BE140560 W60080 AA865478 N90291
AW450652 AW449519 AA993634 AI806539 AA351618 AW449522 AI827626 AA904788 AA380381 AA886045 AA774409 BE003229 Z41756



TABLE 11C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA
sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position
403329 8516120 Plus 96450-96598
406399 9256288 Minus 63448-63554
TABLE 12A: Genes Distinguishing Squamous Cell Carcinoma from Other Lung Diseases and Normal Lung

Table 12A shows about 72 genes upregulated in squamous cell carcinomas of the lung relative to other lung tumors, non-malignant lung disease, and normal lung. These genes
were selected from about 59580 probesets on the Eos/Affymetrix Hu03 Genechip array.

Table 12B shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 12A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 12C shows the genomic positioning for those Pkeys lacking Unigene IDs and accession numbers in Table 12A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the
average of normal lung samples

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples



Pkey | ExAccn | UnigeneID | Unigene Title | R1 | R2
400289 | X07820 | Hs.2258 | matrix metalloproteinase 10 (stromelysin | 132.45 | 4.00
400666 | | | NM_002425:Homo sapiens matrix metallopro | 3.26 | 3.22
401780 | | | NM_005557*:Homo sapiens keratin 16 (foca | 26.47 | 10.50
401781 | | | Target Exon | ia33 | 4.61
401785 | | | NM_002275*:Homo sapiens keratin 15 (KRT1 | 4.13 | 2.70
401994 | | | Target Exon | 61.84 | 47.00
402075 | | | ENSP00000251056*:Plasma membrane calcium | 1.00 | 1.00
404996 | | | Target Exon | 1.00 | 1.00
407839 | AA045144 | Hs.161566 | ESTs | 173.91 | 108.00
408000 | L11690 | Hs.620 | bullous pemphigoid antigen 1 (230/240kD) | 151.17 | 8.00
408522 | AI541214 | Hs.46320 | Small proline-rich protein SPRK [human, | 1.98 | 1.24
410561 | BE540255 | Hs.6994 | Homo sapiens cDNA: FLJ22044 fis, clone H | 10.04 | 1.00
415091 | AL044872 | Hs.77910 | 3-hydroxy-3-methylglutaryl-Coenzyme A sy | 1.00 | 30.00
415817 | U88967 | Hs.78867 | protein tyrosine phosphatase, receptor-t | 24.30 | 1.00
416658 | U03272 | Hs.79432 | fibrillin 2 (congenital contractural ara | 53.29 | 51.00
417034 | NM_006183 | Hs.80962 | neurotensin | 1.00 | 1.00
417366 | BE185289 | Hs.1076 | small proline-rich protein 1B (cornifin) | 8.97 | 3.27
418663 | AK001100 | Hs.41690 | desmocollin 3 | 112.17 | 19.00
418678 | NM_001327 | Hs.87225 | cancer/testis antigen | 1.18 | 1.10
419121 | AA374372 | Hs.89626 | parathyroid hormone-like hormone | 1.00 | 1.00
420783 | AI659838 | Hs.99923 | lectin, galactoside-binding, soluble, 7 | 3.04 | 1.25
421773 | W59233 | Hs.112457 | ESTs | 1.12 | 1.14
421948 | L42583 | Hs.334309 | keratin 6A | 51.83 | 20.25
421976 | AJ243662 | Hs.110196 | NICE-1 protein | 1.01 | 0.91
422158 | L10343 | Hs.112341 | protease inhibitor 3, skin-derived (SKAL | 2.37 | 1.10
422440 | NM_004812 | Hs.116724 | aldo-keto reductase family 1, member B10 | 47.53 | 32.00
423634 | AW959908 | Hs.1690 | heparin-binding growth factor binding pr | 76.02 | 1.00
423725 | AJ403108 | Hs.132127 | hypothetical protein LOC57822 | 4.20 | 1.00
423738 | AB002134 | Hs.132195 | airway trypsin-like protease | 10.14 | 51.00
424012 | AW358377 | Hs.137569 | tumor protein 63 kDa with strong homolog | 233.42 | 68.00
424046 | AF027866 | Hs.138202 | serine (or cysteine) proteinase inhibito | 1.00 | 1.00
424098 | AF077374 | Hs.139322 | small proline-rich protein 3 | 137.82 | 54.00
424834 | AK001432 | Hs.153408 | Homo sapiens cDNA FLJ10570 fis, clone NT | 56.19 | 12.00
425650 | NM_001944 | Hs.1925 | desmoglein 3 (pemphigus vulgaris antigen | 33.45 | 1.00
427099 | AB032953 | Hs.173560 | odd Oz/ten-m homolog 2 (Drosophila, mous | 4.24 | 17.00
427335 | AA448542 | Hs.251677 | G antigen 7B | 51.83 | 4.00
428162 | BE385042 | Hs.293317 | ESTs, Weakly similar to GGC1_HUMAN G ANT | 1.00 | 1.00
428645 | AA431400 | Hs.98729 | ESTs, Weakly similar to 2017205A dihydro | 1.00 | 16.00
428748 | AW593206 | Hs.98785 | Ksp37 protein | 1.00 | 87.00
429259 | AA420450 | Hs.292911 | ESTs, Highly similar to S60712 band-6-pr | 2.01 | 1.18
429538 | BE182592 | Hs.11261 | small proline-rich protein 2A | 4.43 | 2.90
429903 | AL134197 | Hs.93597 | cyclin-dependent kinase 5, regulatory su | 11.80 | 1.00
430486 | BE062109 | Hs.241551 | chloride channel, calcium activated, fam | 12.28 | 41.00
430890 | X54232 | Hs.2699 | glypican 1 | 1.58 | 1.40
431009 | BE149762 | Hs.48956 | gap junction protein, beta 6 (connexin 3 | 60.25 | 28.00
431846 | BE015924 | Hs.271580 | uroplakin 1B | 4.49 | 2.51
433091 | Y12642 | Hs.3165 | lymphocyte antigen 6 complex, locus D | 1.20 | 1.09
434360 | AW015415 | Hs.127780 | ESTs | 40.98 | 27.00
434880 | U02388 | Hs.101 | cytochrome P450, subfamily IVF, polypept | 1.00 | 1.00
435505 | AF200492 | Hs.211238 | interleukin-1 homolog 1 | 1.00 | 38.00
435793 | AB037734 | Hs.4993 | KIAA1313 protein | 23.68 | 42.00
436511 | AA721252 | Hs.291502 | ESTs | 16.76 | 14.00
438403 | AA806607 | Hs.292206 | ESTs | 1.00 | 1.00
439285 | AL133916 | | hypothetical protein FLJ20093 | 46.73 | 139.00
439606 | W79123 | Hs.58561 | G protein-coupled receptor 87 | 33.61 | 1.00
439670 | AF088076 | Hs.59507 | ESTs, Weakly similar to AC004858 3 U1 sm | 1.00 | 1.00
439706 | AW872527 | Hs.59761 | ESTs, Weakly similar to DAP1_HUMAN DEATH | 86.55 | 11.00
440325 | NM_003812 | Hs.7164 | a disintegrin and metalloproteinase doma | 62.88 | 147.00
441525 | AW241867 | Hs.127728 | ESTs | 1.53 | 1.42
443162 | T49951 | Hs.9029 | DKFZP434G032 protein | 31.11 | 38.00
444378 | R41339 | Hs.12569 | ESTs | 1.00 | 1.00
446292 | AF081497 | Hs.279682 | Rh type C glycoprotein | 1.55 | 1.26
447078 | AW885727 | Hs.9914 | ESTs | 47.24 | 24.00
447342 | AI199268 | Hs.19322 | Homo sapiens, Similar to RIKEN cDNA 2010 | 23.63 | 1.00
449003 | X76342 | Hs.389 | alcohol dehydrogenase 7 (class IV), mu o | 1.00 | 1.00
449101 | AA205847 | Hs.23016 | G protein-coupled receptor | 2.58 | 27.00
450832 | AW970602 | Hs.105421 | ESTs | 25.17 | 36.00
452240 | AI591147 | Hs.61232 | ESTs | 13.42 | 1.00
453317 | NM_002277 | Hs.41696 | keratin, hair, acidic, 1 | 1.19 | 1.27
453830 | AA534296 | Hs.20953 | ESTs | 24.92 | 25.00
454098 | W27953 | Hs.292911 | ESTs, Highly similar to S60712 band-6-pr | 1.25 | 1.11
455601 | AI368680 | Hs.816 | SRY (sex determining region Y)-box 2 | 206.11 | 1.00
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TABLE 12B 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession

439285 47065_1 AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 AI867454 N54784 AI655270 AW21279 AW014882
AA775552 N62351 N59253 AA626243 AI341407 BE175639 AA456968 AI358918 AA457077

TABLE 12C 

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

400666 8118496 Plus 17982-18115,20297-20456
401780 7249190 Minus 28397-28617,289»)-29045,29135-29296,29411-29567,29705-29787,30224-30573
401781 7249190 Minus 83215-83435,83531-83656,83740^330 1,84237-84393,84955-85037,86290-86814
401785 7249190 Minus 165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942
401994 4153858 Minus 42904-43124,43211-43336,44607-44763,45199-45281,46337-46732
402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076
404996 6007890 Plus 37999-38145,38652-38998,39727-3987*,40557-40674,42351-42450
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TABLE 13A: Genes Distinguishing Non-Malignant Lung Disease from Lung Tumors and Normal lung 

Table 13A shows about 23 genes upregulated in non-malignant lung disease relative to lung tumors and normal lung. These genes were selected from about 53680 probesets on
the Eos/Affymetrix Hu03 Genechip array.

Table 13B shows the accession numbers for those Pkey's lacking UnigeneID's for Table 13A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 13C shows the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in Table 13A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the
average of normal lung samples

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples
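
By way of illustration only, the R1 and R2 ratios defined above can be sketched as a ratio of group averages. The function name `expression_ratio` and the intensity values below are hypothetical and are not taken from the array data; the sketch assumes one expression value per sample for a single probeset.

```python
from statistics import mean


def expression_ratio(group, normal):
    """Average of one sample group divided by the average of normal lung samples."""
    normal_avg = mean(normal)
    if normal_avg == 0:
        raise ValueError("normal-lung average is zero; ratio is undefined")
    return mean(group) / normal_avg


# Hypothetical intensities for a single probeset (illustrative values only).
tumor = [10.0, 14.0, 12.0]         # lung tumor samples
disease = [300.0, 340.0, 320.0]    # non-malignant lung disease samples
normal = [10.0, 12.0, 14.0]        # normal lung samples

r1 = expression_ratio(tumor, normal)    # R1: tumors relative to normal lung
r2 = expression_ratio(disease, normal)  # R2: non-malignant disease relative to normal lung
```

Under this reading, a probeset with R1 near 1 and a large R2, as in most rows of Table 13A, is elevated in non-malignant lung disease but not in tumors.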



Pkey    ExAccn     UnigeneID  Unigene Title                             R1      R2

408562  AI436323   Hs.31141   Homo sapiens mRNA for KIAA1568 protein,   1.00    230.00
409031  AA376836   Hs.76728   ESTs                                      1.00    128.00
412372  R55998     Hs.285243  hypothetical protein FLJ22029             1.00    173.00
415910  U20350     Hs.78913   chemokine (C-X3-C) receptor 1             1.00    145.00
417511  AL049176   Hs.82223   chordin-like                              1.00    179.00
418819  AA228776   Hs.191721  ESTs                                      1.00    140.00
422060  R20893     Hs.325823  ESTs, Moderately similar to ALU5_HUMAN A  1.00    156.00
424585  AA464840   Hs.131987  ESTs                                      1.00    167.00
426753  T69832     Hs.170278  ESTs                                      1.00    141.00
429496  AA453800   Hs.192793  ESTs                                      1.00    138.00
430719  AA488988   Hs.293796  ESTs                                      1.00    133.00
431089  BE041395              ESTs, Weakly similar to unknown protein   23.32   941.00
431385  BE178536   Hs.11090   membrane-spanning 4-domains, subfamily A  1.00    157.00
431728  NM_007351  Hs.268107  multimerin                                1.00    157.00
436532  AA721522              gb:nv54h12.r1 NCI_CGAP_Ew1 Homo sapiens   1.00    216.00
437960  AI669586   Hs.222194  ESTs                                      1.00    147.00
438202  AW169287   Hs.22588   ESTs                                      1.00    141.00
441499  AW298235   Hs.101689  ESTs                                      1.00    167.00
444513  AL120214   Hs.7117    glutamate receptor, ionotropic, AMPA 1    1.00    151.00
448253  H25899     Hs.201591  ESTs                                      1.00    141.00
453636  R67837     Hs.169872  ESTs                                      1.00    116.00
458332  AKXJ0341   Hs.220491  ESTs                                      1.00    moo
459587  AA031956              gb:zk15e04.s1 Soares_pregnant_uterus_NbH  1.00    154.00



TABLE 13B 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number

Accession: Genbank accession numbers

Pkey CAT Number Accession

431089 327825_1 BE041395 AA491826 AA621946 AA715980 AA666102
436532 421802_1 AA721522 AW975443 T93070



TABLE 13C 
Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076
TABLE 14A: Preferred Utility and Subcellular Localization for Potential Lung Disease Targets

Table 14A shows the subcellular localization and preferred utility for the genes appearing in Tables 9A and 10A. mAb symbolizes monoclonal antibody, diag symbolizes
diagnostic, s.m. symbolizes small molecule, and CTL symbolizes cytotoxic lymphocytic ligand. These genes were selected from 59630 probesets on the Eos/Affymetrix Hu03
Genechip array.

Table 14B shows the accession numbers for those Pkey's lacking UnigeneID's for Table 14A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 14C shows the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in Table 14A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

Pref Utility: Preferred Utility

Pred Loc: Predicted subcellular localization



Pkey    ExAccn     UnigeneID   Unigene Title                             Pref Utility       Pred Loc

        Aw Ocv                 matrix metalloproteinase 10 (stromelysin  mAb & diag & s.m.  extracellular
inmn%   /V\Zh£/DO              LIV-1 protein, estrogen regulated         mAb                plasma membrane
*RWU/D                         ENSP00000251056*:Plasma membrane calcium  mAb & diag         secreted
WiOl \             Ue innoft   cysteine knot superfamily 1, BMP antagon  diag               secreted
        vnri7ft7               tnieneuxino                               diag
408790             Ue A7QGf\   neurotrophic tyrosine kinase, receptor,   mAb & s.m.         plasma membrane
                   u. oc no TO  serine/threonine kinase 15               s.m.               cytoplasm
                   He cnnfti   Hypothetical protein, XP_051860 (KiMiiy   CTL & diag         secreted
409103  AF251237   Hs.112208   XAGE-1 protein                            CTL                nuclear
A no A 'in  £l5UUO  HS.imoi    laminin, gamma 2 (nicein (100kD), kalini  diag               secreted
409632  W74001     Hs.55279    serine (or cysteine) proteinase inhibito  diag               secreted
4U3/0/  mm nniDQii  Hs.123114  cystatin SN                               diag               extracellular
4Uyoyd  AyvZ4/DyU  HS.D/1U 1   minichromosome maintenance deficient (S.  CTL                nuclear
xrtoocc  AWlUodo4  U. 707      inhibin, beta A (activin A, activin AB a  diag               extracellular
41UUU1             Ue C7771    kallikrein 11                             diag               extracellular
410407  X66839     Hs.63287    carbonic anhydrase IX                     mAb & s.m.         plasma membrane
41U41B  D31362     11. CtViC   transmembrane protease, serine 4          mAb & diag & s.m.  plasma membrane
412140  AA219691   Hs.73625    RAB6 interacting, kinesin-like (rabkines  s.m.
412719  AW016610   Ue B1R      ESTs                                      s.m.               nuclear
414774  X02419     Hs.77274    plasminogen activator, urokinase          diag               extracellular
414883                         CDC28 protein kinase 1                    s.m.
Aloldo  C18356     MS .233344  tissue factor pathway inhibitor 2         CTL & diag         extracellular
415669  NM_w50Zb   Lift 7QCDQ  serine (or cysteine) proteinase inhibito  mAb & diag & s.m.  secreted
41301 l  1 1DQOC7  Ue 7QDC7    protein tyrosine phosphatase, receptor-t  mAb & s.m.         plasma membrane
416658  \Miiri     ris.ry404   fibrillin 2 (congenital contractural ara  diag               extracellular
417034  NM_006163  Hs.80962    neurotensin                               diag               extracellular
417079  I tec con  Hs.81134    interleukin 1 receptor antagonist         diag               extracellular
4 I iSUo           riS.oloyz   KIAA0101 gene product                     s.m.               mitochondrial
417389  BE260964   Hs.82045    midkine (neurite growth-promoting factor                     secreted
41 /HOO  OCU/UZOD  Up P0110    5T4 oncofetal trophoblast glycoprotein    mAb                plasma membrane
41 /3 oi  aWJUo    Up. cooeo   thymidylate synthetase                    s.m.               endoplasmic reticulum
418478  U38945     Hs.1174     cyclin-dependent kinase inhibitor 2A (me  s.m.               cytoplasm
416506  AA084248               G protein-coupled receptor 39             mAb & s.m.         plasma membrane
418678  NM_001327  Hs.167379   cancer/testis antigen (NY-ESO-1)          CTL                cytoplasmic


419121  AA374372   Hs.89626    parathyroid hormone-like hormone          diag               secreted
419171  NM_002846  Hs.89655    protein tyrosine phosphatase, receptor t  mAb & s.m.         plasma membrane
419183  U60669     Hs.89663    cytochrome P450, subfamily XXIV (vitamin  CTL & s.m.         mitochondrial
419216  AU076718   Hs.164021   small inducible cytokine subfamily B (Cy  diag               secreted
419235  AW470411   Hs.288433   neurotrimin                               mAb & diag         plasma membrane
419452  U33635     Hs.90572    PTK7 protein tyrosine kinase 7            mAb & s.m.         plasma membrane
419556  U29615     Hs.91093    chitinase 1 (chitotriosidase)             mAb & diag         extracellular
420610  AI683183   Hs.99348    distal-less homeo box 5                   CTL                nuclear
421110  AJ250717   Hs.1355     cathepsin E                               s.m. & diag        extracellular
421379  Y15221     Hs.103982   small inducible cytokine subfamily B (Cy  diag               secreted
421474  U76362     Hs.104637   solute carrier family 1 (glutamate trans  mAb & s.m.         plasma membrane
421552  AF026692   Hs.105700   secreted frizzled-related protein 4       diag               secreted
421753  BE314826   Hs.107911   ATP-binding cassette, sub-family B (MDR/  mAb & s.m.         plasma membrane
421817  AF146074   Hs.108660   ATP-binding cassette, sub-family C (CFTR  mAb & s.m.         plasma membrane
422109  S73265     Hs.1473     gastrin-releasing peptide                 diag               secreted
422158  L10343     Hs.112341   protease inhibitor 3, skin-derived (SKAL  diag               secreted
422282  AF019225   Hs.114309   apolipoprotein L                          diag               secreted
422283  AW411307   Hs.114311   CDC45 (cell division cycle 45, S.cerevis  s.m.               nuclear
422424  AI186431   Hs.296638   prostate differentiation factor           diag               extracellular
422765  AW409701   Hs.1578     baculoviral IAP repeat-containing 5 (sur  s.m.               cytoplasm
422809  AK001379   Hs.121028   hypothetical protein FLJ10549             s.m.               nuclear
422867  L32137     Hs.1584     cartilage oligomeric matrix protein (pse  diag               extracellular
422956  BE545072   Hs.122579   ECT2 protein (Epithelial cell transformi  CTL & s.m.
423634  AW959908   Hs.1690     heparin-binding growth factor binding pr  diag
423573  BE003054   Hs.1695     matrix metalloproteinase 12 (macrophage   mAb & diag & s.m.  secreted
423961  D13666     Hs.136348   periostin (OSF-2os)                       mAb & diag         extracellular
424046  AF027866   Hs.138202   serine (or cysteine) proteinase inhibito  diag               secreted
424381  AA285249   Hs.146329   protein kinase Chk2                       s.m.               nuclear
424502  AF242388   Hs.149585   lengsin                                   s.m.               cytoplasmic
424503  NM_002205  Hs.149609   integrin, alpha 5 (fibronectin receptor,  mAb & s.m.         plasma membrane
424687  J05070     Hs.151738   matrix metalloproteinase 9 (gelatinase B  diag               extracellular
425247  NM_005940  Hs.155324   matrix metalloproteinase 11 (stromelysin  mAb & diag & s.m.  secreted
425322  U63630     Hs.155637   protein kinase, DNA-activated, catalytic  s.m.               cytoplasmic
425650  NM_001944  Hs.1925     desmoglein 3 (pemphigus vulgaris antigen  mAb                plasma membrane
425734  AF0552Q9   Hs.159395   peptidylglycine alpha-amidating monooxyg  s.m.               plasma membrane
425776  U25128     ns,i5»4yy   parathyroid hormone receptor 2            mAb & diag
425652  A]\UU15U4  Ue 1 COCCI  death receptor 6, TNF superfamily member  mAb & s.m.         plasma membrane
426215  AWyW4iS    U— 4 r COO'S  stanniocalcin 2                         mAb & diag
426427  M85699     Hs.169840   TTK protein kinase                        CTL & s.m.         nuclear
426514  DCDIO&M    HS.1 /Urn   bone morphogenetic protein 7 (osteogenic  mAb & diag         secreted
427335  AA448542               G antigen 7B                              CTL                cytoplasmic
427747  AW411425   Hs.180655   serine/threonine kinase 12                s.m.               cytoplasmic
428242  H55709     Hs.2250     leukemia inhibitory factor (cholinergic   diag
428330  L22524     Hs.2256     matrix metalloproteinase 7 (matrilysin,   mAb & diag & s.m.  extracellular
428450  NM_014791  Hs.184339   KIAA0175 gene product                     s.m.               nuclear
428479  Y00272     Hs.334562   cell division cycle 2, G1 to S and G2 to  s.m.               nuclear
428484  AF104032   Hs.184501   solute carrier family 7 (cationic amino   mAb & s.m.         plasma membrane
428664  AK001666   Hs.189095   similar to SALL1 (sal (Drosophila)-like   CTL & s.m.         nuclear
428598  AAo5Z77d   Hs.334838   KlAAiooo protein                          mAb
428748  AW593206   Hs.98785    Ksp37 protein                             diag               extracellular
428758  AA433988   Hs.98502    CA125 antigen; mucin 16                   diag               mitochondrial
428969  AF120274   Hs.194689   artemin                                   diag               extracellular
429211  AF052693   Hs.198249   gap junction protein, beta 5 (connexin 3  mAb & s.m.         plasma membrane
429263  AA019004   Hs.198396   ATP-binding cassette, sub-family A (ABC1  mAb & s.m.         plasma membrane
429547  AW009166   Hs.99376    ESTs                                      diag               secreted
429510  AB024937   Hs.211092   LUNX protein; PLUNC (palate lung and nas  mAb & diag         secreted
429903  AL134197   Hs.93597    cyclin-dependent kinase 5, regulatory su  s.m.
430486  BE062109   Hs.241551   chloride channel, calcium activated, fam  mAb & s.m.         plasma membrane
431462  AW583672   Hs.256311   granin-like neuroendocrine peptide precu  diag               extracellular
431515  NM_012152  Hs.258583   endothelial differentiation, lysophospha  mAb & s.m.         plasma membrane
431846  BE019924   Hs.271580   uroplakin 1B                              mAb & diag         plasma membrane
431958  X63629     Hs.2877     cadherin 3, type 1, P-cadherin (placenta  mAb & diag         plasma membrane
432201  AI538613   Hs.298241   Transmembrane protease, serine 3          mAb & diag & s.m.  plasma membrane
433001  AF217513   Hs.279905   clone HQ0310 PRO0310p1                    s.m.               nuclear
435505  AF200492   Hs.211238   interleukin-1 homolog 1                   diag               secreted
436481  AA379597   Hs.5199     HSPC150 protein similar to ubiquitin-con  s.m.
437016  AU076916   Hs.5398     guanine monphosphate synthetase           s.m.               cytoplasm
437044  AL035864   Hs.69517    differentially expressed in Fanconi's an  CTL                ER
437783  AJ581344   Hs.127812   ESTs, Weakly similar to T17330 hypotheti  CTL                nuclear
437852  BtOOIoJo   Hs.256897   ESTs, Weakly similar to dJ365O12.1 [H.sa  mAb & s.m.         plasma membrane
439223  AW238299   Hs.250618   UL16 binding protein 2                    mAb                plasma membrane
439477  W69813     Hs.58042    ESTs, Moderately similar to GFR3_HUMAN G  mAb & s.m.
439606  W79123     Hs.58561    G protein-coupled receptor 87             mAb & s.m.         plasma membrane
439738  BE2465U2   Hs.9598     sema domain, immunoglobulin domain (Ig),  mAb & s.m.         plasma membrane
440006  AK000517   Hs.6844     NALP2 protein; PYRIN-Containing APAF1-l   s.m.               nuclear
441362  BE614410   Hs.23044    RAD51 (S. cerevisiae) homolog (E coli Re  s.m.
442117  AW664964   Hs.128899   ESTs; hypothetical protein for IMAGE:447  mAb & s.m.         plasma membrane
443247  BE614387   Hs.333893   c-Myc target JPO1                         CTL                extracellular
443426  AF09815o   Hs.9329     chromosome 20 open reading frame 1        CTL
443859  NM_013409  Hs.9914     follistatin                               diag               extracellular
444006  BE395085   Hs.10086    type I transmembrane protein Fn14         mAb                plasma membrane
444371  BE540274   Hs.239      forkhead box M1                           s.m.               nuclear
444381  BE387335   Hs.283713   ESTs, Weakly similar to S64054 hypotheti  diag               secreted
444781  NM_014400  Hs.11950    GPI-anchored metastasis-associated prote  mAb & diag         plasma membrane
445537  AJ245671   Hs.12844    EGF-like-domain, multiple 6               mAb & diag         secreted
446619  AU076643   Hs.313      secreted phosphoprotein 1 (osteopontin,   diag               secreted
446921  ABOIZilo   Hs.16530    small inducible cytokine subfamily A (Cy  diag               extracellular
447033  AI357412   Hs.157601   ESTs                                      CTL & diag         secreted
447342  AI199268   Hs.19322    Homo sapiens, Similar to RIKEN cDNA 2010  CTL
448243  AW369771   Hs.52620    integrin, beta 8                          mAb & s.m.         plasma membrane
448844  AI581519   Hs.177164   ESTs                                      mAb & s.m.
449048  Z45051     Hs.22920    similar to S68401 (cattle) glucose induc  mAb                plasma membrane
449722  BE280074   Hs.23960    cyclin B1                                 s.m.               cytoplasm
450001  NM_001044  Hs.406      solute carrier family 6 (neurotransmitte  mAb & s.m.         plasma membrane
450375  AA009647               a disintegrin and metalloproteinase doma  mAb & diag & s.m.  plasma membrane
450701  K39960     Hs.288467   hypothetical protein XP_098151 (leucine-  mAb & diag         plasma membrane
450983  AA305384   Uc 9C7iin                                             diag
451668  Z43948     Hs.326444   cartilage acidic protein 1                mAb & diag         plasma membrane
452281  T93500     Hs.28792    Homo sapiens cDNA FLJ11041 fis, clone PL  diag
452401  NM_007115  Hs.29352    tumor necrosis factor, alpha-induced pro  diag               extracellular
452747  BE153855   Hs.61460    Ig superfamily receptor LNIR              mAb                plasma membrane
452838  U65011     Hs.30743    preferentially expressed antigen in mela  CTL                nuclear
453968  AA847843   Hs.62711    High mobility group (nonhistone chromoso  CTL & s.m.         nuclear
457489  AI693815   Hs.127179   cryptic gene                              diag





TABLE 14B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession

414883 15024_1 AA926960 AA926959 W76521 W24270 W21526 AA037172 BE267636 H83186 AA469909 N88395 AA001348 BE535736 AA081745 BE566245
AA082436 H72525 H77575 N49786 W80565 H78746 BE569085 W04339 R98127 T55938 BE279271 AW960304 T29812 AA476873 BE297387
AA292753 AA177048 NM_001826 X54941 BE314366 AA908783 AI719075 BE270172 BE269819 AA889955 AI204630 W25243 AI935150
AA872039 W72395 T99630 AI422691 H98460 N31428 BE255916 H03265 AI857576 AA776920 AA910844 AA459522 AA293140 AW514667
R75953 AW662398 AA562522 AI865147 AI423153 AW262230 AA584410 AA583187 AW024595 AW069734 AI828998 AA282997 AA876046
AW613002 AA527373 AVTC72459 AI831360 AA621337 M100926M
N95210 AI459432 AI041437 AA932124 AA627684 AA935829 AI004827 AJ423513 AI094597 H42079 R54703 AI830359 AA617681 AA978045
AA643280 W44581 AI991988 AI537692 AI090262 AA740817 AI312104 AI911822 AA416871 AI185409 AA129784 AA701623 AI075239
AI139549 AA633648 AI339996 AI335880 AA399239 AI078708 AI085351 AI362835 A3346618 AI146965 AI989380 AI348243 N92892 AA765850
AI494230 AI278887 AA962596 AM92500 W80435 AA001979 R97424 AI129015 N24127 AA157451 AA235549 AA459292 AA037114 AA129785
AW94211 AW059501 AW886710 R92790 N59755 AI361128 AW589407 H47725 H97534 H48076 H48456 T99631 AW300758 H03431 R76789
AA954344 H77576 R96823 AI457100 N92845 N49682 H42038 BE220698 BE220715 H99552 AA701524 N74173 R54704 H79520 H72923
H03266 BE261919 AA769633 AA480310 AA507454 AA910586 AI203723 AW104725 W25811 W25071 T88980 H03513 T77589 R99155
W95095 R97470 AA702275 T77551 AA911952 H82956 N83673 AA283572
450375 83327_1 AA009647 AA131254 AA374293 AW954405 H04410 AW606284 AA151156 BE157467 BE157601 H04384 W48291 AW663674 H04021 H01532
AA190993 H03231 H59505 H01642 AA852876 AA113758 AA626915 M746952 AI161014 AA099554 R59067



TABLE 14C 



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076
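
By way of illustration only, the nucleotide position field of Table 14C can be read as a comma-separated list of start-end pairs, one pair per predicted exon. The sketch below is not part of the original analysis; the function name `parse_exon_ranges` is hypothetical, and treating both coordinates as inclusive when computing exon lengths is an assumption.

```python
def parse_exon_ranges(field):
    """Split a 'start-end,start-end,...' field into (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


# Position field for Pkey 402075 (Ref 8117407, Plus strand) as listed in Table 14C.
exons = parse_exon_ranges(
    "121907-122035,122804-122921,124019-124161,124455-124610,125672-126076"
)

# Exon lengths, assuming inclusive coordinates (an assumption, not stated in the table).
lengths = [end - start + 1 for start, end in exons]
```

Checking that each start is smaller than its end and that successive exons do not overlap is one simple way to detect transcription errors in such a field.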
TABLE 15A: Information for all sequences in Table 16

Table 15A shows the Seq ID No, Pkey, ExAccn, UnigeneID, and Unigene Title for all of the sequences in Table 16.

Table 15B shows the accession numbers for those Pkey's lacking UnigeneID's for Table 15A. For each probeset we have listed the gene cluster number from which the
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the
"Accession" column.

Table 15C shows the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in Table 15A. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed.

Seq ID No: Sequence ID number

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title



Seq ID No:            Pkey    ExAccn     UnigeneID   Unigene Title

Seq ID No: 1 & 2      410407  X66839     Hs.63287    carbonic anhydrase IX
Seq ID No: 3 & 4      412719  AW016610   HsJ16       ESTs
Seq ID No: 5 & 6      417034  NM_006163  Hs.80962    neurotensin
Seq ID No: 7 & 8      430486  BE062109   Hs.241551   chloride channel, calcium activated, fam
Seq ID No: 9 & 10     407788  BE514982   Hs.38991    S100 calcium-binding protein A2
Seq ID No: 11 & 12    407788  BE514982   Hs.38991    S100 calcium-binding protein A2
Seq ID No: 13 & 14    407788  BE514982   Hs.38991    S100 calcium-binding protein A2
Seq ID No: 15 & 16    407788  BE514982   Hs.38991    S100 calcium-binding protein A2
Seq ID No: 17 & 18    439285  AL133916               hypothetical protein FLJ20093
Seq ID No: 19 & 20    413753  U17760     Hs.75517    laminin, beta 3 (nicein (125kD), kalinin
Seq ID No: 21 & 22    120486  AW368377   Hs.137569   tumor protein 63 kDa with strong homolog
Seq ID No: 23 & 24    425650  NM_001944  Hs.1925     desmoglein 3 (pemphigus vulgaris antigen
Seq ID No: 25 & 26    412140  AA219691   Hs.73625    RAB6 interacting, kinesin-like (rabkines
Seq ID No: 27 & 28    423673  BE003054   Hs.1695     matrix metalloproteinase 12 (macrophage
Seq ID No: 29 & 30    452838  U65011     Hs.30743    preferentially expressed antigen in mela
Seq ID No: 31 & 32    418663  AK001100   Hs.41690    desmocollin 3
Seq ID No: 33 & 34    418663  AK001100   Hs.41690    desmocollin 3
Seq ID No: 35 & 36    409632  W74001     Hs.55279    serine (or cysteine) proteinase inhibito
Seq ID No: 37 & 38    429610  AB024937   Hs.211092   LUNX protein; PLUNC (palate lung and nas
Seq ID No: 39 & 40    406690  M29540     Hs.220529   carcinoembryonic antigen-related cell ad
Seq ID No: 41 & 42    431846  BE019924   Hs.271580   uroplakin 1B
Seq ID No: 43 & 44    418830  BE513731   Hs.88959    hypothetical protein MGC4816
Seq ID No: 45 & 46    424098  AF077374   Hs.139322   small proline-rich protein 3
Seq ID No: 47 & 48    443648  AKJ85377   Hs.143610   ESTs
Seq ID No: 49         311034  BE567130   Hs^113B9    ESTs, Highly similar to NKGD_HUMAN NKG2-
Seq ID No: 50 & 51    406522  AI541214   Hs.46320    Small proline-rich protein SPRK [human,
Seq ID No: 52 & 53    422158  L10343     Hs.112341   protease inhibitor 3, skin-derived (SKAL
Seq ID No: 54 & 55    435505  AF200492   Hs.211238   interleukin-1 homolog 1
Seq ID No: 56 & 57    417366  BE185289   Hs.1076     small proline-rich protein 1B (cornifin)
Seq ID No: 58 & 59    431958  X63629     Hs.2877     cadherin 3, type 1, P-cadherin (placenta
Seq ID No: 60 & 61    441020  W79283     Hs.35962    ESTs
Seq ID No: 62 & 63    423217  NM_000094  Hs.1640     collagen, type VII, alpha 1 (epidermolys
Seq ID No: 64 & 65    429538  BE182592   Hs.11261    small proline-rich protein 2A
Seq ID No: 66 & 67    448733  NM_005629  Hs.187958   solute carrier family 6 (ni
Seq ID No: 68 & 69    444371  BE540274   Hs.239      forkhead box M1
Seq ID No: 70 & 71    444371  BE540274   Hs.239      forkhead box M1
Seq ID No: 72 & 73    444371  BE540274   Hs.239      forkhead box M1
Seq ID No: 74 & 75    422168  AA586894   Hs.112408   S100 calcium-binding protein A7 (psorias
Seq ID No: 76 & 77    422168  AA586894   Hs.112408   S100 calcium-binding protein A7 (psorias
Seq ID No: 78 & 79    429259  AA420450   Hs.292911   plakophilin
Seq ID No: 80 & 81    426440  BE382756   Hs.169902   solute carrier family 2 (facilitated glu
Seq ID No: 82 & 83    437044  AL035864   Hs.69517    differentially expressed in Fanconi's an
Seq ID No: 84 & 85    423662  AK001035   Hs.130881   B-cell CLL/lymphoma 11A (zinc finger pro
Seq ID No: 86 & 87    428484  AF104032   Hs.184601   solute carrier family 7 (cationic amino
Seq ID No: 88 & 89    429211  AF052693   Hs.198249   gap junction protein, beta 5 (connexin 3
Seq ID No: 90 & 91    417389  BE260964   Hs.82045    midkine (neurite growth-promoting factor
Seq ID No: 92 & 93    423634  AW959908   Hs.1690     heparin-binding growth factor binding pr
Seq ID No: 94 & 95    417515  L24203     Hs.82237    ataxia-telangiectasia group D-associated
Seq ID No: 96 & 97    441362  BE614410   Hs.23044    RAD51 (S. cerevisiae) homolog (E coli Re
Seq ID No: 98 & 99    425322  U63630     Hs.155637   protein kinase, DNA-activated, catalytic
Seq ID No: 100 & 101  449003  X76342     Hs.389      alcohol dehydrogenase 7 (class IV), mu o
Seq ID No: 102 & 103  431009  BE149762   Hs.48956    gap junction protein, beta 6 (connexin 3
Seq ID No: 104 & 105  409103  AF251237   Hs.112208   XAGE-1 protein
Seq ID No: 106 & 107  417542  J04129     Hs.82269    progestagen-associated endometrial prote
Seq ID No: 108 & 109  428471  X57348     Hs.184510   stratifin
Seq ID No: 110 & 111  418004  U37519     Hs.87539    aldehyde dehydrogenase 3 family, member
Seq ID No: 112 & 113  414761  AU077228   Hs.77256    enhancer of zeste (Drosophila) homolog 2
Seq ID No: 114 & 115  416203  X54942     Hs.83758    CDC28 protein kinase 2
Seq ID No: 116        447343  AA256641   Hs.236894   ESTs, Highly similar to S02392 alpha-2-m
Seq ID No: 117 & 118  437016  AU076916   Hs.5398     guanine monphosphate synthetase
Seq ID No: 119 & 120  449230  BE613348   Hs.211579   melanoma cell adhesion molecule
Seq ID No: 121 & 122  446989  AK001898   Hs.16740    hypothetical protein FLJ11035
Seq ID No: 123 & 124  457819  AA057484   Hs.35406    ESTs, Highly similar to unnamed protein
Seq ID No: 125 & 126  424687  J05070     Hs.151738   matrix metalloproteinase 9 (gelatinase B
Seq ID No: 127 & 128
Seq ID No: 129 & 130
Seq ID No: 131 & 132
Seq ID No: 133 & 134
Seq ID No: 135 & 136
Seq ID No: 137 & 138
Seq ID No: 139 & 140
Seq ID No: 141 & 142
Seq ID No: 143 & 144
Seq ID No: 145 & 146
Seq ID No: 147 & 148
Seq ID No: 149 & 150
Seq ID No: 151 & 152
Seq ID No: 153 & 154
Seq ID No: 155 & 156
Seq ID No: 157 & 158
Seq ID No: 159 & 160
Seq ID No: 161 & 162
Seq ID No: 163 & 164
Seq ID No: 165 & 166
Seq ID No: 167 & 168
Seq ID No: 169 & 170
Seq ID No: 171 & 172
Seq ID No: 173 & 174
Seq ID No: 175 & 176
Seq ID No: 177 & 178
Seq ID No: 179 & 180
Seq ID No: 181 & 182
Seq ID No: 183 & 184
Seq ID No: 185 & 186
Seq ID No: 187 & 188
Seq ID No: 189 & 190
Seq ID No: 191 & 192
Seq ID No: 193 & 194
Seq ID No: 195 & 196
Seq ID No: 197 & 198
Seq ID No: 199 & 200
Seq ID No: 201 & 202
Seq ID No: 203 & 204
Seq ID No: 205 & 206
Seq ID No: 207 & 208
Seq ID No: 209 & 210
Seq ID No: 211 & 212
Seq ID No: 213 & 214
Seq ID No: 215 & 216
Seq ID No: 217 & 218
Seq ID No: 219 & 220
Seq ID No: 221 & 222
Seq ID No: 223 & 224
Seq ID No: 225 & 226
Seq ID No: 227 & 228
Seq ID No: 229 & 230
Seq ID No: 231 & 232
Seq ID No: 233
Seq ID No: 234 & 235
Seq ID No: 236 & 237
Seq ID No: 238
Seq ID No: 239 & 240
Seq ID No: 241 & 242
Seq ID No: 243 & 244
Seq ID No: 245
Seq ID No: 246 & 247
Seq ID No: 248 & 249
Seq ID No: 250 & 251
Seq ID No: 252 & 253
Seq ID No: 254 & 255
Seq ID No: 256 & 257
Seq ID No: 258 & 259
Seq ID No: 260 & 261
Seq ID No: 262 & 263
Seq ID No: 264 & 265
Seq ID No: 266 & 267
Seq ID No: 268 & 269
Seq ID No: 270 & 271
Seq ID No: 272 & 273
Seq ID No: 274 & 275
Seq ID No: 276 & 277
Seq ID No: 278 & 279
Seq ID No: 280 & 281
Seq ID No: 282
Seq ID No: 283 & 284
Seq ID No: 285 & 286
Seq ID No: 287 & 288
Seq ID No: 289 & 290
Seq ID No: 291 & 292

414774



414430 


AI346201 


Hs.76118 


418462 


BE001536 


Lin ooce 
Hs.85Zoo 


100668 


L05424 


Hs.169810 


458933 


AI638429 


Hs.24753 


418478 


U38945 


Hs.1174 


418478 


U38945 


Hs.1174 


418478 


U38945 


Hs.1174 


418478 


U38945 


Hs.1174 


445269 


AW263155 


Hs.14559 


422765 


AW409701 


Hs.1678 


436481 


AA379597 


Hs.5199 


440325 


NM_003812 


Hs.7164 


439606 


W/9lZ3 


Hs.58551 


453884 


AA355925 




453884 


AA355925 


Hs.35232 


453884 


AA355925 


Hs.35232 


453884 


AA355925 


Hs.35232 


404877 






413129 


AF292100 


Hs.104613 


413281 


AA851271 


Hs.22024 


444781 


NM_014400 


Hs.11950 


416819 


U77735 


Hs.80205 


451320 


AW118072 




418543 


NM_005329 


Hs.85952 


454034 


NM_000691 


Hs.575 


425397 


J04088 


Hs.156346 


415817 


U88967 


Hs.78867 


415817 


U88967 


Hs.78867 


415817 


U88967 


Hs.78867 


415817 


U88967 


Hs.78867 


415817 


U88967 


Hs.78867 


419121 


AA374372 


Hs.89626 


448993 


AI471630 


Hs.8127 


421817 


AF146074 


Hs.108660 


430393 


BE185030 


Hs.241305 


425057 


AA826434 


Hs.1619 


420462 


AF050147 


Hs.97932 


102963 


X02404 


Hs.274534 


100576 


X00356 


Hs.37058 


101175 


U82671 


Hs.36980 


429038 


AL023513 


Hs.194766 


418678 


NM_001327 


Hs.167379 


418678 


NM_001327 


Hs.167379 


131927 


A JAAA4 4 A 


Hs.34780 


428182 


BE386042 


Hs.293317 


427335 


AA448542 


Hs.251677 


409420 


Z15008 


Hs.54451 


114346 


AI137256 


Hs.130489 


438956 


W00847 


Hs.135056 


404440 






415669 


NM_005025 


Hs.78589 


103312 


Y12642 


Hs.3185 


320843 


BE069288 


Hs.34744 


429065 


AI753247 


Hs.29643 


446102 


AW168067 


Hs.317694 


330495 


U47924 


Hs.71642 


413573 


AI733859 


Hs.149089 


428479 


Y00272 


Hs.334562 


428479 


Y00272 


Hs.334562 


332180 


AF134160 


Hs.7327 


437915 


AI637993 


Hs.202312 


441553 


AA281219 


Hs.121295 


331692 


AI683487 


Hs.152213 


429413 


NM_014058 


Hs.201877 


422283 


AW411307 


Hs.114311 


448357 


N20169 


Hs.108923 


446292 


AF081497 


Hs.279682 


416209 


AA236776 


Hs.79078 


453922 
Hs.114218 


frizzled (Drosophila) homolog 6 


428450 


Kill MAIM 

NM_UW91 


Hs.184339 


VI A AIM "7C -arm nrnJi ml 

MAAUi Jo gene product 


446619 


AI UY7CCA1 

AUU70043 


Uk 341 


secreted phosphoprotein 1 (osteopontin, 


45339/ 


U23752 




SRY (sex determining region Y)-box 11 


426514 


BE616533 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


425776 


U25128 


Hs.159499 


parathyroid hormone receptor 2 


425776 


U25128 


Hs.159499 


parathyroid hormone receptor 2 


431515 


NM_012152 


Hs.258583 


endothelial differentiation, lysophospha 


419452 


U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino act 


432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino act 


432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino act 


432653 


N62096 


Hs.293185 


ESTs, Weakly similar to JC7328 amino act 


410001 


AB041036 


Hs.57771 


kallikrein 11 


426501 


AW043782 


Hs.293616 


ESTs 


408369 


R38438 


Hs.182575 


solute carrier family 15 (H777 transport 


445413 


AA151342 


Hs.12677 


CGM47 protein 


422424 


AI186431 


Hs.295638 


prostate differentiation factor 


428330 


L22524 


Hs.2256 


matrix metalloproteinase 7 (matrilysin, 


420610 


AI683183 


Hs.99348 


distal-less homeo box 5 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
309931 
330493 
439285 

450375 



CAT Number 
AW341683 
33264_5 
47065_1 

83327_1 



451320 86576_1 



Accession 

M27826 R78416 AA307645 AW957879 AW957800 AA633529 H03562 

AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 AI867454 N54784 AI655270 AW21279 AW014882 
M775552 N62351 N59253 AA626243 AI341407 BE175639 AA456968 AI358918 AA457077 

AA0^WM131254AA374293AW9^05H(W10AW6M2M H04364 W46291 AW663674 H04021 K01532 

AA190993 H03231 H59605 H01642 AA852876 AA113758 AA626915 AA746952 AI161014 AA099554 R69067 
AW118072 AI631982 T15734 AA224195 AI701458 W20198 F26326 AA890570 N90552 AW071907 AI671352 AI375892 T03517 R88265 
AI124088 AA224388 AI084316 AI354686 T33652 AI140719 AI720211 T03490 AI372637 T15415 AW205836 AA630384 T03515 T33230 
AA017131 AA443303 T33623 AI222556 T33511 T33785 AI419606 D55612 



TABLE 15C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 
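
As an illustration of how the Nt_position column can be read, the following minimal Python sketch parses one field into exon ranges. It assumes that each field is a comma-separated list of "start-end" pairs with 1-based, inclusive coordinates; that reading is inferred from the entries listed in this table and is not stated in the legend. The function names `parse_nt_position` and `total_exon_length` are hypothetical.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split a field such as '96450-96598' into (start, end) integer pairs.

    Assumption: ranges are comma-separated and each range is 'start-end'.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start after end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, treating both coordinates as inclusive (assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    # Field taken from the Nt_position column of Table 15C.
    exons = parse_nt_position("96450-96598")
    print(exons, total_exon_length(exons))
```

A field with several predicted exons yields one tuple per exon in the order listed, so the per-exon lengths can be inspected individually before summing.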



Pkey 

402075 

403329 

403478 

404440 

404877 

405770 

405932 



Ref 

8117407 
8516120 
9958258 
7528051 
1519284 
2735037 
7767812 



Strand 

Plus 

Plus 

Plus 

Plus 

Plus 

Plus 



Nt_position 

121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 

96450-96598 

116458-116564 

80430-81581 

1095-2107 

61057-62075 

123525-123713 
Table 16 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: NM_001216 

Coding sequence: 43-1422 






1 11 21 31 41 51 

I I I I I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 
AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 
CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GGAGGAGGAT 180 
TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 
AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 
GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 
TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 
AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 
CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 
CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 
CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 
CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 
CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 
CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 
GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 
AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 
CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 
TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 
CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 
GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200 
GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260 
AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320 
ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 
GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 
TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 

Seq ID NO: 2 Protein sequence: 
Protein Accession #: NP_001207 

MAPLCPSPWL 

PUiIPAPAPG 
SPREEDPPGE 

DPQEPQNNAH 
BLLGPQLPPL 
VEGHRFPAEI 
EEGSETQVPG 
DTLWGPGDSR 
GLLFAVTSVA 

HWHLSTAFA 
LDISALLPSD 
LQLNFRATQP 
FLVQMRRQHR 

21 

I 

LTVQLL.LSLL. 
EDLPGEEDLP 
WRYGGDPPWP 
SVQLTLPPGL 
HVDEALGRPG 
PSRYFQYEGS 
LNGRVIEASF 
RGTKGGVSYR 

31 
t 

LLMPVHPQRL 
GEEDLPEVKP 
RVSPACAGRF 
EMALGPGREY 
GLAVXaAAFLE 
LTTPPCAQGV 
PAGVDSSPRA 
PAEVAETGA 

41 

I 

PRMQEDSPLG 
KSEEEGSLKL 
QSPVDIRPQL 
RALQLHLHWG 
EGPEENSAYE 
IWTVPNQTVM 
AEPVQLNSCL 



51 

I 

GGSSGEDDPL 
EDLPTVEAPG 
AAFCPALRPL 
QLLSRLEEIA 
LSAKQLHTLS 
AAGDILALVF 

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: BC013923 

Coding sequence: 438-1391 

1 11 21 31 41 51 

I I I I I I 

AGCGGGGTTG TCTATTAACT TGTTCAAAAA GTATCAGGAG TTGTCAAGGC AGAGAAGAGA 60 
GTGTTTGCAA AAGGGGGAAA GTAGTTTGCT GCCTCTTTAA GACTAGGACT GAGAGAAAGA 120 
AGAGGAGAGA GAAAGAAAGG GAGAGAAGTT TGAGCCCCAG GCTTAAGCCT TTCCAAAAAA 180 
TAATAATAAC AATCATCGGC GGCGGCAGGA TCGGCCAGAG GAGGAGGGAA GCGCTTTTTT 240 
TGATCCTGAT TCCAGTTTGC CTCTCTCTTT TTTTCCCCCA AATTATTCTT CGCCTGATTT 300 
TCCTCGCGGA GCCCTGCGCT CCCGACACCC CCGCCCGCCT CCCCTCCTCC TCTCCCCCCG 360 
CCCGCGGGCC CCCCAAAGTC CCGGCCGGGC CGAGGGTCGG CGGCCGCCGG CGGGCCGGGC 420 
CCGCGCACAG CGCCCGCATG TACAACATGA TGGAGACGGA GCTGAAGCCG CCGGGCCCGC 480 
AGCAAACTTC GGGGGGCGGC GGCGGCAACT CCACCGCGGC GGCGGCCGGC GGCAACCAGA 540 
AAAACAGCCC GGACCGCGTC AAGCGGCCCA TGAATGCCTT CATGGTGTGG TCCCGCGGGC 600 
AGCGGCGCAA GATGGCCCAG GAGAACCCCA AGATGCACAA CTCGGAGATC AGCAAGCGCC 660 
TGGGCGCCGA GTGGAAACTT TTGTCGGAGA CGGAGAAGCG GCCGTTCATC GACGAGGCTA 720 
AGCGGCTGCG AGCGCTGCAC ATGAAGGAGC ACCCGGATTA TAAATACCGG CCCCGGCGGA 780 
AAACCAAGAC GCTCATGAAG AAGGATAAGT ACACGCTGCC CGGCGGGCTG CTGGCCCCCG 840 
GCGGCAATAG CATGGCGAGC GGGGTCGGGG TGGGCGCOGG CCTGGGCGCG GGCGTGAACC 900 
AGCGCATGGA CAGTTACGCG CACATGAACG GCTGGAGCAA CGGCAGCTAC AGCATGATGC 960 
AGGACCAGCT GGGCTACCCG CAGCACCCGG GCCTCAATGC GCACGGCGCA GCGCAGATGC 1020 
AGCCCATGCA CCGCTACGAC GTGAGCGCCC TGCAGTACAA CTCCATGACC AGCTCGCAGA 1080 
CCTACATGAA OGGCTCGCCC ACCTACAGCA TGTCCTACTC GCAGCAGGGC ACCCCTGGCA 1140 
TGGCTCTTGG CTCCATGGGT TCGGTGGTCA AGTCCGAGGC CAGCTCCAGC CCCCCTGTGG 1200 
TTACCTCTTC CTCCCACTCC AGGGCGCCCT GCCAGGCCGG GGACCTCCGG GACATGATCA 1260 
GCATGTATCT CCCCGGCGCC GAGGTGCCGG AACCCGCCGC COCCAGCAGA CTTCACATGT 1320 
CCCAGCACTA CCAGAGCGGC CCGGTGCCCG GCACGGCCAT TAACGGCACA CTGCCCCTCT 1380 
CACACATGTG AGGGCCGGAC AGCGAACTGG AGGGGGGAGA AATTTTCAAA GAAAAACGAG 1440 
GGAAATGGGA GGGGTGCAAA AGAGGAGAGT AAGAAACAGC ATGGAGAAAA CCCGGTACGC 1500 
TCAAAAAAAA AAAAAAAAAA AAAATCCCAT CACCCACAGC AAATGACAGC TGCAAAAGAG 1560 
AACACCAATC CCATCCACAC TCACGCAAAA ACCGOGATGC CGACAAGAAA ACTTTTATGA 1620 
GAGAGATCCT GGACTTCTTT TKGGGGGACT ATTTTTGTAC AGAGAAAACC TGGGGAGGGT 1680 
GGGGAGGGCG GGGGAATGGA CCTTGTATAG ATCTGGAGGA AAGAAAGCTA CGAAAAACTT 1740 
TTTAAAAGTT CTAGTGGTAC GGTAGGAGCT TTGCAGGAAG TTTGCAAAAG TCTTTACCAA 1800 
TAATATTTAG AGCTAGTCTC CAAGCGACGA AAAAAATGTT TTAATATTTG CAAGCAACTT 1860 
TTGTACAGTA TTTATOGAGA TAAACATGGC AATCAAAATG TCCATTGTTT ATAAGCTGAG 1920 
AATTTGCCAA TATTTTTCAA GGAGAGGCTT CTTQCTGAAT TTTGATTCTG CAGCTGAAAT 1980 

TTAGGACAGT TGCAAACGTO AAAAGAAGAA AATTATTCAA ATTTGGACAT TTTAATTGTT 2040 

TAAAAATTGT ACAAAAGGAA AAAATTAGAA TAAGTACTGG CGAACCATCT CTGTGGTCTT 2100 

GTTTAAAAAG GGCAAAAGTT TTAGACTGTA CTAAATTTTA TAACTTACTG TTAAAAGCAA 2160 

AAATGGCCAT GCAGGTTGAC ACCGTTGGTA ATTTATAATA GCTTTTGTTC GATCCCAACT 2220 

TTCCATTTTG TTCAGATAAA AAAAACCATG AAATTACTGT GTTTGAAATA TTTTCTTATG 2280 

GTTTGTAATA TTTCTGTAAA TTTATTGTGA TATTTTAAGG TTTTCCCCCC TTTATTTTCC 2340 

GTAGTTGTAT TTTAAAAGAT TCGGCTCTGT ATTATTTGAA TCAGTCTGCC GAGAATCCAT 2400 

GTATATATTT GAACTAATAT CATCCTTATA ACAGGTACAT TTTCAACTTA AG 1' IT IT ACT 2460 

CCATTATGCA CAGTTTGAGA TAAATAAATT TTTGAAATAT GGACACTGAA AAAAAAAAAA 2520 

AAAAAAACAA AACAAAAAAA CAAAAAACAA AAACAGAAAA AACAAAAAAA AAAACAAAAC 2580 

CACAACACAA AAACAAAAAA AAAAAAAAGA AACAAACACA CAACACAACA CAACACAAAA 2640 
CCACAACACA AACAACAACA CACAGAGGG 

Seq ID NO: 4 Protein sequence: 
Protein Accession #: CAA83435.1 

1 11 21 31 41 51 

I I I I I I 

MYNMMETELK PPGPQQTSGG GGGNSTAAAA GGNQKNSPDR VKRPMNAPMV WSRGQRRKMA 60 

QENPKMHNSE ISKRLGAEWK LLSETEKRPF IDEAKRLRAL HMKEHPDYKY RPRRKTKTLM 120 

KKDKYTLPGG HAPGGNSMA SGVGVGAGLG AGVNQRMDSY AEMNGWSNGS YSMMQDQLGY 180 

PQHPGLNAHQ AAQMQPMHRY DVSALQYNSM TSSQTYMNGS PTYSMSYSQQ GTPGMALGSM 240 

GSWKSEASS SPPWTSSSH SRAPCQAGDL RDMISMYLPG ABVPEPAAPS RLHMSQHYQS 300 
GPVPGTAING TLPLSHM 

Seq ID NO: 5 DNA sequence 
Nucleic Acid Accession #: U91618 
Coding sequence: 29-541 

1 11 21 31 41 51 

I I I I I I 

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 

TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420 

TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 

GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540 

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660 

ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 



Seq ID NO: 6 Protein sequence: 
Protein Accession #: AAB50564 

1 11 21 31 41 51 

I I I I I I 

MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60 

VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 120 
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 
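
The relationship between a "Coding sequence" range and the protein listed after it can be illustrated with a minimal Python sketch. It assumes the range is 1-based and inclusive and that the standard genetic code applies; neither is stated explicitly in this listing. The helper names `extract_cds` and `translate` are hypothetical, and the input is only the first 60 nucleotides of Seq ID NO: 5, so the slice stops at position 58 rather than at the end of the annotated coding sequence (541).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def extract_cds(sequence: str, start: int, end: int) -> str:
    """Return positions start..end (1-based, inclusive) after removing spaces."""
    cleaned = "".join(sequence.split()).upper()
    return cleaned[start - 1:end]


def translate(cds: str) -> str:
    """Translate codon by codon; stop at the first stop codon.

    Codons containing unrecognized characters are reported as 'X'.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First row (positions 1-60) of Seq ID NO: 5 as printed above.
    row = "CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG"
    print(translate(extract_cds(row, 29, 58)))
```

The ten residues produced from this slice can be compared with the first block of Seq ID NO: 6; the same comparison is a convenient check on individual characters that are doubtful in either sequence.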

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: NM_006536.2 

Coding sequence: 109-2940 

1 11 21 31 41 51 

I I I I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAOT GTTTGTCCAT 600 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TOCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 



CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCAT6ATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTQ GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTQGQCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTAOGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAAC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA AOGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

AOGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAQTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 8 Protein sequence: 
Protein Accession #: NP_006527.1 

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIPVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADPHHSFP MNGTELPPPP 300 

TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKLNGKA YGSVMILVTS 420 

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALX VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720 

IQMNAPRKSV GRKEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDQGOATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LILKGVLTAM GLIGIICLII WTHHTLSRK KRADKKEKGT KLL 



Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 336-632 

1 11 21 31 41 51 

I I I I I I 

CTCCCCTCAC CCCGGTCCAG GATGCCCAGT CCCCACGACA CCTCCCACTT CCCACTGTGG 60 

CCTGGGTGGG CTCAGGGGCT GCCCTTGACC TGGCCTAGAG CCCTCCCCCA GCTGGTGGTG 120 

GAGCTGGCAC TCTCTGGGAG GGAGGGGGCT GGGAGGGAAT GAGTGGGAAT GGCAAGAGGC 180 

CAGGGTTTGG TGGGATCAGG TTGAGGCAGG TTTGGTTTCC TTAAAATGCC AAGTTGGGGG 240 

CCAGTGGGGC CCACATATAA ATCCTCACCC TGGGAGCCTG GCTGCCTTGC TCTCCTTCCT 300 

GGGTCTGTCT CTGCCACCTG GTCTGCCACA GATCCATGAT GTGCAGTTCT CTGGAGCAGG 360 

CGCTGGCTGT GCTGGTCACT ACCTTCCACA AGTACTCCTG CCAAGAGGGC GACAAGTTCA 420 

AGCTGAGTAA GGGGGAAATG AAGGAACTTC TGCACAAGGA GCTGCCCAGC TTTGTGGGGG 480 

AGAAAGTGGA TGAGGAGGGG CTGAAGAAGC TGATGGGCAG CCTGGATGAG AACAGTGACC 540 

AGCAGGTGGA CTTCCAGGAG TATGCTGTTT TCCTGGCACT CATCACTGTC ATGTGCAATG 600 

ACTTCTTCCA GGGCTGCCCA GACCGACCCT GAAGCAGAAC TCTTGACTTC CTGCCATGGA 660 

TCTCTTGGGC CCAGGACTGT TGATGCCTTT GAGTTTTGTA TTCAATAAAC TTTTTTTGTC 720 

TGTTGATAAT ATTTTAATTG CTCAGTGATG TTCCATAACC CGGCTGGCTC AGCTGGAGTG 780 

CTGGGAGATG AGGGCCTCCT GGATCCTGCT CCCTTCTGGG CTCTGACTCT CCTGGAAATC 840 

TCTCCAAGGC CAGAGCTATG CTTTAGGTCT CAATTTTGGA ATTTCAAACA CCAGCAAAAA 900 

ATTGGAAATC GAGATAGGTT GCTGACTTTT ATTTTGTCAA ATAAAGATAT TAAAAAAGGC 960 
AAATACCA 



Seq ID NO: 10 Protein sequence: 



Protein Accession #: NP_005969.1 

1 11 21 31 41 51 

I I I I I I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 

Seq ID NO: 11 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 336-626 

1 11 21 31 41 51 

I I I I I I 

CTCCCCTCAC CCCGGTCCAG GATGCCCAGT CCCCACGACA CCTCCCACTT CCCACTGTGG 60 

CCTGGGTGGG CTCAGGGGCT GCCCTTGACC TGGCCTAGAG CCCTCCCCCA GCTGGTGGTG 120 

GAGCTGGCAC TCTCTGGGAG GGAGGGGGCT GGGAGGGAAT GAGTGGGAAT GGCAAGAGGC 180 

CAGGGTTTGG TGGGATCAGG TTGAGGCAGG TTTGGTTTCC TTAAAATGCC AAGTTGGGGG 240 

CCAGTGGGGC CCACATATAA ATCCTCACCC TGGGAGCCTG GCTGCCTTGC TCTCCTTCCT 300 

GGGTCTGTCT CTGCCACCTG GTCTGCCACA GATCCATGAT GTGCAGTTCT CTGGAGCAGG 360 

CGCTGGCTGT GCTGGTCACT ACCTTCCACA AGTACTCCTG CCAAGAGGGC GACAAGTTCA 420 

AGCTGAGTAA GGGGGAAATG AAGGAACTTC TGCACAAGGA GCTGCCCAGC TTTGTGGGGC 480 

ATTCCAGAGA ACCATGTGCT GTGAGGGCCT TCCGAGTCCA TCTGTTTAAT CCTGTCATTG 540 

GAGACTTGAG AAACCAGAGC CCAGAAGGGA AAAGTGATTG TCCCAAGATC ACACAGCACT 600 

GGAGAAAGTG GATGAGGAGG GGCTGAAGAA GCTGATGGGC AGCCTGGATG AGAACAGTGA 660 

CCAGCAGGTG GACTTCCAGG AGTATGCTGT TTTCCTGGCA CTCATCACTG TCATGTGCAA 720 

TGACTTCTTC CAGGGCTGCC CAGACCGACC CTGAAGCAGA ACTCTTGACT TCCTGCCATG 780 

GATCTCTTGG GCCCAGGACT GTTGATGCCT TTGAGTTTTG TATTCAATAA ACTTTTTTTG 840 

TCTGTTGATA ATATTTTAAT TGCTCAGTGA TGTTCCATAA CCCGGCTGGC TCAGCTGGAG 900 

TGCTGGGAGA TGAGGGCCTC CTGGATCCTG CTCCCTTCTG GGCTCTGACT CTCCTGGAAA 960 

TCTCTCCAAG GCCAGAGCTA TGCTTTAGGT CTCAATTTTG GAATTTCAAA CACCAGCAAA 1020 

AAATTGGAAA TCGAGATAGG TTGCTGACTT TTATTTTGTC AAATAAAGAT ATTAAAAAAG 1080 

GCAAATACCA 

Seq ID NO: 12 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGHS REPCAVRAFR 60 
VHLFNPVIGD LRNQSPEGKS DCPKITQHWR RWMRRG 

Seq ID NO: 13 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 58-354 

1 11 21 31 41 51 

I I I I I I 

GTGAGCTCAC CATGTGGGGG TGAGGCTGAG AGAAAACAAG TACACAGCCA CAGATCCATG 60 

ATGTGCAGTT CTCTGGAGCA GGCGCTGGCT GTGCTGGTCA CTACCTTCCA CAAGTACTCC 120 

TGCCAAGAGG GCGACAAGTT CAAGCTGAGT AAGGGGGAAA TGAAGGAACT TCTGCACAAG 180 

GAGCTGCCCA GCTTTGTGGG GGAGAAAGTG GATGAGGAGG GGCTGAAGAA GCTGATGGGC 240 

AGCCTGGATG AGAACAGTGA CCAGCAGGTG GACTTCCAGG AGTATGCTGT TTTCCTGGCA 300 
CTCATCACTG TCATGTGCAA TGACTTCTTC CAGGGCTGCC CAGACCGACC CTGAAGCAGA 360 

ACTCTTGACT TCCTGCCATG GATCTCTTGG GCCCAGGACT GTTGATGCCT TTGAGTTTTG 420 

TATTCAATAA ACTTTTTTTG TCTGTTGATA ATATTTTAAT TGCTCAGTGA TGTTCCATAA 480 

CCCGGCTGGC TCAGCTGGAG TGCTGGGAGA TGAGGGCCTC CTGGATCCTG CTCCCTTCTG 540 

GGCTCTGACT CTCCTGGAAA TCTCTCCAAG GCCAGAGCTA TGCTTTAGGT CTCAATTTTG 600 
GAATTTCAAA CACCAGCAAA AAATTGGAAA TCGAGATAGG TTGCTGACTT TTATTTTGTC 660 

AAATAAAGAT ATTAAAAAAG GCAAATACCA 

Seq ID NO: 14 Protein sequence: 
Protein Accession #: NP_005969.1 

1 11 21 31 41 51 

I I I I I I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 






Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 62-358 



1 11 21 31 41 51 

I I I I I I 

GGAGGGTGTG CCGCTGAGTC ACTGCCTGGG CATCTGGGCC TGGAACCTCG GCCACAGATC 60 

CATGATGTGC AGTTCTCTGG AGCAGGCGCT GGCTGTGCTG GTCACTACCT TCCACAAGTA 120 

CTCCTGCCAA GAGGGCGACA AGTTCAAGCT GAGTAAGGGG GAAATGAAGG AACTTCTGCA 180 

CAAGGAGCTG CCCAGCTTTG TGGGGGAGAA AGTGGATGAG GAGGGGCTGA AGAAGCTGAT 240 

GGGCAGCCTG GATGAGAACA GTGACCAGCA GGTGGACTTC CAGGAGTATG CTGTTTTCCT 300 

GGCACTCATC ACTGTCATGT GCAATGACTT CTTCCAGGGC TGCCCAGACC GACCCTGAAG 360 

CAGAACTCTT GACTTCCTGC CATGGATCTC TTGGGCCCAG GACTGTTGAT GCCTTTGAGT 420 

TTTGTATTCA ATAAACTTTT TTTGTCTGTT GATAATATTT TAATTGCTCA GTGATGTTCC 480 

ATAACCCGGC TGGCTCAGCT GGAGTGCTGG GAGATGAGGG CCTCCTGGAT CCTGCTCCCT 540 

TCTGGGCTCT GACTCTCCTG GAAATCTCTC CAAGGCCAGA GCTATGCTTT AGGTCTCAAT 600 

TTTGGAATTT CAAACACCAG CAAAAAATTG GAAATCGAGA TAGGTTGCTG ACTTTTATTT 660 




TGTCAAATAA AGATATTAAA AAAGGCAAAT ACCA 

Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_005969.1 

1 11 21 31 41 51 

I I I I I I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 939-2372 



1 11 21 31 41 51 

I I I I I I 

AAGACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCOGC 60 
AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 120 
CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 
CGCQGGCGCA GGGCACGCGT TCGCGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 
AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 300 
CTGTGGTGTG AATTAGGGAC CGGGAGGOGT CGAACGGAGG AACGGTTCAT CTTAGAGACT 360 
AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 
GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 480 
CGCTCGCGCT CTACGCGCTC AGTCCCCGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC 540 
GGCGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 
AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 660 
GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 
GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTCGCTCCGG 780 
ACCAGCTCAG CCTCTGATAA GCTGGACTCG GCACGCCCGC AACAAGCACC GAGGAGTTAA 840 
GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 
GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG ATAAGGTGGC 960 
ATGGACCCGC CATGGCGCGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 
CCGCTTTCGC CTGTCCCACG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080 
CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 
TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 1200 
AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 
ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 1320 
CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 1380 
ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440 
GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA 1500 
ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 
TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCCG GTTCCTAATA 1620 
TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 
GCTCCTTAAG GATAACTAAC ATTTCATCGG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 
CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 
CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 
AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 
AATACATCTG TACTAAAATA CATGTTACCA ATCACAOGGA GTACCACGGC TGCCTCCAGC 1980 
TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 
GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 
CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACATCG 2160 
GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220 
GGGAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGDGTC TGTGGTGGGA TTTTGCCTTT 2280 
TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 
TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 
GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 
CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 
GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 
GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 
TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700 
AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 
TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 
GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 
TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 
ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 
GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060 
TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120 
AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 
GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 
TTGCCCTTTT TTATTTGCCC TCTGOGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300 
AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 
CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 
CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 
CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 
TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 
ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3660 
CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 
TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 
AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 
TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 
TCTATAGATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 
CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 
TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080 
ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140 
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5 

10 
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WO 02/086443 

TGCCTTAAAG AGGGGCAGTT TCTCAAAAGC 
TCCTAACTCC ATTTGAATGT AAGGGCAGCT 
TCTGAATTCC CATTTTCTTG TTCGCGGCTA 
GATCTTTCCC AAAGGTGTTG ATTTACAAAG 
GAAAGAGAGA TGAAATTCAA GCTGTGAGCC 
GAGAATCAGC CATTTGGTAC AAAAAAGATT 
ATAGAAAGGC TATGGATTGT TTAAGAACTA 
AATAAAAAAA AAGGAATATT TGTACCCAAC 
TTTAAAATGG AGAGAAGTGG ACAGATAAGG 
CTCCTAGGGA ATGATGAAAA CAGCAGGCTA 

Seq ID NO: 18 Protein sequence:
Protein Accession #: CAA53571 



PCT/US02/12476 



MSSWIRWHGP 
NSVDPENITE 
NFTRNKLTSL 
SKNIPLANLQ 
NETSHTOGSL 
WCIPPTVKGN 
LIAKNEYGKD 
DVTDKTGREH 



11 
I 

AMARLWGFCW 
IFIANQKRLE 
SRKHFRHLDL 
IPNCGLPSAN 
RITNISSDDS 
PKPALQWFYN 
EKQISAHPMG 
LSVYAVWIA 



21 
I 

LWGFWRAAF 
IINEDDVEAY 
SELILVGNPF 
LAAPNLTVEE 
GKQISCVAEN 
GAILNBSKYI 
WPGIDDGANP 
SWGFCLLVM 



AGAAACATGC 
GGCCCCCAAT 
AATGACAGTT 
AGGCCAGCTA 
AGGCAGGAGC 
TTTAAAGCTT 
TTTTAAAGTG 
AGCTAGAAGG 
CCATTTARTA 
T 



31 
I 

ACPTSCKCSA 
VGLRNLTIVD 
TCSCDIMWIK 
GKSITLSCSV 
LVGEDQDSVN 
CTKIHVTNHT 
NYPDVIYEDY 
LFIjLIQjARHS 



CGCCAGTTCT 
GTGGGGAGGT 
TCTGTCATTA 
ATAGCAGAAA 
TCAGTATGGC 
TTATGTTATA 
TTCCAGACCC 
ATTGCAAGGT 
TATCAAAGAT 



41 
I 

SRIWCSDPSP 
SGLKFVAHKA 
TLQEAKSSPD 
AGDPVPNMYW 
LTVHFAPTIT 
EYHGCLQLDN 
GTAANDIGDT 
KFGMKGFVLF 



CAAGTTTTCC 
CGGAACATTT 
CTTAGATTCC 
TCATGACCCT 
AAAGGTTCTT 
CCATGGAGCC 
AAAAAGGAAA 
AGATTTTTCT 
CAGTTGACAT 



51 

I 

GIVAFPRLEP 

FLKNSNLQHI

TQDLYCLNES 

DVGKLVSKHM 

FIiESPTSDHH 

PTHMNNGDYT 

TNRSNEIPST 

HKIPLDG 



Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: NM_000228

Coding sequence: 82-3600 

1 11 21 31 

I I I I

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG 
GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT 
CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT 
CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT 
ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT
CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT
TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC
TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA
GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC
ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA
CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG
ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA
ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG
CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC
GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG
CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG
GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG
CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT 
GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC 
AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC 
GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG 
CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG 
AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG 
AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG 
CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC 
AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA 
CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG 
GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG 
TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG 
CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT 
CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA
GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG 
GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG 
ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG 
GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC 
GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA
ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA 
GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC 
GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA 
CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC 
ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC 
ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC 
TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG 
CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC
AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG 
GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC
CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG 
GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA 
GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT 
CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG
CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC
CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT 
AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA 
AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC 
GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG 
GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG 
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240 
300 
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420 
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CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC 
GAGCTGTTTG GGGAGACCAT GGAGATGATG 
CTGCGGGGCA GCCAGGCCAT CATGCTGCGC 
GTGGAGCAGA TCCGTGACCA CATCAATGGG 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC 
GATTGGGTTG GAATGCTTTC CATCTCCAGG 
GACCACCCCT GGTGTGTAGC TAGTAAGATT 
GGGACAGTTA CACTTGACAG ACAAAGATGG 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG 
GGAATCCTGG ACCAAGCACA AAAACTTAAC 
AAAATCTTTG G 

Seq ID NO: 20 Protein sequence:
Protein Accession #: NP_000219






1
I 

MRPFFUjCFA 
EWQMKCCKCD 
MEFQGPMPAG 
NARLNGGKVQ 
VSQLRLQQSC 
PWRPAEGQDA 
HYFRNRRPGA 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW 
TLQGLQLDLP 
AYEQSAQAAQ 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ 
RLQAEAEEAR 
VLRPAEKLVT 
IKQKYAELKD 
MLRSADLTGL 



11 

I 

LPGLLHAQQA 
SRQPHNYYSH 
MLIERSSDFG 
LNLMDLVSGI 
FCHGHADRCA 
HECQRCDCNG 
SIQBTCISCE 
CDCNILGSRR 
PTVQPVHEAV 
GRCLCRPGLT 
SGPGLEDRGL 
LEEETLSLPR 
QVSDSSRXiLD 
SRQMACTPIS 
QMIRAAEESA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQLGDFW 
RLGQS5MLGE 
EKRVEQIRDH 



21 

I 

CSRGACYPPV 
RVENVASSSG 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHFDPA 
CDPDGAVPGA 
DMPCDEESGR 
PCRJEGFGGLM 
GPRCDQCQRG 
ASRIIiDAKSK 
DLESLDRSFN 
QLRDSRREAE 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRGG 
TRMEELRHQA 
QGARIQSVKT 
IKGRVLYYAT 



CGGATCCAGA 
GACAGGATGA 
TCGGCGGACC 
CGCGTGCTCT 
ATCTGCCGCC 
AGACTTTCAT 
ACCCTGAGCT 
TGGAGATTGG 
TATCCCCCGC 
AAAAGTGATG 



31 
I 

GDLLVGRTRP 
PMRWWQSQND 
ADCTSTFPRV 
VGEITNLRVN 
TAVQVHDVCV 
VFAASQGAYG 
PCDPVTGQCV 
CLCLPNWGP 
CSAAA1ROCP 
YCNRYPVCVA 
IEQIRAVLSS 
GLiLTMYQRKR 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSASRSQ 
KMNEIQAIAA 
TVALQEAQDT 
RQOGAEAVQA 
EAEELFGETM 
CK 



GTGTGAAGAC 
AAGACATGGA 
TGACAGGACT 
ACTATGCCAC 
TTTGCTTTTG 
GCAGCCTAAA 
GCAGCTGAGC 
CATGCCATTG 
CTTTAGTTCT 
TAAAAATGAA 



41 
I 

LRASSTCGIiT 
VNPVSLQIiDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVOGERC 
KCDQCAPYHW 
DRTYGDVATG 
CHPCPQTYDA 
PAVTEQBVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MEEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 
EMMDRMKDME 



Seq ID NO: 21 DNA sequence 
Nucleic Acid Accession #: NM_003722 
Coding sequence: 145-1491 



1 
I 

TCGTTGATAT 
ACAGTACTGC 
AAAGAAAGTT 
CCAGAGGTTT 
ATTGACTTGA 
AGCATGGACT 
ACGAACCTGG 
AGTCCCTATA 
CCCAGCTCCA 
CCAGGCCCGC 
TGGACGTATT 
CAGATCAAGG 
AAAAAAGCTG 
GAATTCAACG 
CATGCCCAGT 
CCACCCCAGG 
TGTGTTGGAG 
GGGCAAGTCC 
AGGAAGGCGG 
GATGGTACGA 
AAACGAAGAT 
GAAATGCTGT 
ATTGAAACGT 
CTTTCAGCCT 
GACGTCTTCT 
TCTATATTTT 
TGTGTGTGCG 
CCCAACTGCT 
TTACAAGAAA 
GAACCACTGT 
GAAAGGGGCA 
AATTCACAGG 
AAAAAAGTTG 
CCCTTTTAAT 
TACTGCTGGG 
TTTGTGAGAA 
GCTGTGTACC 
CATGAAACCC 
CTCATTTTGT 
TGTTTACCAT 
AATTTGCTTA 
CTGATACTGT 
AGACGTGTTA 



11 

I 

CAAAGACAGT 
CCTGACCCTT 
ATTACCGATC 
TCCAGCATAT 
ACTTTGTGGA 
GTATCCGCAT 
GGCTCCTGAA 
ACACAGACCA 
CCTTCGATGC 
ACAGTTTCGA 
CCACTGAACT 
TGATGACCCC 
AGCACGTCAC 
AGGGACAGAT 
ATGTAGAAGA 
TTGGCACTGA 
GGATGAACCG 
TGGGCCGACG 
ATGAAGATAG 
AGCGCCCGTT 
CCCCAGATGA 
TGAAGATCAA 
ACAGGCAACA 
GCTTCAGGAA 
TTAGACATTC 
AAGTGTGTGT 
TGTGTATCTA 
CAAAGGCACA 
GGATGTTTTC 
GTTTGTCTGT 
TTAAGATGTT 
GAAGCTTTTG 
TTATTGTCTG 
GCTGGTCATG 
CAGCGAGGTG 
CTTGCATTAT 
TGCCTCTGCC 
TGGAAGACCT 
GCTTTTAATA 
TATTCAAAGC 
ATTAGAGCTT 
TCAGTGCATT 
AAATCAGCAC 



21 
I 

TGAAGGAAAT 
ACATCCAGCG 
CACCATGTCC 
CTGGGATTTT 
TGAACCATCA 
GCAGGACTCG 
CAGCATGGAC 
CGCGCAGAAC 
TCTCTCTCCA 
CGTGTCCTTC 
GAAGAAACTC 
ACCTCCTCAG 
GGAGGTGGTG 
TGCCCCTCCT 
TCCCATCACA 
ATTCACGACA 
CCGTCCAATT 
CTGCTTTGAG 
CATCAGAAAG 
TCGTCAGAAC 
TGAACTGTTA 
AGAGTCCCTG 
GCAACAGCAG 
TGAGCTTGTG 
CAAGCCCCCA 
GTTGTATTTC 
GCCCTCATAA 
AAGCCACTAG 
TGCAGATTTT 
GAGCTTTCTG 
TATTGGAACC 
AGCAGGTCTC 
TGCATAAGTA 
TAATAATATT 
ATCATTACCA 
TTGTGTCCTC 
ACTGTATGTT 
ACTACAAAAA 
GAAAGACAAA 
TCAAAATAGA 
CTATCCCTCA 
TAGCCAGGAG 
TCCTGGACTG 



31 
I 

GAATTTTGAA 
TTTCGTAGAA 
CAGAGCACAC 
CTGGAACAGC 
GAAGATGGTG 
GACCTGAGTG 
CAGCAGATTC 
AGCGTCACGG 
TCACCCGCCA 
CAGCAGTCGA 
TACTGCCAAA 
GGAGCTGTTA 
AAGCGGTGCC 
AGTCATTTGA 
GGAAGACAGA 
GTCTTGTACA 
TTAATCATTG 
GCCCGGATCT 
CAGCAAGTTT 
ACACATGGTA 
TACTTACCAG 
GAACTCATGC 
CAGCACCAGC 
GAGCCCCGGA 
AACCGATCAG 
CATGTGTATA 
ACAGGACTTG 
TGAGAGAATC 
GTATCCTTAG 
TTGTTTCCTG 
CTTTTCTGTC 
AAACTTAAGA 
AGTTGTAGGT 
GCAAGTAGTA 
AAAGTAATCA 
CCCTCATGTG 
GGCATCTGTT 
AACTGTTGTT 
TCCACCCCAG 
ATTTGAAGCC 
AGCCTACCTA 
ACTTACGTTT 
GAAATTAAAG 



41 

I

ACTTCACGGT 
ACCCAGCTCA 
AGACAAATGA 
CTATATGTTC 
CGACAAACAA 
ACCCCATGTG 
AGAAGGGCTC 
CGCCCTOGCC 
TCCCCTCCAA 
GCACCGCCAA 
TTGCAAAGAC 
TCCGCGCCAT 
CCAACCATGA 
TTCGAGTAGA 
GTGTGCTGGT 
ATTTCATGTG 
TTACTCTGGA 
GTGCTTGCCC 
CGGACAGTAC 
TCCAGATGAC 
TGAGGGGCCG 
AGTACCTTCC 
ACTTACTTCA 
GAGAAACTCC 
TGTACCCATA 
TGTGAGTGTG 
AAGACACTTT 
TTTTGAAGGG 
ACCGGCCATT 
GGAGGGAGGG 
TTCTTCTGTT 
TGTCTTTTTA 
GACTGAGAGA 
AGAAACGAAG 
ACTTTGTGGG 
TAGGTAGAAC 
ATGCTAAAGT 
TGGCCCCCAT 
TAATATTGCC 
CTCTCACAAA 
CCATAAAACC 
TGAGTAAGTG 
ATTGAAAGGG 



AGAGGCAGAG 
GTTGGAGCTG 
GGAGAAGCGT 
CTGCAAGTGA 
GTTGGGGGCA 
GTACAGCCTG 
CTGAGCCAAT 
AAACTAAGAG 
CCACTGGGGA 
AAGCCAAATA 



51 
I 

KPETYCTQYG 
DRRFQLQEVM 
VRCQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQL 
DLCKPGFTGL 
KLASGQGCEP 
CRACDCDFRG 
DLREQALRFG 
VASAILSIiRR 
PSGAFRMLST 
RLEMSSLPDL 
MAGQVAEQLR 
IiIQQVRDFLT 
QTKQDIARAR 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 



51 
I 

GTGCCACCCT 
TTTCTCTTGG 
ATTCCTCAGT 
AGTTCAGCCC 
GATTGAGATT 
GCCACAGTAC 
CTOGTCCACC 
CTACGCACAG 
CACCGACTAC 
GTCGGCCACC 
ATGCCCCATC 
GCCTGTCTAC 
GCTGAGCCGT 
GGGGAACAGC 
ACCTTATGAG 
TAACAGCAGT 
AACCAGAGAT 
AGGAAGAGAC 
AAAGAACGGT 
ATCCATCAAG 
TGAGACTTAT 
TCAGCACACA 
GAAACATCTC 
AAAACAATCT 
GAGCCCTATC 
TGTGTGTGTA 
GGCTCAGAGA 
ACTCAAACCT 
GGTGGGTGAG 
GTCAGGTGGG
GTTTTTCTAA 
AGAAAAGGAG 
CTCAGTCAGA 
GTGTCAAGTG 
TGGAGAGTTC 
ATTTCTTAAT 
TTTTCTTGTA 
AGCAGGTGAA 
CTTACGTAGT 
ATCTGTGATT 
AGCCATATTA 
AGATCCAAGC 
TAGACTACTT 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580
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TTCTTTTTTT TACTCAAAAG TTTAQAGAAT CTCTGTTTCT TTCCATTTTA AAAACATATT 2640 
TTAAOATAAT AGCATAAAGA CTTTAAAAAT GTTCCTCCCC TCCATCTTCC CACACCCAGT 2700 
CACCAOCACT GTATTTTCTG TCACCAAGAC AATGATTTCT TGTTATTGAG GCTGTTGCTT 2760 
TTGTGGATGT GTGATTTTAA TTTTCAATAA ACTTTTGCAT CTTGGTTTAA AAGAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession #: NP_003713 

1 11 21 31 41 51 

I I I I I I

MSQSTQTNEF LSPEVFQHIW DFLEQPICSV QPIDLNFVDE PSEDGATNKI EISMDCIRMQ 60 
DSDLSDPMWP QYTNLGLLNS MDQQIQNGSS STSPYNTDHA QNSVTAPSPY AQPSSTPDAL 120 
SPSPAIPSNT DYPGPHSPDV SFQQSSTAKS ATWTYSTELK KLYCQIAKTC PIQIKVMTPP 180 
PQGAVIRAMP VYKKAEHVTB WKRCPNHEL SREFNEGQIA PPSHLIRVEG NSHAQYVEDP 240 
ITGRQSVLVP YEPPQVGTEF TTVLYNFMCN SSCVGGMNRR PILIIVTLKT RDGQVLGRRC 300 
FEARICACPG RDRKADEDSI RKQQVSDSTK NGDGTKRPFR QNTHGIQMTS IKKRRSPDDE 360 
LLYLPVRGRE TYEMLLKIKE SLELMQYLPQ HTIETYRQQQ QQQHQHLLQK HLLSACFRNE 420 
LVEPRRETPK QSDVFFRHSK PPNRSVYP 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: NM_001944.1

Coding sequence: 84-3083 

1 11 21 31 41 51 

I I I I I I

TTTTCTTAGA CATTAACTGC AGACX3GCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTOGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CGGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATOGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGOGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCGATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 24 Protein sequence:
Protein Accession #: NP_001935.1 

1 11 21 31 41 51 
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I I I I 

MMGLFPRTTG ALAIFVWIIj VHGELRIETK GQYDEEEMTM 
GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF 
PSFLITCRAL NAQGLDVEKP LILTVKILDI HDNPPVPSQQ 
ATDADEPNHL NSKIAFKIVS QEPAGTPMPL LSRNTGEVRT 
DKDGEGLSTQ CECNIKVKDV NDNPPMFRDS QYSARIEENI 
WLAVYFFTSG NEGNWPEIQT DPRTNEGILK WKALDYEQL 
SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK 
SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK 
VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY 
ITTTiKATSAX) LRAQEQIPPG VYHISLVLTD SQNNRCEMPR 
TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT 
GSEGTIHQWG IBGAHPEDKE ITNICVPPVT ANGADFMESS 
MTTKLGAATE SGGAAGFATG TVSGAASGPG AATGVGICSS 
DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG 
LDDSFLDSLG PKFKKIiAEIS LGVDGEGKEV QPPSKDSGYG 
LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NO: 25 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 56-1642 
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QQAKRRQKRE 


WVKPAKPCRE 


60 


WDKNTGDIN 


ITAIVDREET 


120 


IFMGEIEENS 


ASNSLVHILN 


180


LTNSLDREQA 


SSYRLWSGA 


240 


LSSELLRPQV 


TDLDEEYTDN 


300 


QSVKLSIAVK 


NKAEFHQSVI 


360 


LVDYILGTYQ 


AIDEDTNKAA 


420 


TITAEVLAID 


EYTGKTSTGT 


480


TGPYT PALED 


QPVKLPAVWS 


540 


SLTLEVCQCD 


NRGICGTSYP 


600 


CDCGAGSTGG 


VTGGFIPVPD 


660 


EVCTNTYARG 


TAVEGTSGME 


720 


GQSGTMRTRH 


STGGTNKDYA 


780 


ADATGSPVGS 


VGCCSFIADD 


840 


IESCGHPIEV 


QGTGFVKCQT 


900 


GSLVQPSTAG 


FDPLLTQNVI 


960 



1 

I 

AGTATCCCAG 
GCAAGGGATC 
CATGTTTGAG 
CTGCTCTGTC 
GGAGAAGGTG 
GGAAGATCAG 
GGACTCTTTT 
CTTTTCCCAG 
GGAGATGGTA 
TAACTCAGGG 
GTCCCTGGCG 
GCCCTTGCTC 
GAAGAAGCTG 
GAGGAGTGTC 
TGGGCTCTCT 
ATGGGCACAG 
GATCTCATTC 
ACAGCGCAAG 
AGATCTCAAC 
TCGTAAGAAC 
CATCTTCTCA 
CGAGCTGTCA 
ACGGTTGAAG 
TGCCCTTCGT 
CAAGTTGACT 
CAATGTGAAT 
CATTGCTAGC 
ACTCGTTCAT 
CAGACACAGG 
AGGAGCTCCT 
AGCTACAGCT 
AACAGCGGGA 
TGTATGAAGA 
TTCAGGAGCG 
AGTCAGTGGC 
CAGCTTCTGC 
CAGAGCTAAA 
CCTCAGCCAA 
TAAGGCTGTT 
CTTGTTGCCA 
TCTTAATCAA 
TGGACCTTCG 
AAGGCCAGGT 
AACAACCACC 
AAAGCTCAAC 
TCAAATCTGG 
GGCCCTGAGG 
TATCAGGAAT 
TATAACCACC 
GCACACAAAA 
GTAGCAAAAT 



11 
I 

GAGGAGCAAG 
CTTTCTCCGC 
TCCACAGCTG 
GTCTCTACCT 
AAAGTATACT 
GGTTGTGTCC 
GCCCTGAAGA 
ATCTTTGGGC 
AAGGATGTAC 
AAAACCCACA 
CTGATCTTCA 
TCCAATGAGG 
TCCCTGCTAA 
TACATCGAAA 
TCTATCAGTC 
CCAGACACTG 
TTTGAGATCT 
AGGCAGACTT 
TGGATTCATG 
CAGAGCTTTG 
ATCAGGATCC 
CTCTGTGATC 
GAAGCAGGAA 
CAAAACCAGC 
CGAGTGTTCC 
CCCTGTGCAT 
CAGG7GACTT 
CAAGGAACAT 
CCTTGATGAT 
ACAAGTTGTG 
GGAGATGCAT 
ACAGTGGTGC 
AAAACTAAAT 
GGATGAAAAG 
CCATCAGCAA 
CTCCACCCAG 
CTCTACCACT 
GCCCTTCACC 
GCGGACAGAG 
CAGCACTGGG 
ACAGGACCAG 
GAAGAAGGCA 
TTCTGCCAAA 
AGGGAAGAAA 
AGACTGCAGC 
GCCTTTTGGC 
TGGGTCAGCT 
TATATCCAGG 
TATGTAATCT 
ACAGTTATAT 
CATTAAAACA 



21 

I 

TGGCACGTCT 
CAGCGGGCTT 
CAGATTTGGG 
CCCTAGAGGA 
TGAGGGTTAG 
GTATTGAGAA 
GCAATGAACG 
CAGAAGTGGG 
TCAAAGGGCA 
CGATTCAAGG 
ATAGCCTCCA 
TAATCTGGCT 
ATGGAGGCCT 
GTCGGATAGG 
AGTGTACCAG 
CCCCACTACC 
ACAACGAACT 
TGCGGCTATG 
TGCAAGATGC 
CCAGCACCCA 
TACACCTTCA 
TGGCTOGCTC 
ACATTAACAC 
AGAACCGGTC 
AAGGTTTCTT 
CTACCTATGA 
GTGCATGCCC 
AGTCTTCAGG 
GATATTGAAA 
GAAGCCATGA 
CTCCGAGATG 
AGTGAACATT 
ATCCTCAAGG 
ATTGAAGAGC 
TCAGGGTCTG 
CAGCTTCAGG 
GAAGAGTTGC 
ATTGATGTGG 
CTTCAGAAAC 
GCAGGAAAAC 
ACTCTGGCTG 
GCATGTATTG 
AAGCGCCTTG 
CCATTCCTTC 
CCTTATGCCC 
AAAAAGTACT 
ACTCTCCTGA 
ATGCAATACT 
CATGTTGTTG 
TAAAGATATT 
AATTATAAAA 



31 
I 

TCGGACCTAG 
GCTGTCCGAT 
GTCTGTGGTA 
CAAGCAGCAG 
GCCCTTGTTA 
TGTGGAGACC 
GGGAATTGGC 
ACAGGCATCC 
GAACTGGCTC 
TACCATCAAG 
AGGCCAACTT 
AGACAGCAAG 
CCAAGAGGAG 
TACCAGCACC 
CAGTAGCCAG 
TGTCCCGGCA 
GCTTTATGAC 
CGAGGATCAA 
TGAGGAGGCC 
CCTCAACCAG 
GGGGGAAGGA 
AGAGCGCTGC 
CTCTCTACAC 
AAAGCAGAAC 
CACAGGCCGA 
TGAAACTCTT 
CACCTATGCA 
TATCCCCCAG 
ATGAAGCTGA 
AGACACTGCT 
AAATTTGCAA 
TGGACACCCA 
AGTCACTGAC 
TAGAAGCTCT 
AATTGGCCCT 
AGGTTAAAGC 
ATAAGTATCA 
ACAAGAAGTT 
TTGGTGAGTC 
TTCGTCAAGC 
AACTGCAGAA 
CTGAGCAGTA 
GTACCAACCA 
GAAATTTACT 
GGATCCTACG 
AAGGCTGTGG 
AGAAATAGGT 
CAGACACTAG 
TTTTTTTTTA 
ATTGTTCACA 
GGGACAGAAA 



41 
I 

GCTGCCCCTG 
GACGATGTCG 
CGCAAGAACC 
GTTCCATCTG 
CCTTCAGAGT 
CTTGTTCTAC 
CAAGCCACAC 
TTCTTCAACC 
ATCTATACAT 
GATGGAGGGA 
CATCCAACAC 
CAGATCCGAC 
GAGCTGTCCA 
AGCTrCGACA 
CTGGATGAAA 
AACATTCGCT 
CTATTAGAAC 
AATGGCAATC 
TGGAAGCTCC 
AACTCCAGCC 
GATATAGTCC 
AAAGATCAGA 
ACCCTGGGCC 
CTGGTTCCCT 
GGCCGTTCCT 
CATGTGGCCA 
ACTGGGATTC 
CTTAGAGAAA 
CATCTCCATG 
TTTGAAGGAA 
TGAGATGGTA 
AAAGGAACTA 
AAGTTTTTAC 
CTTGCAGGAA 
ACGGCGGTCA 
TAAATTACAG 
GAAAATGTTA 
AGAAGAGGGC 
TCTCCAATCA 
CTTGACCACT 
CAACATGGTG 
TCATACTGTG 
GGAAAATCAG 
TCCCCGAACA 
CTCACGGCGT 
GGAAAGAGAA 
CTCTTTTATG 
CTTTTTTCTC 
TTTACTTATA 
TTTTTTATTG 
AA 



51 
I 

CCGTCATGTC 
TAGTTTCTCC 
TGCTATCAGA 
AGGACAGTAT 
TGGAACGACA 
AAGCACCCAA 
ACAGGTTCAC 
TAACTGTGAA 
ATGGAGTCAC 
TTCTCCCCCG 
CTGATCTGAA 
AGGAGGAAAT 
CTTCCTTGAA 
GTGGCATTGC 
CAAGTCATCG 
TCTCCATCTG 
CGCCTAGCCA 
CCTATGTGAA 
TAAAAGTGGG 
GCAGTCACAG 
CCAAGATCAG 
AGAGTGGTGA 
GCTGTATTGC 
TCCGTGACAG 
GCATGATTGT 
AGTTCTCAGC 
CCATCCCTGC 
GGGGCTAAGG 
TATGGCAAAG 
CGACAGGAAA 
GAACAGATGC 
TTGGAGGAAA 
CAAGAAGAGA 
GCCAGACAAC 
CAAAGGTTGG 
CAGTGCAAAG 
GAACCACCAC 
CAGAAGAATA 
GCAGAGAGAG 
TGTGATGACA 
CTAGTGAAAC 
TTGAAACTCC 
CAACCAAACC 
CCAACCTGCC 
TCCCCTTTAC 
GAGCAGTCAT 
CTTTACCATA 
ACTTTTGTAT 
TGATTTCTAT 
AATTCCAAAT 



Seq ID NO: 26 Protein sequence:
Protein Accession #: Eos sequence



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



1 11 

I I 
MSQGILSPPA GLLSDDDVW 
SMEKVKVYLR VRPLLPSELE 
FTFSQIFGPE VGQASFFNLT 
PRSLALIFNS LQGQLHPTPD 
LKRSVYIESR IGTSTSFDSG 



21 31 41 51 

I I I I

SPMFESTAAD LGSWRKNLL SDCSWSTSL EDKQQVPSED 
RQEDOGCVRI ENVETLVLQA PKDSFALKSN ERGIGQATHR 
VKEMVKDVLK GQNWLIYTYG VTNSGKTHTI QGTIKDGGIL 
LKPLLSNEV1 WLDSKQIRQE EMKKLSLLNG GLQEEELSTS 
IAGLSSISQC TSSSQLDETS HRWAQPDTAP LPVPANIRFS 



60 
120 
180 
240 
300 




IWISFPEIYN ELLYDLLEPP S00RKRQTLR LCEDQNGNPY VKDLNWIHVQ DAEEAWKLLK 
VGRKNQSFAS THLNQNSSRS HSIFSIRILH LQGEGDIVPK ISELSIiCDLA GSERCKDQKS 
GERLKEAGNI NTSLHTUJRC IAALRQNQQN RSKQNLVPFR DSKLTRVFQG FFTGRGRSCM 
IVNVNPCAST YDETLHVAXF SAIASQVTCA CPTYATGIPI PALVHQGT 

Seq ID NO: 27 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 13-1424






360 
420 
480 



TAGAAGTTTA 
CTTCCCCTGA 
TTAGAAAAAT 
GGAAACTTAA 
GGGCAACTGG 
GTCCATCATT 
AGAATCAATA 
GCTTTCCAAG 
GCTGACATTT 
AAAGGTGGAA 
TTCGATGAGG 
GTTCACGAGA 
TTCCCCACCT 
GGCATTCAGT 
TCAGAACCAG 
AAGATCTTTT 
AGTGTTAATT 
GAAATTGAAG 
AATTTAAGAC 
GTGAAAAAAA 
GATAACCAGT 
CTGATTACCA 
AACAAATACT 
CGTATCACCA 
TGGTTTTTGT 
GTGTACCACT 
TTATATAAAA 
CTCTACTATT 
CTCTGTAAGT 
TAAAATTAAG 




TCCTAGCCCA 
ACGAATTCTG 
TTGGCCATTC 
ACAAATATGT 
CCCTGTATGG 
CTCTCTGTGA 
TCTTCAAAGA 
TAATTTCTTC 
CCAGAAATCA 
CAGAGCCAAA 
TTGATGCAGC 
ATTGGAGGTA 
AGAACTTCCA 
ACTATTTCTT 
AAACACTGAA 
TAGTTCACTT 
ACTTAGAGAT 
TACATAATAT 
AAGTTTGAAA 
TGCTTCCTAA 
TATATATATT 



21 
I 

TCTTCTAATA 
AAGCCTGGAA 
TGAGATAAAC 
AATCCAAGAA 
CCTGGAGATG 
GCCAGGGGGG 
TGACATGAAC 
TGTTACCCCC 
TGCCOGTGGA 
TGCTTTTGGA 
GACTACACAT 
CTTAGGTCTT 
TGACATGAAC 
AGACCCAAAA 
CCCCAATTTG 
CAGGTTCTTC 
CTTATGGCCA 
AGTTTTTCTT 
TTATCCCAAG 
TGTTTTTAAC 
TGATGAAAGG 
AGGAATCGGG 
CCAAGGATCT 
AAGCAATAGC 
CAGCTTAATA 
ATGTATCATA 
TTTTCAATTT 
ATAGTTACCT 
CATCCTTGGA 
TTGGCTCAAA 



31 
I 

CTGCTCCTGC 
AAAAATAATG 
AAACTTCCAG 
ATGCAGCACT 
ATGCACGCAC 
CCCGTATGGA 
CGTGAGGATG 
TTGAAATTCA 
GCTCATGGAG 
CCTGGATCTG 
TCAGGAGGCA 
GGCCATTCTA 
ACATTTCGCC 
GAGAACCAAC 
AGTTTTGATG 
TGGCTGAAGG 
ACCTTGCCAT 
TTTAAAGATG 
AGCATACATT 
CCACGTTTTT 
AGACAGATGA 
CCTAAAATTG 
AACCAATTTG 
TGGTTTGGTT 
AGTATTTATT 
AAAATAAAAT 
TGAAAACTCT 
TCAAAGCAAG 
CTGAGAAATT 
TAAAATTG 



41 

I 

AGGCCACTGC 
TGCTATTTGG 
TGACAAAAAT 
TCTTGGGTCT 
CTCGATGTGG 
GGAAACATTA 
TTGACTACGC 
GCAAGATTAA 
ACTTCCATGC 
GCATTGGAGG 
CAAACTTGTT 
GTGATCCAAA 
TCTCTGCTGA 
GCTTGCCAAA 
CTGTCACTAC 
TTTCTGAGAG 
CTGGCATTGA 
ACAAATACTG 
CTTTTGGTTT 
ATAGGACCTA 
TGGACCCTGG 
ATGCAGTCTT 
AATATGACTT 
GTTGAAAATG 
GCATATTTGC 
CTGTAAACCA 
AATTGTCCAT 
ATAATTCTAT 
ATACTTACTT 



Seq ID NO: 28 Protein sequence:
Protein Accession #: Eos sequence



1 

I 

MKFIiLILLLQ 
KEKIQEW2HF 
YTPDMNREDV 
LAHAPGPGSG 
KYVDINTFRL 
FKDRFFWLKV 



NFQGIGPKID 



11 
I 

ATASGALPLN 
LGUCVTGQLD 
DYAIRKAFQV 
IGGDAHPDED 
SADDIRGIQS 
SERPKTSVNL 
FGPPNFVKKI 
AVFYSKNKYY 



21 

I

SSTSLEKNNV 
TSTLEMMHAP 
WSNVTPLKFS 
EFWTTHSGGT 
LYGDPKENQR 
ISSLWPTLPS 
DAAVFNPRFY 
YFFQGSNQFE 



31 

I 



RCGVPDVHHF 
KINTGMADIL 
NLFLTAVHEI 
LPNPDNSEPA 
GIEAAYEIEA 
RTYFFVDNQY 
YDFLLQRITK 



41 

I 

YGLEINKLPV 
REMPGGPVWR 
WFARGAHGD 
GHSLGLGHSS 
LCDPHLSFDA 
RNQVFLFKDD 
WRYDERRQMM 
TLKSNSWFGC 



Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_006115.1

Coding sequence: 236..1765



1 
I 

GCTTCAGGGT 
CGGGACACCC 
ACTCTCTGAG 
GAGACCTAGA 
ACGAAGGCGT 
CCCACGGAGA 
TGCCGCCCTG 
CGGGAGACAC 
TCTGGGAGTG 
TGGACTTGAT 
GGATTTACGG 
TCTGTACTCA 
TGGTTTGAGC 
CCTCAAGGAA 
GAAAAATGTA 
TATCAAGATG 
TACCTGGAAG 
GCGTAGACTC
GCAGTATATC 
TGTGGACTCT 
CCCCTTGGAA 
GTCCCAGAGT 
CGATGTAAGT 
CCTGGTCTTT 
GAGCCACTGC 



11 

I 

ACAGCTCCCC 
CACCCGCTTC 
GAAAAACCAT 
AATCCAAGCG 
TTGTGGGGTT 
CTTGTGGAGC 
GAGTTGCTGC 
AGCCAGACCC 
CTGATGAAGG 
GTGCTCCTTG 
AAGAACTCTC 
TTTCCAGAGC 
ACAGAGGCAG 
GGTGCCTGTG 
CTACGCCTGT 
ATCCTGAAAA 
CTACCCACCT 
CTCCTCTCCC 
GCCCAGTTCA 
TTATTTTTCC 
ACCCTCTCAA 
CCCAGCGTCA 
CCCGAGCCCC 
GATGAGTGTG 
TCCCAGCTTA 



21 
I 

CGCAGCCAGA 
CCAGGCGTGA 
TTTGATTATT 
TTGGAGGTCC 
CCATTCAGAG 
TGGCAGGGCA 
CCAGGGAGCT 
TGAAGGCAAT 
GACAACATCT 
CCCAGGAGGT 
ATCAGGACTT 
CAGAAGCAGC 
AGCAGCCCTT
ATGAATTGTT 
GCTGTAAGAA 
TGGTGCAGCT 
TGGCGAAATT 
ACATCCATGC 
CCTCTCAGTT 
TTAGAGGCCG 
TAACTAACTG 
GTCAGCTAAG 
TCCAAGCTCT 
GGATCACGGA 
CAACCTTAAG 



31 

I 

AGCCGGGCCT 
CCTGTCAACA 
ACTCTCAGAC 
TGAGGCCAGC 
CCGATACATC 
GAGCCTGCTG 
CTTCCCGCCA 
GGTGCAGGCC 
TCACCTGGAG 
TCGCCCCAGG 
CTGGACTGTA 
TCAGCCCATG 
CATTCCAGTA 
CTCCTACCTC 
GCTGAAGATT 
GGACTCTATT 
TTCTCCTTAC 
ATCTTCCTAC 
CCTCAGTCTG 
CCTGGATCAG 
CCGGCTTTCG 
TGTCCTGAGT 
GCTGGAGAGA 
TGATCAGCTC 
CTTCTACGGG 



41 
I 

GCAGCCCCTC 
GCAACTTCGC 
GTGCGTGGCA 
CTAAGTCGCT 
AGCATGAGTG 
AAGGATGAGG 
CTCTTCATGG 
TGGCCCTTCA 
ACCTTCAAAG 
AGGTGGAAAC 
TGGTCTGGAA 
ACAAAGAAGC 
GAGGTGCTCG 
ATTGAGAAAG 
TTTGCAATGC 
GAAGATTTGG 
CTGGGCCAGA 
ATTTCCCCGG 
CAGTGCCTGC 
TTGCTCAGGC 
GAAGGGGATG 
CTAAGTGGGG 
GCCTCTGCCA 
CTTGCCCTCC 
AATTCCATCT 



51 
I

TTCTGGAGCT 
TGAAAGATAC 
GAAATATAGT 
GAAAGTGACC 
AGTCCCCGAT 
TATCACCTAC 
AATCCGGAAA 
CACAGGCATG 
TTTTGATGGC 
GGATGCACAT 
CCTCACTGCT 
GGCCGTAATG 
TGACATACGT 
TCCTGACAAT 
CGTGGGAAAT 
ACCAAAGACC 
AGCTGCTTAT 
GTTAATTAGC 
TCCTAACTTT 
CTTCTTTGTA 
TTATCCCAAA 
CTACTCTAAA 
CCTACTCCAA 
GTGTAATTAA 
TATGTCCTCA 
TAGGTAATGA 
TCTTGCTTGA 
TTGAAGCATG 
CTGGCATAAC 



51 
I 

TKMKYSGNLM 
KHYITYRINN 
FHAFDGKGGI 
DPKAVMFPTY 
VTTVGNKIFF 
KYWLISNLRP 
DPGYPKLITK 



51 
I 

AGCACCGCTC 
GGTGTGGTGA 
ACAAGTGACT 
TCAAAATGGA 
TGTGGACAAG 
CCCTGGCCAT 
CAGCCTTTGA 
CCTGCCTCCC 
CTGTGCTTGA 
TTCAAGTGCT 
ACAGGGCCAG 
GAAAAGTAGA 
TAGACCTGTT 
TGAAGCGAAA 
CCATGCAGGA 
AAGTGACTTG 
TGATTAATCT 
AGAAGGAAGA 
AGGCTCTCTA 
ACGTGATGAA 
TGATGCATCT 
TCATGCTGAC 
CCCTCCAGGA 
TGCCTTCCCT 
CCATATCTGC 



60 
120 

180

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 




CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG 
TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA 
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC 
ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG 
GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT 
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

Seq ID NO: 30 Protein sequence:
Protein Accession #: NP_006106.1






1
I

GCTTCAGGGT 
CGGGACACCC 
ACTCTCTGAG 
GAGACCTAGA 
ACGAAGGCGT 
CCCACGGAGA
TGCCGCCCTG 
CGGGAGACAC 
TCTGGGAGTG 
TGGACTTGAT 
GGATTTACGG 
TCTGTACTCA 
TGGTTTGAGC 
CCTCAAGGAA 
GAAAAATGTA 
TATCAAGATG 
TACCTGGAAG 
GCGTAGACTC 
GCAGTATATC 
TGTGGACTCT 
CCCCTTGGAA 
GTCCCAGAGT 
CGATGTAAGT 
CCTGGTCTTT 
GAGCCACTGC 
CTTGCAGAGT 
TGTCCCCCTG 
TCTGCATGCC 
TAGTGCCAAC 
GTGCCCCTGT 
TTGGACACTA 
ACAAATGTTC 
GTTCAGTGAG 
GTGATCTTTG 
GATTCTGGCT 
TGTTGAAAAT 



11 
I 

ACAGCTCCCC 
CACCCGCTTC 
GAAAAACCAT 
AATCCAAGCG 
TTGTGGGGTT 
CTTGTGGAGC 
GAGTTGCTGC 
AGCCAGACCC 
CTGATGAAGG 
GTGCTCCTTG 
AAGAACTCTC 
TTTCCAGAGC 
ACAGAGGCAG 
GGTGCCTGTG 
CTACGCCTGT 
ATCCTGAAAA 
CTACCCACCT 
CTCCTCTCCC 
GCCCAGTTCA 
TTATTTTTCC 
ACCCTCTCAA 
CCCAGCGTCA 
CCCGAGCCCC 
GATGAGTGTG 
TCCCAGCTTA 
CTCCTGCAGC 
GAGAGTTATG 
AGGCTCAGGG 
CCCTGTCCTC 
TTCATGCCTA 
AAGCCAGGAT 
AGTGTGAGTG 
GAAAAAAAGG 
GGGAGATACA 
TGGGAAGTAC 
AAAGAGAAGC 



21 
I 

CGCAGCCAGA 
CCAGGCGTGA 
TTTGATTATT 
TTGGAGGTCC 
CCATTCAGAG 
TGGCAGGGCA 
CCAGGGAGCT 
TGAAGGCAAT 
GACAACATCT 
CCCAGGAGGT 
ATCAGGACTT 
CAGAAGCAGC 
AGCAGCCCTT 
ATGAATTGTT 
GCTGTAAGAA 
TGGTGCAGCT 
TGGCGAAATT 
ACATCCATGC 
CCTCTCAGTT 
TTAGAGGCCG 
TAACTAACTG 
GTCAGCTAAG 
TCCAAGCTCT 
GGATCACGGA 
CAACCTTAAG 
ACCTCATCGG 
AGGACATCCA 
AGTTGCTGTG 
ACTGTGGGGA 
ACTAGCTGGG 
GTGCATGCAT 
AGGAAAACAT 
GGAAGTTGGG 
TCTTATAGAG 
ATGTAGGAGT 
AATGTGAAGC 



31 
I 

AGCCGGGCCT 
CCTGTCAACA 
ACTCTCAGAC 
TGAGGCCAGC 
CCGATACATC 
GAGCCTGCTG 
CTTCCCGCCA 
GGTGCAGGCC 
TCACCTGGAG 
TCGCCCCAGG 
CTGGACTGTA 
TCAGCCCATG 
CATTCCAGTA 
CTCCTACCTC 
GCTGAAGATT 
GGACTCTATT 
TTCTCCTTAC 
ATCTTCCTAC 
CCTCAGTCTG 
CCTGGATCAG 
CCGGCTTTCG 
TGTCCTGAGT 
GCTGGAGAGA 
TGATCAGCTC 
CTTCTACGGG 
GCTGAGCAAT 
TGGTACCCTC 
TGAGTTGGGG 
CAGAACCTTC 
TGCACATATC 
CTTGAAGCAA 
GTTCAGTGAG 
GATAGGCAGA 
TTAGAAATAG 
TAATCCCTGT 
AAAAAAAAAA 



41
I 

GCAGCGCCTC 
GCAACTTCGC
GTGCGTGGCA 
CTAAGTCGCT 
AGCATGAGTG 
AAGGATGAGG 
CTCTTCATGG 
TGGCCCTTCA 
ACCTTCAAAG 
AGGTGGAAAC 
TGGTCTGGAA 
ACAAAGAAGC 
GAGGTGCTCG 
ATTGAGAAAG 
TTTGCAATGC 
GAAGATTTGG 
CTGGGCCAGA 
ATTTCCCCGG 
CAGTGCCTGC 
TTGCTCAGGC 
GAAGGGGATG 
CTAAGTGGGG 
GCCTCTGCCA 
CTTGCCCTCC 
AATTCCATCT 
CTGACCCACG 
CACCTGGAGA 
CGGCCCAGCA 
TATGACCCGG 
AAATGCTTCA 
CAAAGCAGCC 
GAAAAAACAT 
TGTTGACTTG 
AATCTGAATT 
GTAGACTGTT 



TGCTGTATCC 
GGCTTGCCTA 
TGGTCTGGCT 
AGCCCATCCT 
TTCTGCATAC 
ACAGTTTCAG 
TCAGACAAAT 
AGGAGTTAAT 
TCTAAAGGGA 
GTAAAGAAAC 



51 
I 

AGCACCGCTC 
GGTGTGGTGA 
ACAAGTGACT 
TCAAAATGGA 
TGTGGACAAG 
CCCTGGCCAT 
CAGCCTTTGA 
CCTGCCTCCC 
CTGTGCTTGA 
TTCAAGTGCT 
ACAGGGCCAG 
GAAAAGTAGA 
TAGACCTGTT 
TGAAGCGAAA 
CCATGCAGGA 
AAGTGACTTG 
TGATTAATCT 
AGAAGGAAGA 
AGGCTCTCTA 
ACGTGATGAA 
TGATGCATCT 
TCATGCTGAC 
CCCTCCAGGA 
TGCCTTCCCT 
CCATATCTGC 
TGCTGTATCC 
GGCTTGCCTA 
TGGTCTGGCT 
AGCCCATCCT 
TTCTGCATAC 
ACAGTTTCAG 
TCAGACAAAT 
AGGAGTTAAT 
TCTAAAGGGA 
GTAAAGAAAC 



Seq ID NO: 31 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 64-2754 



1 11 21 31 41 51

I I I I I I

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 

GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 

CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700

AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 

TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 

ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 

ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 

GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 

TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 

GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 

GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020

CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 

TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 

CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440

GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 

GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 

TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980

ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280

ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460

TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520

CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 

TACAGATGTG GGGAGATGTA ATAAAAGAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGGATGGA 5940 

AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 

ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 

GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 

AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 

GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 

CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 

AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

TGAGAAGGAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780

ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900

TATATATAAT CCCGAAACAT G

Seq ID NO: 32 Protein sequence:
Protein Accession #: NP_001932.1



1 
1 

MAAAGPRRSV 
ADLIRSSDPD 
KTRHTRETVL 
EPLNLFYIER 
PVFTEAIYNF 
TGVITTVSHY 
EAPVEENAFN 
KPLNYEENRQ 
KENLAVGSKI 
KNELYNITVL 
EPVHGAPFYF 
TKLLRVNLCE 
KRFPEDLAQQ 
MMKGGNQTLE 
DRMPSQDYVL 



11 
I 

RGAVCLHLLL
FRVLNDGSVY 
RRAKRRWAPI 
DTGNLFCTRP
EVLESSRPGT
LDREVVDKYS
VEILRIPIED 
VNLEIGVNNE 
NGYKAYDPEN 
AIDKDDRSCT 
SLPNTSPEIS
CTHPTQCRAT 
NLIISNTEAP 



TYNYEGRGSP



21 
I 

TLVIFSRDGE 
TARAVALSDK 
PCSMQENSLG 
VDREEYDVFD 
TVGVVCATDR
LIMKVQDMDG
KDLINTANWR
APFARDIPRV
RNGNGLRYKK 
GTLAVNIEDV 
RLWSLTKVND 
SRSTGVILGK 
GDDRVCSANG 
LDSCRGGHTE 
AGSVGCCSEK 



31 
I 

ACKKVILNVP 
KRSFTIWLSD
PFPLFLQQVE 
LIAYASTADG 
DEPDTMHTRL 
QFFGLIGTST 
VNFTILKGNE 
TALNRALVTV 
LHDPKGWITI
NDNPPEILQE
TAARLSYQKN
WAILAILLGI
FMTQTTNNSS
VDNCRYTYSE
QEEDGLDFLN 



41 

I 

SKLEADKIIG 
KRKQTQKEVT 
SDAAQNYTVF 
YSADLPLPLP 
KYSILQQTPR 
CIITVTDSND 
NGHFKISTDK 
HVRDLDEGPE 
DEISGSIITS 
YVVICKPKMG
AGFQEYTIPI
ALLFSVLLTL
QGFCGTMGSG 
WHSFTQPRLG 
NLEPKFITLA 



51 

I 

RVNLEECFRS 
VLLEHQKKVS 
YSISGRGVDK 
IRVEDENDNH 
SPGLFSVHPS 
NAPTFRQNAY 
ETNEGVLSVV
CTPAAQYVRI 
KILDREVETP 
YTDILAVDPD 
TVKDRAGQAA 
VCGVFGATKG
MKNGGQETIE 



Seq ID NO: 33 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 64-2583 
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I 

GGCAGGTCTC 
CCGATGGCCG 
CTGACCCTCG 
CCTTCTAAAC 
TCTGCAGACC 
TACACAGCCA 
GACAAAAGGA 
TCGAAGACAA 
ATTCCTTGCT 
GAATCTGATG 
AAAGAACCTT 
CCTGTGGATC 
GGATATTCAG 
CACCCTGTTT 
ACTACAGTGG 
CTGAAATACA 
AGCACAGGCG 
TCATTGATAA 
ACTTGTATCA 
TATGAAGCAT 
GATAAGGATT 
GAAAATGGAC 
GTAAAGCCAC 
GAAGCGCCAT 
GTTCATGTGA 
ATTAAAGAAA 
AATAGAAATG 
ATTGATGAAA 
CCCAAAAATG 
ACTGGAACAC 
GAATATGTAG 
GATGAACCTG 
AGTAGACTGT 
AATGCTGGAT 
GCAACAAAAT 
ACTTCAAGGA 
ATAGCACTGC 
GGGAAACGTT 
CCTGGAGACG
AGCCAAGGTT 
GAAATGATGA 
ACCCTGGACT 
GAGTGGCACA 
TAAAAATTAA 
CCAAGATTAT 
CTGCTGCAGT 
ATTTATTACA 
TTTGTCAGAC 
TGTATATGAT 
AGCCAGTTGT 
TCTCAAACTC 
GATATTTTAG 
ATATCACATT 
AGTATCACTA 
ATGTTGCAGC 




GCATTTTGCA 
TAATCACCAC 
TGAAAGTACA 
TAACAGTAAC 
TTGTAGAGGA 
TAATTAACAC 
ATTTCAAAAT 
TGAATTATGA 
TTGCTAGAGA 
GGGATCTGGA 
ACTTAGCAGT 
GCAATGGTTT 
TTTCAGGGTC 
AGTTGTATAA 
TTGCTGTGAA 
TCATTTGCAA 
TCCATGGAGC 
GGAGCCTCAC 
TTCAAGAATA 
TATTGAGAGT 
GTACAGGAGT 
TCTTTTCTGT 
TTCCTGAAGA 
ATAGAGTGTG 
TTTGTGGTAC 
AAGGAGGAAA 
CCTGCAGGGG 
GTTTTACTCA 
ACATAAAAGA 
GTCCTCACTT 
GAAAAGCAGG 
TTAGCAGAAG 
ATTCTGGAGG 
GATTTTTTTC 
TGCTTATCTT 
CAGCACTGGA 
TAATAAATAT 
ATTATGTATT 
TGTGAAGAAA 
TCATAAAGAA 



21 
I 

CCCTCCCGGC 
CCGGCGCTCC 
TCGTGATGGT 
CAAAATAATT 
AAGTGATCCT 
GCTGTCTGAT 
GAAAGAGGTT 
AGAAACTGTT 
GAATTCCTTG 
CTATACTGTC 
TTATATAGAA 
TGATGTTTTT 
CCTCCCACTA 
AATTTATAAT 
TGCCACAGAC 
GCAGACACCA 
AGTCTCTCAT 
AGACATGGAT 
AGATTCAAAT 
AAATGCATTC 
TGCCAATTGG 
CAGCACAGAC 
AGAAAACCGT 
TATTCCCAGA 
TGAGGGGCCT 
GGGGTCAAAG 
AAGGTACAAA 
AATCATAACT 
TATTACAGTC 
CATTGAAGAT 
ACCAAAAATG 
TCCATTTTAT 
CAAAGTTAAT 
TACCATTCCT 
TAATCTGTGT 
AATACTTGGA 
ATTGCTAACT 
TTTAGCACAG 
CTCTGCCAAT 
TATGGGATCA 
CCAGACCTTG 
AGGACACACG 
ACCCCGTCTC 
AATTGCATCG 
ATAACTATGA 
AAGAAGATGG 
CATGCACAAA 
TTTCCAAAAA 
TCAATTTTGA 
TTCCAAAAAG 
ATTAAGGTCT 
GCTGGATAAA 
CACTTTAAGT 
GTTTTGGAAA 
TTGGGACTCA 



31 

I 

GCCCGCGTTC 
GTGCGCGGAG 
GAAGCCTGCA 
GGCAGAGTTA 
GATTTCAGAG 
AAGAAAAGAT 
ACTGTGCTGC 
CTCAGGCGTG 
GGCCCTTTCC 
TTCTACTCAA 
AGAGACACTG 
GATTTGATTG 
CCCATCAGGG 
TTTGAAGTTT 
AGAGATGAAC 
AGGTCACCTG 
TATTTGGACA 
GGCCAGTTTT 
GATAATGCAC 
AATGTGGAAA 
AGAGTCAATT 
AAAGAAACTA 
CAAGTGAACC 
GTGACAGCCT 
GAATGCACTC 
ATCAACGGCT 
AAATTGCATG 
TCCAAAATCC 
CTGGCAATAG 
GTAAATGATA 
GGGTATACCG 
TTCAGTTTGC 
GATACAGCTG 
ATTACTGTAA 
GAATGTACTC 
AAATGGGCAA 
TTAGTATGTG 
CAAAACTTAA 
GGATTTATGA 
GGAATGAAAA 
GAATCCTGCC 
GAGGTGGACA 
GGTGAAGAAT 
ATGTAATCAG 
GGGAAGAGGA 
CCTTGACTTT 
GAGATAATGT 
TAATATTGTA 
ATTATGCTAC 
TGAAAAATGT 
CTAAAGCATC 
TATTAGTCCA 
GATAGTTTAA 
AGAAACAATG 
CCCCTACTGC 



41 
I 

TCCTGGCCCT 
CCGTCTGCCT 
AAAAGGTGAT 
ATTTGGAAGA 
TTCTAAATGA 
CATTTACCAT 
TAGAACATCA 
CCAAGAGGAG 
CATTGTTTCT 
TAAGTGGACG 
GAAATCTATT 
CTTATGCGTC 
TAGAGGATGA 
TGGAAAGTAG 
CGGACACAAT 
GGCTCTTTTC 
GAGAGGTTGT 
TTGGATTGAT 
CCACTTTCAG 
TCTTACGAAT
TTACCATTTT 
ATGAAGGTGT 
TGGAAATTGG 
TGAACAGAGC 
CTGCAGCCCA 
ATAAGGCATA 
ATCCTAAAGG 
TGGATAGGGA 
ACAAAGATGA 
ATCCACCAGA 
ACATTTTAGC 
CCAATACTTC 
CCCGTCTTTC 
AAGACAGGGC 
ATCCAACTCA 
TCCTTGCAAT 
GAGTTTTTGG 
TTATATCAAA 
CCCAAACTAC 
ATGGAGGGCA 
GGGGGGCTGG 
ACTGCAGATA 
CCATTAGAGG 
AATGAAGACC 
TCTCCAGCTG 
TTAAATAATT 
CACAGTGCTA 
AAGTTCAATT 
TCACCAATTT 
TAAAACAGAC 
TGCTCTTTTT 
ACAATAGCTA 
AAAATAAACA 
AAGACTGAAT 
ACTACCAAAT 



60 
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180. 
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300 
360 
420 
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720 
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I 

GCCCGGCATC 
GCATCTGCTG 
ACTTAATGTA 
GTGCTTCAGG 
TGGGTCAGTG 
ATGGCTTTCT
GAAGAAGGTA 
ATGGGCACCT 
TCAACAAGTT 
TGGAGTTGAT 
TTGCACTCGG 
AACTGCAGAT 
AAATGACAAC 
TAGACCTGGT 
GCATACGCGC 
TGTGCATCCC 
AGACAAGTAC 
AGGCACATCA 
ACAAAATGCT 
ACCTATAGAA 
AAAGGGAAAT 
TCTTTCTGTT 
AGTAAACAAT 
CTTGGTTACA 
ATATGTGCGG 
TGACCCCGAA 
TTGGATCACC 
GGTTGAAACT 
TAGATCATGT 
AATACTTCAA 
TGTTGATCCT 
TCCAGAAATC 
ATATCAGAAA 
CGGCCAAGCT
GTGTCGTGCG 
ATTACTGGGT 
TGCAACTAAA 
CACAGAAGCA 
CAACAACTCT 
GGAAACCATT 
GCATCATCAT 
CACTTACTCG 
ACACACTGGT 
GCATGCCATC 
GTTCTGTGGG 
TGGAACCCAA 
CAATTAGGTC 
TCAACATGTA 
ATATTTTTAA 
AACTGGTAAA 
TTTTTTTACG 
AGTTATGCTA 
AGAAATATTG 
TAAATTAAAA 
TCATTTGACT 
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1080 
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1200 
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1380 
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1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
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TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA 
GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC 
AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG
GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC
GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA
TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA
CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT
GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG
ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA 
TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC 
AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG 
ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT
TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT 
AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC 
CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA 
CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA 
AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG 
TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT 
AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC 
CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT 
CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA 
TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA 
AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC 
CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT 
TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT 
TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC 
GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT 
TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC 
TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC 
GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA 
TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC 
GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC 
TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA 
CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT 
TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT 
TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA 
AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC 
TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT 
AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT 
TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG 
TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT 
CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT 
AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA 
TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT 
TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA 
GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG 
TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT 
TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT 
GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT 
ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA 
CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA 
GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA 
GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA 
AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA 
ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA 
ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG 
GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT 
ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT 
ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA 
ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT 
ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT 
CATATATATA ATCCCGAAAC ATG

Seq ID NO: 34 Protein sequence: 
Protein Accession #: NP_077741.1






1
I 

MAAAGPRRSV 
ADLIRSSDPD
KTRHTRETVL 
EPLNLFYIER 
PVFTEAIYNF 
TGVITTVSHY 
EAPVEENAFN
KPLNYEENRQ
KENLAVGSKI
KNELYNITVL 
EPVHGAPFYF 
TKLLRVNLCE 
KRFPEDLAQQ 
MMKGGNQTLE



11 
I 

RGAVCLHLLL
FRVLNDGSVY 
RRAKRRWAPI 
DTGNLFCTRP 
EVLESSRPGT 
LDREVVDKYS
VEILRIPIED 
VNLEIGVNNE 
NGYKAYDPEN 
AIDKDDRSCT 
SLPNTSPEIS 
CTHPTQCRAT 
NLIISNTEAP 



21 
I 

TLVIFSRDGE
TARAVALSDK 
PCSMQENSLG 
VDREEYDVFD 
TVGVVCATDR
LIMKVQDMDG
KDLINTANWR
APFARDIPRV
RNGNGLRYKK
GTLAVNIEDV 
RLWSLTKVND 
SRSTGVILGK 
GDDRVCSANG 



31 
I 

ACKKVILNVP
KRSFTIWLSD 
PFPLFLQQVE 
LIAYASTADG 
DEPDTMHTRL 
QFFGLIGTST 
VNFTILKGNE 
TALNRALVTV 
LHDPKGWITI 
NDNPPEILQE 
TAARLSYQKN 
WAILAILLGI 
FMTQTTNNSS
VDNCRYTYSE 



TTTTCTATAG 
AATATTTAAA 
GTATAGTTTG 
TGCATTATAA 
ATTGTAAATA 
CAGTAGCTTT
GGGGAATACT 
GTGTTTTTTT 
ATCCTCTATT 
AAATAACCAT 
AAAGCACCCT 
CAGGTCTGGG 
GGCCTGAATC 
AACACCTCCA 
ATTAATTTTT 
ATTTTGAATG
AGAAAGCAGC 
CAAGGGCAAG 
ATACTTTTTC 
TGAGTCCGGT 
ATCAAGGAGA 
CTGGTTGTGC 
CTTCTGGGCA 
ACATTTTCTG 
TGCCCCCCCC 
GTGGCTCCGA
TCAGCCTCCT 
ATTTTTAATA 
GTGATCCGCC
GGCCTTGTTT 
AGTTGATCAT 
AGGGGGAGAA 
TAGTTGCTGA 
CCCACTGTGT 
ACTAGTGCCG 
CTAACCATCT 
ATTTGTAATT 
ACATATGTAG 
GCAAGAAAAT 
ATGTAAATAT 
TGGGGTTTGT 
TTGCTTTTAA 
TTACAGATGT 
ATTGGATGCA 
GGTTTCTTGT 
TTATGCTTAT 
GGAATATTTT 
GTTGGGAAGA 
GTATAATAAA 
TCCATGAATA 
ACATGAGTTA 
GATAATGCCT 
GTGAATCTTT 
AAAATCTATT 
TTCCAGTAAC 
TTTTTGTTTG 
TTGAATATTG 
GAATCCTGGA 
TAGCTTTCTC 
TTAGTTAATA 
GTGAGGTAGT 
TTAGGGAAAT 



41 
I 

SKLEADKIIG
KRKQTQKEVT 
SDAAQNYTVF 
YSADLPLPLP 
KYSILQQTPR 
CIITVTDSND 
NGHFKISTDK 
HVRDLDEGPE 
DEISGSIITS 
YVVICKPKMG
AGFQEYTIPI
ALLFSVLLTL
QGFCGTMGSG
WHSFTQPRLG 



GAATATAGTT 
ATGAAATGAG 
TCCTACAATA 
CTGAGTCTAT 
AATTAAACTT 
GCTTTCCAGT 
CGCTGCAGCT 
CGGGGAGCTA
GCTGTTTCTA 
GTCCTCCTAG 
GGGGAGATTG 
AGCTACAAAA 
AAGGAAAGCC 
GCAGAGATTC 
AATCAGTTTG 
TATAAAAGAA 
CCAAGTAGGT 
GAGAGGCCAC 
CTAGGCTTGG 
GAGGGATCAG 
CAGAGCTGAC 
AGAACAAACA 
CTAAGAAGGT 
TTTTCTAATT 
CCCTTTTTTT 
TCTCTGCTCA 
GAGTAGCTGG 
GAGACGGGGT 
TGCCTCGGCC 
TCCGTTTAAA 
ACGAATTGGA 
AGAACTCAGG 
AATTTCCTGC 
TTTGCTCACT 
ATAAACTTTC 
CTTTGTTCTT 
CTTTTCTCTC 
TATTATTATT 
ATATTTTTAA 
ACAGAATGTT 
TTTGCAATGT 
AGAAACTTGG 
GGGGAGATGT 
TAAAGTAATA 
TTTTGCTGTA 
GGCTGGCATG 
GGACAATGTT 
GATCACTATT 
TTTTGATCGG 
GTACACTGAC 
AAAAGAAAAG 
GAACTTTAAT 
TCCCCATGCA 
AGCTATATCA
TTCTATTGTA 
TAATAGTAAA 
GGTCATAATG 
TCTGTCACTT 
ATTAAAATCA 
TACCTAAAGT 
TGGTAAAATT 
AAAGTTTGTG 



51 
I 

RVNLEECFRS 
VLLEHQKKVS 
YSISGRGVDK 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500
4560 
4620 
4680
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840
6900 
6960 
7020 



SPGLFSVHPS 
NAPTFRQNAY 
ETNEGVLSVV
CTPAAQYVRI 
KILDREVETP 
YTDILAVDPD 
TVKDRAGQAA 
VCGVFGATKG
MKNGGQETIE 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 



Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 146-1273



1 11 21 31 41 51



GGGAGTGGGC GTGGCGGTGC TGCCCAGGTG AGCCACCGCT GCTTCTGCCC AGACACGGTC 60 

GCCTCCACAT CCAGGTCTTT GTGCTCCTCG CTTGCCTGTT CCTTTTCCAC GCATTTTCCA 120 

GGATAACTGT GACTCCAGGC CCGCAATGGA TGCCCTGCAA CTAGCAAATT CGGCTTTTGC 180 

CGTTGATCTG TTCAAACAAC TATGTGAAAA GGAGCCACTG GGCAATGTCC TCTTCTCTCC 240 

AATCTGTCTC TCCACCTCTC TGTCACTTGC TCAAGTGGGT GCTAAAGGTG ACACTGCAAA 300 

TGAAATTGGA CAGGTTCTTC ATTTTGAAAA TGTCAAAGAT ATACCCTTTG GATTTCAAAC 360 

AGTAACATCG GATGTAAACA AACTTAGTTC CTTTTACTCA CTGAAACTAA TCAAGCGGCT 420

CTACGTAGAC AAATCTCTGA ATCTTTCTAC AGAGTTCATC AGCTCTACGA AGAGACCCTA 480 

TGCAAAGGAA TTGGAAACTG TTGACTTCAA AGATAAATTG GAAGAAACGA AAGGTCAGAT 540 

CAACAACTCA ATTAAGGATC TCACAGATGG CCACTTTGAG AACATTTTAG CTGACAACAG 600 

TGTGAACGAC CAGACCAAAA TCCTTGTGGT TAATGCTGCC TACTTTGTTG GCAAGTGGAT 660 

GAAGAAATTT CCTGAATCAG AAACAAAAGA ATGTCCTTTC AGACTCAACA AGACAGACAC 720 

CAAACCAGTG CAGATGATGA ACATGGAGGC CACGTTCTGT ATGGGAAACA TTGACAGTAT 780 

CAATTGTAAG ATCATAGAGC TTCCTTTTCA AAATAAGCAT CTCAGCATGT TCATCCTACT 840 

ACCCAAGGAT GTGGAGGATG AGTCCACAGG CTTGGAGAAG ATTGAAAAAC AACTCAACTC 900 

AGAGTCACTG TCACAGTGGA CTAATCCCAG CACCATGGCC AATGCCAAGG TCAAACTCTC 960 

CATTCCAAAA TTTAAGGTGG AAAAGATGAT TGATCCCAAG GCTTGTCTGG AAAATCTAGG 1020 

GCTGAAACAT ATCTTCAGTG AAGACACATC TGATTTCTCT GGAATGTCAG AGACCAAGGG 1080 

AGTGGCCCTA TCAAATGTTA TCCACAAAGT GTGCTTAGAA ATAACTGAAG ATGGTGGGGA 1140 

TTCCATAGAG GTGCCAGGAG CACGGATCCT GCAGCACAAG GATGAATTGA ATGCTGACCA 1200 

TCCCTTTATT TACATCATCA GGCACAACAA AACTCGAAAC ATCATTTTCT TTGGCAAATT 1260 

CTGTTCTCCT TAAGTGGCAT AGCCCATGTT AAGTCCTCCC TGACTTTTCT GTGGATGCCG 1320 

ATTTCTGTAA ACTCTGCATC CAGAGATTCA TTTTCTAGAT ACAATAAATT GCTAATGTTG 1380 

CTGGATCAGG AAGCCGCCAG TACTTGTCAT ATGTAGCCTT CACACAGATA GACCTTTTTT 1440 

TTTTTCCAAT TCTATCTTTT GTTTCCTTTT TTCCCATAAG ACAATGACAT ACGCTTTTAA 1500 

TGAAAAGGAA TCACGTTAGA GGAAAAATAT TTATTCATTA TTTGTCAAAT TGTCCGGGGT 1560 

AGTTGGCAGA AATACAGTCT TCCACAAAGA AAATTCCTAT AAGGAAGATT TGGAAGCTCT 1620 

TCTTCCCAGC ACTATGCTTT CC T TCT TTO Q GATAGAGAAT GTTCCAGACA TTCTCGCTTC 1680 

CCTGAAAGAC TGAAGAAAGT GTAGTGCATG GGACCCACGA AACTGCCCTG GCTCCAGTGA 1740 

AACTTGGGCA CATGCTCAGG CTACTATAGG TCCAGAAGTC CTTATGTTAA GCCCTGGCAG 1800 

GCAGGTGTTT ATTAAAATTC TGAATTTTGG GGATTTTCAA AAGATAATAT TTTACATACA 1860 

CTGTATGTTA TAGAACTTCA TGGATCAGAT CTGGGGCAGC AACCTATAAA TCAACACCTT 1920 

AATATGCTGC AACAAAATGT AGAATATTCA GACAAAATGG ATACATAAAG ACTAAGTAGC 1980 

CCATAAGGGG TCAAAATTTG CTGCCAAATG CGTATGCCAC CAACTTACAA AAACACTTCG 2040 

TTCGCAGAGC TTTTCAGATT GTGGAATGTT GGATAAGGAA TTATAGACCT CTAGTAGCTG 2100 

AAATGCAAGA CCCCAAGAGG AAGTTCAGAT CTTAATATAA ATTCACTTTC ATTTTTGATA 2160 

GCTGTCCCAT CTGGTCATGT GGTTGGCACT AGACTGGTGG CAGGGGCTTC TAGCTGACTC 2220 

GCACAGGGAT TCTCACAATA GCCGATATCA GAATTTGTGT TGAAGGAACT TGTCTCTTCA 2280 

TCTAATATGA TAGCGGGAAA AGGAGAGGAA ACTACTGCCT TTAGAAAATA TAAGTAAAGT 2340 

GATTAAAGTG CTCACGTTAC CTTGACACAT AGTTTTTCAG TCTATGGGTT TAGTTACTTT 2400 

AGATGGCAAG CATGTAACTT ATATTAATAG TAATTTGTAA AGTTGGGTGG ATAAGCTATC 2460 

CCTGTTGCCG GTTCATGGAT TACTTCTCTA TAAAAAATAT ATATTTACCA AAAAATTTTG 2520 

TGACATTCCT TCTCCCATCT CTTCCTTGAC ATGCATTGTA AATAGGTTCT TCTTGTTCTG 2580 
AGATTCAATA TTGAATTTCT CCTATGCTAT TGACAATAAA ATATTATTGA ACTACC 

Seq ID NO: 36 Protein sequence:
Protein Accession #: NP_002630.1

1 11 21 31 41 51 



ENVKDIPFGF QTVTSDVNKL SSFYSLKLIK RLYVDKSLNL STEFISSTKR PYAKELETVD 120

FKDKLEETKG QINNSIKDLT DGHFENILAD NSVNDQTKIL VVNAAYFVGK WMKKFPESET 180

KECPFRLNKT DTKPVQMMNM EATFCMGNID SINCKIIELP FQNKHLSMFI LLPKDVEDES 240

TGLEKIEKQL NSESLSQWTN PSTMANAKVK LSIPKFKVEK MIDPKACLEN LGLKHIFSED 300

TSDFSGMSET KGVALSNVIH KVCLEITEDG GDSIEVPGAR ILQHKDELNA DHPFIYIIRH 360
NKTRNIIFFG KFCSP



Seq ID NO: 37 DNA sequence

Nucleic Acid Accession #: NM_016583

Coding sequence: 72-842 

1 11 21 31 41 51 



GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180

ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660 

CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 

AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 

GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 




MDALQLANSA 



60 




Seq ID NO: 38 Protein sequence: 
Protein Accession #: NP_057667

1 11 21 31 41 51 

I I I I I I 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL
SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 
DIVNMLIHGL QFVIKV 



Seq ID NO: 39 DNA sequence

Nucleic Acid Accession #: NM_004363.1 

Coding sequence: 115-2223






CTCAGGGCAG 
TCCTGGAACT 
TCTCCCTCGG 
TCACTTCTAA 
TTCAATGTCG 
TTTGGCTACA 
GTAATAGGAA 
CCCAATGCAT 
CACGTCATAA 
GAGCTGCCCA 
GTGGCCTTCA 
CAGAGCCTCC 
TTCAATGTCA 
GCCAGGCGCA
TCCCCTCTAA 
TCTAACCCAC 
GAGCTCTTTA 
AACTCAGACA 
CCCAAACCCT 
TTAACCTGTG 
CTCCCGGTCA 
GTCACAAGGA
CACAGCGACC 
TCATACACCT 
CCACCTGCAC 
TTTATCTCCA 
GCCAGTGGCC 
CCCTCCATCT 
TGTGAACCTG 
GTCAGTCCCA 
AGAAATGACG 
GACCCAGTCA 
TCGTCTTACC 
CCGCAGTATT 
GCCAAAATCA 
GGCCGCAATA 
CTCTCAGCTG 
TAGCAGCCCT 
TAAAGCATTT 
AGACTCTGAC 
AAATACAAAA 
TGAGGCAGGA 
ACTGCACTCC 
TCTGACCTGT 
AACTTTAATG 
TAATTAATTT 
TTCCCAGATT 
AAATATACTT 
AGACTTGGGA 
TCAATAAAAA 



11 
I 

AGGGAGGAAG 
CAAGCTCTTC 
CCCCTCCCCA 
CCTTCTGGAA 
CAGAGGGGAA 
GCTGGTACAA 
CTCAACAAGC 
CCCTGCTGAT 
AGTCAGATCT 
AGCCCTCCAT 
CCTGTGAACC 
CGGTCAGTCC 
CAAGAAATGA 
GTGATTCAGT 
ACACATCTTA 
CTGCACAGTA 
TCCCCAACAT 
CTGGCCTCAA 
TCATCACCAG 
AACCTGAGAT 
GTCCCAGGCT 
ATGATGTAGG 
CAGTCATCCT 
ATTACCGTCC
AGTATTCTTG 
ACATCACTGA 
ACAGCAGGAC 
CCAGCAACAA 
AGGCTCAGAA 
GGCTGCAGCT 
CAAGAGCCTA 
CCCTGGATGT 
TTTCGGGAGC 
CTTGGCGTAT
CGCCAAATAA 
ATTCCATAGT 
GGGCCACTGT 
GGTGTAGTTT 
GCAACAGCTA 
CAGAGATCGA 
ATGAGCTGGG 
GAATCGCTTG 
AGTCTGGCAA 
ACTCTTGAAT 
AACTAACTGA 
CATGGGACTA 
TCAGGAAACT 
TTGTGAACAA 
AACTATTCAT 
TCTGCTCTTT 



21 
I 

GACAGCAGAC 
TCGACAGAGG 
CAGATGGTGC 
CCCGCCCACC 
GGAGGTGCTT 
AGGTGAAAGA 
TACCCCAGGG 
CCAGAACATC 
TGTGAATGAA 
CTCCAGCAAC 
TGAGACTCAG 
CAGGCTGCAG 
CACAGCAAGC 
CATCCTGAAT 
CAGATCAGGG 
CTCTTGGTTT 
CACTGTGAAT 
TAGGACCACA 
CAACAACTCC 
TCAGAACACA 
GCAGCTGTCC 
ACCCTATGAG 
GAATGTCCTC 
AGGGGTGAAC 
GCTGATTGAT 
GAAGAACAGC 
TACAGTCAAG 
CTCCAAACCC 
CACAACCTAC 
GTCCAATGGC 
TGTATGTGGA 
CCTCTATGGG 
GAACCTCAAC 
CAATGGGATA 
TAACGGGACC 
CAAGAGCATC 
CGGCATCATG 
CTTCATTTCA 
CAGTCTAAAA 
GACCATCCTA 
CTTGGTGGCG 
AACCCGGGAG 
CAGAGCAAGA 
ACAAGTTTCT 
CAGCTTCATG 
AATGAACTAA 
TTTTTTCTTT 
AAATTGAGAC 
GAATATTTAT 
GTATAACAGA 



31 
I 

CAGACAGTCA 
AGGACAGAGC 
ATCCCCTGGC 
ACTGCCAAGC 
CTACTTGTCC 
GTGGATGGCA 
CCCGCATACA 
ATCCAGAATG 
GAAGCAACTG 
AACTCCAAAC 
GACGCAACCT 
CTGTCCAATG 
TACAAATGTG 
GTCCTCTATG 
GAAAATCTGA 
GTCAATGGGA 
AATAGTGGAT 
GTCACGACGA 
AACCCCGTGG 
ACCTACCTGT 
AATGACAACA 
TGTGGAATCC 
TATGGCCCAG 
CTCAGCCTCT 
GGGAACATCC 
GGACTCTATA 
ACAATCACAG 
GTGGAGGACA 
CTGTGGTGGG 
AACAGGACCC 
ATCCAGAACT 
CCGGACACCC 
CTCTCCTGCC 
CCGCAGCAAC 
TATGCCTGTT 
ACAGTCTCTG 
ATTGGAGTGC 
GGAAGACTGA 
TTGCTTCTTT 
GCCAACATCG 
CGGACCTGTA 
GTGGAGATTG 
CTCCATCTCA 
GATACCACTG 
AAACTGTCCA 
TGAGGATTGC 
TAAGCTATCC 
ATTTACATTT 
ATTGTATGGT 
AAAA 



Seq ID NO: 40 Protein sequence: 
Protein Accession #: NP_004354.1 



1 
I 

MESPSAPPHR 
HLFGYSWYKG 
TLHVIKSDLV 
NNQSLPVSPR 
TISPLNTSYR 
AHNSDTGLNR 
QSLPVSPRLQ
SPSYTYYRPG 
NSASGHSRTT 
LPVSPRLQLS 
PDSSYLSGAN 
ATGRNNSIVK 



11 
I 

WCIPWQRLLL 
ERVDGNRQII 
NEEATGQFRV 
LQLSNGNRTL 
SGENLNLSCH 
TTVTTITVYA 
LSNDNRTLTL 
VNLSLSCHAA 
VKTITVSAEL 
NGNRTLTLFN 



SITVSASGTS 



21 

I 

TASLLTFWNP
GYVIGTQQAT 
YPELPKPSIS 
TLFNVTRNDT 
AASNPPAQYS 
EPPKPFITSN
LSVTRNDVGP 
SNPPAQYSWL 
PKPSISSNNS
VTRNDARAYV 
PSPQYSWRIN 
PGLSAGATVG 



31 

I

PTTAKLTIES 
PGPAYSGREI 



ASYKCETQNP
WFVNGTFQQS 
NSNPVEDEDA 
YECGIQNELS 
IDGNIQQHTQ 
KPVEDKDAVA 
CGIQNSVSAN 
GIPQQHTQVL
IMIGVLVGVA 



41 
I 

CAGCAGCCTT 
AGACAGCAGA 
AGAGGCTCCT 
TCACTATTGA 
ACAATCTGCC 
ACCGTCAAAT 
GTGGTCGAGA
ACACAGGATT 
GCCAGTTCCG 
CCGTGGAGGA 
ACCTGTGGTG 
GCAACAGGAC 
AAACCCAGAA 
GCCCGGATGC 
ACCTCTCCTG 
CTTTCCAGCA 
CCTATACGTG 
TCACAGTCTA 
AGGATGAGGA 
GGTGGGTAAA 
GGACCCTCAC 
AGAACGAATT 
ACGACCCCAC 
CCTGCCATGC 
AGCAACACAC 
CCTGCCAGGC 
TCTCTGCGGA 
AGGATGCTGT 
TAAATGGTCA 
TCACTCTATT 
CAGTGAGTGC 
CCATCATTTC 
ACTCGGCCTC 
ACACACAAGT 
TTGTCTCTAA 
CATCTGGAAC 
TGGTTGGGGT 
CAGTTGTTTT 
ACCAAGGATA 
TGAAACCCCA 
GTCCCAGTTA 
CAGTGAGCCC 
AAAAGAAAAG 
CACTGTCTGA 
CCAAGATCAA 
TGATTCTTTA 
ACTCTTACAG 
TCTCCCTATG 
AATATAGTTA 



41 
I 

TPFNVAEGKE
IYPNASLLIQ 
DAVAFTCEPE 
VSARRSDSVI 
TQELFIPNIT 
VALTCEPEIQ 
VDHSDPVILN 
ELFISNITEK 
FTCEPEAQNT 
RSDPVTLDVL 
FIAKITPNNN 
LI 



60 
120 
180 
240 



51 

I 

GACAAAACGT 
GACCATGGAG 
GCTCACAGCC 
ATCCACGCCG 
CCAGCATCTT 
TATAGGATAT 
GATAATATAC 
CTACACCCTA 
GGTATACCCG 
CAAGGATGCT 
GGTAAACAAT 
CCTCACTCTA 
CCCAGTGAGT 
CCCCACCATT 
CCACGCAGCC 
ATCCACCCAA 
CCAAGCCCAT 
TGCAGAGCCA 
TGCTGTAGCC 
TAATCAGAGC 
TCTACTCAGT 
AAGTGTTGAC 
CATTTCCCCC 
AGCCTCTAAC 
ACAAGAGCTC 
CAATAACTCA 
GCTGCCCAAG 
GGCCTTCACC 
GAGCCTCCCA 
CAATGTCACA 
AAACCGCAGT 
CCCCCCAGAC 
TAACCCATCC 
TCTCTTTATC 
CTTGGCTACT 
TTCTCCTGGT 
TGCTCTGATA 
GCTTCTTCCT 
TTTACAGAAA 
TCTCTACTAA 
CTCGGGAGGC 
AGATCGCACC 
AAAAGAAGAC 
GAATTTCCAA 
GCAGAGAAAA 
AATGTCTTGT 
CAATTTGATA 
TGGTCGCTCC 
TTGCACAAGT 



51 

I 

VLLLVHNLPQ 
NIIQNDTGFY 
TQDATYLWWV 
LNVLYGPDAP 
VNNSGSYTCQ 
NTTYLWWVNN 
VLYGPDDPTI 
NSGLYTCQAN 
TYLWWVNGQS 
YGPDTPIISP 
GTYACFVSNL 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880
2940 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 






Seq ID NO: 41 DNA sequence

Nucleic Acid Accession #: NM_006952.1

Coding sequence: 11-793 

1 11 21 31 41 51

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP_006883.1

1 11 21 31 41 51 

I I I I I I

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240
LCWTFWVLLG TMFYWSRIEY 



Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 63-2605

1 11 21 31 41 51

I I I I I I

GCCGGACAGA TCTGCGCGTA TCCTGGAGCC GGCCCAGTTG TGAACTAGGA GAGCTTTGGG 60 

ACCTCTGTCC CAAGCAAGAG AGATGAATGG AGAGTATAGA GGCAGAGGAT TTGGACGAGG 120 

AAGATTTCAA AGCTGGAAAA GGGGAAGAGG TGGTGGGAAC TTCTCAGGAA AATGGAGAGA 180 

AAGAGAACAC AGACCTGATC TGAGTAAAAC CACAGGAAAA CGTACTTCTG AACAAACCCC 240 

ACAGTTTTTG CTTTCAACAA AGACCCCACA GTCAATGCAG TCAACATTGG ATCGATTCAT 300 

ACCATATAAA GGCTGGAAGC TTTATTTCTC TGAAGTTTAC AGCGATAGCT CTCCTTTGAT 360 

TGAGAAGATT CAAGCATTTG AAAAATTTTT CACAAGGCAT ATTGATTTGT ATGACAAGGA 420 

TGAAATAGAA AGAAAGGGAA GTATTTTGGT AGATTTTAAA GAACTGACAG AAGGTGGTGA 480 

AGTAACTAAC TTGATACCAG ATATAGCAAC TGAACTAAGA GATGCACCTG AGAAAACCTT 540 

GGCTTGCATG GGTTTGGCAA TACATCAGGT GTTAACTAAG GACCTTGAAA GGCATGCAGC 600 

TGAGTTACAA GCCCAGGAAG GATTGTCTAA TGATGGAGAA ACAATGGTAA ATGTGCCACA 660 

TATTCATGCA AGGGTGTACA ACTATGAGCC TTTGACACAG CTCAAGAATG TCAGAGCAAA 720 

TTACTATGGA AAATACATTG CTCTAAGAGG GACAGTGGTT CGTGTCAGTA ATATAAAGCC 780 

TCTTTGCACC AAGATGGCTT TTCTTTGTGC TGCATGTGGA GAAATTCAGA GCTTTCCTCT 840 

TCCAGATGGA AAATACAGTC TTCCCACAAA GTGTCCTGTG CCTGTGTGTC GAGGCAGGTC 900 

ATTTACTGCT CTCCGCAGCT CTCCTCTCAC AGTTACGATG GACTGGCAGT CAATCAAAAT 960 

CCAGGAATTG ATGTCTGATG ATCAGAGAGA AGCAGGTCGG ATTCCACGAA CAATAGAATG 1020 

TGAGCTTGTT CATGATCTTG TGGATAGCTG TGTCCCGGGA GACACAGTGA CTATTACTGG 1080 

AATTGTCAAA GTCTCAAATG CGGAAGAAGG TTCTCGAAAT AAGAATGACA AGTGTATGTT 1140 

CCTTTTGTAT ATTGAAGCAA ATTCTATTAG TAATAGCAAA GGACAGAAAA CAAAGAGTTC 1200 

TGAGGATGGG TGTAAGCATG GAATGTTGAT GGAGTTCTCA CTTAAAGACC TTTATGCCAT 1260 

CCAAGAGATT CAAGCTGAAG AAAACCTGTT TAAACTCATT GTCAACTCGC TTTGCCCTGT 1320 

CATTTTTGGT CATGAACTTG TTAAAGCAGG TTTGGCATTA GCACTCTTTG GAGGAAGCCA 1380 

GAAATACGCA GATGACAAAA ACAGAATTCC AATTCGGGGA GACCCCCACA TCCTTGTTGT 1440 

TGGAGATCCA GGCCTAGGAA AAAGTCAAAT GCTACAGGCA GCGTGCAATG TTGCCCCACG 1500 

TGGCGTGTAT GTTTGTGGTA ACACCACGAC CACCTCTGGT CTGACGGTAA CTCTTTCAAA 1560 

AGATAGTTCC TCTGGAGATT TTGCTTTGGA AGCTGGTGCC CTGGTACTTG GTGATCAAGG 1620 

TATTTGTGGA ATCGATGAAT TTGATAAGAT GGGGAATCAA CATCAAGCCT TGTTGGAAGC 1680 

CATGGAGCAG CAAAGTATTA GTCTTGCTAA GGCTGGTGTG GTTTGTAGCC TTCCTGCAAG 1740 

AACTTCCATT ATTGCTGCTG CAAATCCAGT TGGAGGACAT TACAATAAAG CCAAAACAGT 1800 

TTCTGAGAAT TTAAAAATGG GGAGTGCACT ACTATCCAGA TTTGATTTGG TCTTTATCCT 1860 

GTTAGATACT CCAAATGAGC ATCATGATCA CTTACTCTCT GAACATGTGA TTGCAATAAG 1920 

AGCTGGAAAG CAGAGAACCA TTAGCAGTGC CACAGTAGCT CGTATGAATA GTCAAGATTC 1980 

AAATACTTCC GTACTTGAAG TAGTTTCTGA GAAGCCATTA TCAGAAAGAC TAAAGGTGGT 2040 

TCCTGGAGAA ACAATAGATC CCATTCCCCA CCAGCTATTG AGAAAGTACA TTGGCTATGC 2100 

TCGGCAGTAT GTGTACCCAA GGCTATCCAC AGAAGCTGCT CGAGTTCTTC AAGATTTTTA 2160 

CCTTGAGCTC CGGAAACAGA GCCAGAGGTT AAATAGCTCA CCAATCACTA CCAGGCAGCT 2220 

GGAATCTTTG ATTCGTCTGA CAGAGGCACG AGCAAGGTTG GAATTGAGAG AGGAAGCAAC 2280 

CAAAGAAGAC GCTGAGGATA TAGTGGAAAT TATGAAATAT AGCATGCTAG GAACTTACTC 2340 

TGATGAATTT GGGAACCTAG ATTTTGAGCG ATCCCAGCAT GGTTCTGGAA TGAGCAACAG 2400 

GTCAACAGCG AAAAGATTTA TTTCTGCTCT CAACAACGTT GCTGAAAGAA CTTATAATAA 2460 

TATATTTCAA TTTCATCAAC TTCGGCAGAT TGCCAAAGAA CTAAACATTC AGGTTGCTGA 2520 

TTTTGAAAAT TTTATTGGAT CACTAAATGA CCAGGGTTAC CTCTTGAAAA AAGGCCCAAA 2580 

AGTTTACCAG CTTCAAACTA TGTAAAAGGA CTTCACCAAG TTAGGGCCTC CTGGGTTTAT 2640 

TGCAGATTAA AGCCATCTCA GTGAAGATAT GOGTGCACGC ACAGACAGAC AGACACACAC 2700 

ACACACACAC ACACACACAC ACACACACAC ACACACAGTC AAATACTGTT CTCTGAAAAA 2760 

TGATGTCCCA AAAGTATTAT AATAGGAAAA AAGCATTAAA TATAATAAAC TAATTTAAGA 2820 
AGTGATAAAG TCTCCAQATG CAGTAGCTCA CACTGTAATC ACAGTGACTC AGGAGGCTGA 2880 
GGTGAGAGGA TTCCTTGAGG CCAGGGTTCQ AGACCAACCT TGGGCAACAT AGCAAGACCC 2940 
CATTTCTTAA AAAAAAAAAA AAAAAATTTA AACTTAGCTG GGTATGGTGG CACATGCCTA 3000 
TAGTCTCAGC TACTTGTGAG GCTGAGGCAG GAGGATTCTT TGAGCCCAGG AGTTTGAGGT 3060 
TACAGTGAGC CACAATCACA CCAATCACTG CACTCCAGCC TGGGCAATAA AGTAACTCTT 3120 
GACTCAAAAA AATAAAAAAA ATTGTAGTGG TAGCCATGTG TTAATTGTTA AATAAATTCT 3180 
CCAAAGGGCT AAAAGTAAAT TACTTATAAA TTTTTTATAG TTGTATTTTT GACCTGCCTT 3240 
TTATATGTAT GAATATTTCA TAGTTTTGCA TATCAGATGT AGGCATACAG ACAAATACAT 3300 
AAACCAATGA ATATATTACA TATTCTGTGT TCCAATAAAA CTTTATTTAT GGACACTAAA 3360 
ATTTGAATTT CATAAAATTT TOCCATGTCA AGAATACAAA ATACTTGAGT TTTGTTTTTA 3420 
GCTATTTAAT AATAGGTCTC ATTTATTCCA CAGGCTGTAG TTTGTAGTCT TGCTTGAAAC 3480 
AATAGAAACA GACTGATTAA GCAGGAGAAG TTTTTTGAAA GAATTTTGTT TGGCTCACGG 3540 
AATTATTAGA AGGCAGGTGA ACCAGGAGGG TAAGCTTCCA GCAGCAATTT GTAAAACCAT 3600 
GCCTTAGAAT TGGACTAAGG AAGAAGCTGC TGACACTCCA CTGCCACACA GGGCACTGGA 3660 
AGAAAGTGCT GCTGCCTCCC TGCCCCACCT TTGCCACTTC TGCAGCAGGA ATAGGTAGAA 3720 
GAATGCCCCC ACCCGCACCG GAACAGCAAC AAAAGGATTC TGCATGAGAT GCCTCCCTAA 3780 
ATTGCTGAAT TCAAAAAAGA AGTTGCATAC AAAGACATCT GATTGAAAAA GGGTATGTTA 3840 
TATGCCCCTT TCATAGGCTG CTAGGGAGTT TTCCTGGTTC TACTTTCAGG TGGTGGGATC 3900 
AATAAGACCA GAATTTCTCA TATGTTGTGA GAGGATTCAA ATGTTACAGG GTTGCCAGCC 3960 
AAACTATCAA TCATGTATAA ATCCAACAAA CACTTTGTAA CATACAAGAA CTCAGGAAAT 4020 
GTGAACCATT GTTGGAGAAT CTACTAAAAT ACGGCTTCCC GCAAACGAAG ATGAATGGAA 4080 
AATGTAAATA AAAAGAACTG GCAGTGTATA TCAGATGTTT AACTATAGGA CCAGAACTAA 4140 
GATGTGGAGA CTATTGCCAT AGACCACAAT GTAAATTTTT AAGTGAGGAA GGAAAAATCA 4200 
GGAATCAAAA GGGGCCAGGT GCAGTGGCTC ACATCTATAA TCCCAGAGCT TTGGGAGTTC 4260 
GAGGCAGGAG GATCACTTGA AGCCAGTTTT GAGACCAGCC TATGCAACAC ATTGAGACCC 4320 
TATCTCTACA AAAAATAGAT TAGCTGGGCA CGGTGGTGCA TGCCTATTGT CCTACCTACT 4380 
GTGGAGGCTG AAGTAGGAAA TCACTTGAGC CCGAGAGTTT GAGGTTACAG TGAGCTATGA 4440 
TTATACCACT GCACTCCAGC CTGGGCAAGA GAGCAAGACC TTGTCTCTT 

Seq ID NO: 44 Protein sequence: 
Protein Accession #: CAB55276.2 

1 11 21 31 41 51 
| | | | | | 
MNGEYRGRGF GRGRFQSWKR GRGGGNFSGK WREREHRPDL SKTTGKRTSE QTPQFLLSTK 60 
TPQSMQSTLD RFIPYKGWKL YFSEVYSDSS PLIEKIQAFE KFFTRHIDLY DKDEIERKGS 120 
ILVDFKELTE GGEVTNLIPD IATELRDAPE KTLACMGLAI HQVLTKDLER HAAELQAQEG 180 
LSNDGETMVN VPHIHARVYN YEPLTQLKNV RANYYGKYIA LRGTVVRVSN IKPLCTKMAF 240 
LCAACGEIQS FPLPDGKYSL PTKCPVPVCR GRSFTALRSS PLTVTMDWQS IKIQELMSDD 300 
QREAGRIPRT IECELVHDLV DSCVPGDTVT ITGIVKVSNA EEGSRNKNDK CMFLLYIEAN 360 
SISNSKGQKT KSSEDGCKHG MLMEFSLKDL YAIQEIQAEE NLFKLIVNSL CPVIFGHELV 420 
KAGLALALFG GSQKYADDKN RIPIRGDPHI LVVGDPGLGK SQMLQAACNV APRGVYVCGN 480 
TTTTSGLTVT LSKDSSSGDF ALEAGALVLG DQGICGIDEF DKMGNQHQAL LEAMEQQSIS 540 
LAKAGVVCSL PARTSIIAAA NPVGGHYNKA KTVSENLKMG SALLSRFDLV FILLDTPNEH 600 
HDHLLSEHVI AIRAGKQRTI SSATVARMNS QDSNTSVLEV VSEKPLSERL KVVPGETIDP 660 
IPHQLLRKYI GYARQYVYPR LSTEAARVLQ DFYLELRKQS QRLNSSPITT RQLESLIRLT 720 
EARARLELRE EATKEDAEDI VEIMKYSMLG TYSDEFGNLD FERSQHGSGM SNRSTAKRFI 780 
SALNNVAERT YNNIFQFHQL RQIAKELNIQ VADFENFIGS LNDQGYLLKK GPKVYQLQTM 

Seq ID NO: 45 DNA sequence 
Nucleic Acid Accession #: NM_005416.1 
Coding sequence: 149-658 

1 11 21 31 41 51 
| | | | | | 
ACCAGATCCC AGAGGCTGAA CACCTCGACC TTCTCTGCAC AGCAGATGAT CCCTGAGCAG 60 
CTGAAGACCA GAAAAGCCAC TAAGACTTTC TGCTTAATTC AGGAGCTTAG AGGATTCTTC 120 
AAAGAGTGTG TCCACGATCC TTTGAAGCAT GAGTTCTTAC CAGCAGAAGC AGACCTTTAC 180 
CCCACCACCT CAGCTTCAAC AGCAGCAGGT GAAACAACCC AGCCAGCCTC CACCTCAGGA 240 
AATATTTGTT CCCACAACCA AGGAGCCATG CCACTCAAAG GTTCCACAAC CTGGAAACAC 300 
AAAGATTCCA GAGCCAGGCT GTACCAAGGT CCCTGAGCCA GGCTGTACCA AGGTCCCTGA 360 
GCCAGGCTGT ACCAAGGTCC CTGAGCCAGG TTGTACCAAG GTCCCTGAGC CAGGCTGTAC 420 
CAAGGTCCCT GAGCCAGGTT GTACCAAGGT CCCTGAGCCA GGCTACACCA AGGTCCCTGA 480 
ACCAGGCAGC ATCAAGGTCC CTGACCAAGG CTTCATCAAG TTTCCTGAGC CAGGTGCCAT 540 
CAAAGTTCCT GAGCAAGGAT ACACCAAAGT TCCTGTGCCA GGCTACACAA AGCTACCAGA 600 
GCCATGTCCT TCAACGGTCA CTCCAGGCCC AGCTCAGCAG AAGACCAAGC AGAAGTAATT 660 
TGGTGCACAG ACAAGCCCTT GAGAAGCCAA CCACCAGATG CTGGACACCC TCTTCCCATC 720 
TGTTTCTGTG TCTTAATTGT CTGTAGACCT TGTAATCAGC ACATTGTCAC CCCAAGCCAT 780 
AGTCTCTCTC TTATTTGTAT CCTAAAAATA CGTACTATAA AGCTTTTGTT CACACACACT 840 
CTGAAGAATC CTGTAAGCCC CTGAATTAAG CAGAAAGTCT TCATGGCTTT TCTGGTCTTC 900 
GGCTGCTCAG GGTTCATCTG AAGATTCGAA TGAAAAGAAA TGCATGTTTC CTGCTCTTCC 960 
CTCATTAAAT TGCTTTTAAT TCCA 

Seq ID NO: 46 Protein sequence: 
Protein Accession #: NP_005407.1 

1 11 21 31 41 51 
| | | | | | 
MSSYQQKQTF TPPPQLQQQQ VKQPSQPPPQ EIFVPTTKEP CHSKVPQPGN TKIPEPGCTK 60 
VPEPGCTKVP EPGCTKVPEP GCTKVPEPGC TKVPEPGCTK VPEPGYTKVP EPGSIKVPDQ 120 
GFIKFPEPGA IKVPEQGYTK VPVPGYTKLP EPCPSTVTPG PAQQKTKQK 

Seq ID NO: 47 DNA sequence 
Nucleic Acid Accession #: Eos sequence 

1 11 21 31 41 51 
| | | | | | 
GCGTCGTGTG CAGGCGTCCC CGGGCTGTGG ATAATTAGAC ACGTTCTTCC CTCATTGCCC 60 
AAGGCTCGTT AGAATTCGCC CTAGAGCTGT ATCATGTATT TTCTTTCAAA TTAACTTTGC 120 
TTGCAATTAA GCTTAGGGAA CCAGCAACAA AAGCAAACTT GGCCCGAGGT CGTTCACCGC 180 
GAAAATGGAT TAGAGAAACT TCTTCCCCGA TTTAAGGGGA AAGATTCCTG CGGCCAGCGC 240 
TTTGGGGAAA GTGCCCCGAC CGCAGAGGCG ACGACAGGGG AGCAGGAAGC TGCTCACGGT 300 
AGTCGGCGTT GGCGGCAGCG GTGGCCTTCC TCATCTGGGC GATGTGGGCT CCTAGAAGAG 360 
TAAGGATAAC ATCCTGGAAA TGACTTCTGT ACGGTTTGAG CCCAACTGCA CACTCATGAC 420 
TTGGAGCTGC CCTGTGGAGT TACAGTTTAC CAAACACATT CATGAACATA ATCTCATTTA 480 
CTAAAAACTT TGTGAGAATT TTCTTTTACT AAAATTTTTT CTTATTACAA A 

Seq ID NO: 48 DNA sequence 
Nucleic Acid Accession #: CAT cluster 





TTCCAAATTT 
TTTTAGTAAA 
CTCCAAGTCA 
TCCTTACTCT 
CCGACTACCG 
CCCAAAGCGC 
ATTTTCGCGG 
TTGCAAGCAA 
AGCCTTGGGC 
CGACGCT 



11 

I 

TTTTTTTTGT 
TGAGATTATG 
TGAGTGTGCA 
TCTCGGAGCC 
TGAGCAGCTT 
TGGCCGCAGG 
TGAACGACCT 
AGTTAATTTG 
AATGAGGGAA 




AATCTTTCCC 
CGGGCCAAGT 
AAAGAAAATA 
GAACGTGTCT 



31 
I 

AATTTTAGTA 
TGTTTGGTAA 
AACCGTACAG 
AGATGAGGAA 
TGTCGTCGCC 
CTTAAATCGG 
TTGCTTTTGT 
CATGATACAG 
AGTTATCCAC 



41 
I 

AAAGAAAATT 
ACTGTAACTC 
AAGTCATTTC 
GGCCACCGCT 
TCTGCGGTCG 
GGAAGAAGTT 
TGCTGGTTCC 
CTCTAGGGCG 
AGCCCGGGGA 



51 
I 

CTCACAAAGT 
CACAGGGCAG 
CAGGATGTTA 
GCCGCCAACG 
GGGCACTTTC 
TCTCTAATGC 
CTAAGCTTAA 
AATTCTAACG 
CGCCTGCACA 



Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 
I 

TCTTTCTTCT 
CCTGCCGACC 
ACCGTAGACC 
GACACATGCA 
CCCAACCAAA 
TTCATTTAAA 
CTTTCTCTGA 
TCGCTGTAGC 
TAAAGAAACT 
GATTGAACCA 
TGCTGGTATC 
GACGTGGGGA 
CCGGGTCTCT 
GAGACAGCAT 
ACTG 



11 
) 

TCTGTTGTCT 
CGAAACCATT 
GACCAGTTTT 
GTGTTTAAAA 
AAACTCTAAT 
TCTGTGTCTT 
CATGGGAATC 
GACACAGGAG 
GTGCACTCCA 
GTCCTGCAGC 
GAGCTGGTCT 

CATTTATGAG 



21 

I 

TGTCTCTCCT 
CTTCTCTGAT 
GGGTGTCACA 
CCTGGAACNG 
CTTTTTAGGG 
ATTTATATTA 
TTTTCTTTGA 
CGTTTCATTA 
AATCACTTGA 
GCCTTGGCAG 
CCCATCCTCG 
ATATATCCGG 
GGGACCTAGT 
CCTGCAGCAT 



Seq ID NO: 50 DNA sequence 
Nucleic Acid Accession #: L05187 
Coding sequence: 1991-2260 



CTGCAGGGAG 
TCAGAAAGGA 
CAGAAGAAGG 
TGAAGGAAAG 
AGAGTCATAA 
GGAAATGGAT 
ATTTCTAGCT 
CCCCTOCCTT 
ACAACCATCT 
CCAGGGTTAA 
CCCTGCACCT 
CAGCCAGCTA 
GATGAGGATG 
AGCTTCTATT 
TCACACCAAA 
ATTGCAACAA 
ATATGTGTAA 
TATTTTAAGT 
CCTCAGTAGA 
AGTTCATAGC 
TGACAAGATA 
ATTTAAGGCA 
AACATAAAAC 
AGTAATTGGC 
AGGAGACCTC 
AGATGGGAAG 
GAGGCTTAGA 
GAGGAAAGTG 
GAAGCCAGCT 
GAGCCAAGAA 
TTCAAAGGGC 



11 
I 

GCAGGTAGAA 
GGAAAAGGCC 
ATTAGCCCCT 
CAGGTTTTCC 
GTAAATTATT 
GGAAGGTCTT 
TCCACCTTCA 
TCCCACCTAT 
CAATGACAAG 
CTCATGAAAC 
GGGTCTGAGG 
GTGCCAAAAA 
GTAGTGTGAG 
TCCTTGAGGC 
CCCAAGGGAC 
ACTGGCAATT 
GCAGGTTAAT 
TAAATTACAG 
TAGTCATTGA 
AGAACTAGAA 
TTTATAGAAA 
GTATGCTAGG 
CTAGCAGGAA 
ATGACGGAGA 
TAGGGTGTCA 
AAAAGCATTT 
TGAATATAAA 
GTCTGATGCC 
TTAGTAGGGC 
GAGAACTCCA 
CTGAAAATTA 



21 
I 

AAGGCTTTTG 
AGGGCAGATG 
GAAAGTCCCT 
CAGATTAGCA 
CTGAATGTGT 
GGACTCTGAG 
CCAAGGCAGA 
TCATGTGTGC 
GACAGCAGGT 
CCTCCATGAA 
ATGAGGGTGG 
ATATCAGGTG 
TCATGTGTGA 
AGGGCTCATT 
CACACAGCCC 
CTAGTGTACT 
CCAGGGTTTC 
TCTGGATTTG 
ACTGGGAGTC 
CTCAGGCCAG 
TTTTAATTTA 
CACTTTGGAC 
GGTAATACAT 
TGGGCAGAGA 
AGTGATGTGA 
GGAAGGGACT 
GCCATCCTAT 
ATTTTCCAAA 
ATTTTTCCAG 
ATAAAATGGA 
TCCAAGCTTA 



31 
I 

GTGCTCTTCT 
GGCGGGGGGC 
AGCCGGTCGC 
CATGACCATG 
CACCCCCAAA 
AATACAAAGA 
CAGCATCTCC 
TTATGGTAGC 
ACTTGGGAGG 
CGGAGCAAGA 
GTTCCATTGC 
GTGAAGCTCA 
ATTTTTGCCA 
CCACCCTACT 



31 
I 

GGTTTTCAGG 
TCTGGGTGGA 
GAAGTAGGAG 
ACCAGTCAGG 
GTAGTTTAAT 
ACAAGGGGTC 
CAAGGAGGGC 
AAGAGTGCCC 
GGCAAGGCTC 
GCCTGCTGCT 
CAGTGAAAAT 
GTGTTCATCA 
CAGGTGAGGA 
CATCTTATAA 
ATTCTGCTCC 
TTTTCATTAT 
AATGGGAGAT 
AAAGGACCTT 
CTGGAGAAGA 
AGCACTCTCA 
TTAGATGGAT 
AAATCAATGC 
ATATATAAAT 
AGGGCTGTGC 
GCTATGATGG 
GTGTAAGCAC 
AAGTCACAGG 
AGACCTAATA 
AACAGATATA 
GCAGAAGAAA 
TTTCATTTTT 



41 

1 

TCTTTCTTTC 
GGGAGAAGCT 
CGGCTTTTTT 
TTATTACTAT 
ATTTTTTTTT 
TACCCAAACC 
ATTTTTTTTC 
AATATGGAGT 
CAGAGTTTGC 
TTCTGTCACA 
GCTGCCAGGC 
GCTGTGGCAC 
CGAGTGTACA 
GCTGTATCCA 



51 
I 

CCTCGCCGCT 
GACCGGTGAG 
GGGAGAACCC 
GGGCCGCCTC 
TTTTTTTTTT 
CTTTATGCTT 
TGCTGCTTCA 
GCTGTATTCC 
AGTGAGCCGA 
GTTCCTGAAG 
AGGGTGCTGG 
ACCTTGGATG 
CCAAACAAAG 
GTTTCCATTG 



41 
I 

TGGGGGGCAG 
GTGAAGGGAA 
AAGGGTAAAG 
GGGAGGAAGG 
GGAATTGGGA 
TATAATCAGT 
CCACCTCAGC 
TGTCCCACAG 
AACAGGACTC 
CACCCCTCCC 
TAGGCCAGTG 
AATAAGCCGA 
ATGAAAACAG 
AAGCCAGCTG 
GTATACCAGG 
TAGAAATTAG 
AGAGAATAGT 
AGAGATGGTT 
TTGTTCAAAT 
GTAACACTGC 
CTCTACTGAG 
CCTAACGTAC 
AAATGAAATG 
ACTTTTGGGA 
AGGGGTATTT 
AGACCAGAAG 
CTTTCTACAT 
TGCGGACCTC 
AGGTGCCTTG 
TTGCCTTTTA 
AAATGTAATG 



51 
I 

TCTAGCCTGA 
AAAGTGATCC 
GTGTGGTTGG 
TGAGAGTGGG 
AAAAGATGGG 
CCATTTCATT 
TCCTCTGCTC 
AACACGGGGA 
AGATGTCCCC 
TCAAGGCAAG 
ACATCATTTT 
GCCAACCGGT 
AGTGCCCGAG 
GCCATTGCCT 
TAAGTCTCTG 
CTAAAGGCAA 
GGAATATCTT 
AGGGCTCCCA 
GCCCATGGGA 
AATTTCCCCC 
CATTTATTCC 
TTACTTAACA 
CAAAGTAGAT 
GACTTGCTCA 
GGACAAGCAG 
CAAAACCATA 
GGTACTAGGA 
ATGTCCCTCA 
GGTAGGAAGG 
GCTCCTCCTC 
GGGGAGCTAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
GGGAGATGAA AGGCTTTCTC TTCTAAAGGG TCCTGAAATA AAATCTGTTT GGCATTGAAT 1920 
TTGTATCCAT CTTTCTTTAA TTGAATCACT GTGTCAGCTT TCTGTCTCTA GAAAAAAACA 1980 
CATTTGAAGC ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA 2040 
GCAGCAGCAG GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC 2100 
CAAGGAGCCC TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG 2160 
CCAGCCCAAG ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT 2220 
CACTCCAGCA CCAGCCCAGC AGAAGACCAA GCAGAAGTAA TGTGGTCCAC AGCCATGCCC 2280 
TTGAGGAGCT GGCCACTGGA TACTGAACAC CCTACTCCAT TCTGCTTATG AATCCCATTT 2340 
GCCTATTGAC CCTGCAGTTA GCATGCTGTC ACCCTGAATC ATAATCGCTC CTTTGCACCT 2400 
CTAAAAAGAT GTCCCTTACC CTCATTCTGG AGGCTCCTGA GCCTCTGCGT AAGGCTGAAC 2460 
GTCTCACTGA CTGAGCTAGT CTTCTTGTTG CTOGGGTGCA TTTGAGGATG GATTTGGGGA 2520 
AGGTCAAGTG ACCATCCCTA G 



Seq ID NO: 51 Protein sequence: 
Protein Accession #: AAC26838 

1 11 21 31 41 51 
| | | | | | 
MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 60 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 

Seq ID NO: 52 DNA sequence 
Nucleic Acid Accession #: NM_002638. 
Coding sequence: 120-473 

1 11 21 31 41 51 
| | | | | | 
CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC COGCCCTGGA GCCAGGCCAA 60 
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 
TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180 
AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 
TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

Seq ID NO: 53 Protein sequence: 
Protein Accession #: NP_002629.1 

1 11 21 31 41 51 
| | | | | | 
MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 54 DNA sequence 
Nucleic Acid Accession #: NM_019618 
Coding sequence: 75-584 

1 11 21 31 41 51 
| | | | | | 
GGCACGAGCC ACGATTCAGT CCCCTGGACT GTAGATAAAG ACCCTTTCTT GCCAGGTGCT 60 
GAGACAACCA CACTATGAGA GGCACTCCAG GAGACGCTGA TGGTGGAGGA AGGGCCGTCT 120 
ATCAATCAAT GTGTAAACCT ATTACTGGGA CTATTAATGA TTTGAATCAG CAAGTGTGGA 180 
CCCTTCAGGG TCAGAACCTT GTGGCAGTTC CACGAAGTGA CAGTGTGACC CCAGTCACTG 240 
TTGCTGTTAT CACATGCAAG TATCCAGAGG CTCTTGAGCA AGGCAGAGGG GATCCCATTT 300 
ATTTGGGAAT CCAGAATCCA GAAATGTGTT TGTATTGTGA GAAGGTTGGA GAACAGCCCA 360 
CATTGCAGCT AAAAGAGCAG AAGATCATGG ATCTGTATGG CCAACCCGAG CCCGTGAAAC 420 
CCTTCCTTTT CTACCGTGCC AAGACTGGTA GGACCTCCAC CCTTGAGTCT GTGGCCTTCC 480 
CGGACTGGTT CATTGCCTCC TCCAAGAGAG ACCAGCCCAT CATTCTGACT TCAGAACTTG 540 
GGAAGTCATA CAACACTGCC TTTGAATTAA ATATAAATGA CTGAACTCAG CCTAGAGGTG 600 
GCAGCTTGGT CTTTGTCTTA AAGTTTCTGG TTCCCAATGT GTTTTOGTCT ACATTTTCTT 660 
AGTGTCATTT TCACGCTGGT GCTGAGACAG GGGCAAGGCT GCTGTTATCA TCTCATTTTA 720 
TAATGAAGAA GAAGCAATTA CTTCATAGCA ACTGAAGAAC AGGATGTGGC CTCAGAAGCA 780 
GGAGAGCTGG GTGGTATAAG GCTGTCCTCT CAAGCTGGTG CTGTGTAGGC CACAAGGCAT 840 
CTGCATGAGT GACTTTAAGA CTCAAAGACC AAACACTGAG CTTTCTTCTA GGGGTGGGTA 900 
TGAAGATGCT TCAGAGCTCA TGCGCGTTAC CCACGATGGC ATGACTAGCA CAGAGCTGAT 960 
CTCTGTTTCT GTTTTGCTTT ATTCCCTCTT GGGATGATAT CATCCAGTCT TTATATGTTG 1020 
CCAATATACC TCATTGTGTG TAATAGAACC TTCTTAGCAT TAAGACCTTG TAAACAAAAA 1080 
TAATTCTTGT GTTAAGTTAA ATCATTTTTG TCCTAATTGT AATGTGTAAT CTTAAAGTTA 1140 
AATAAACTTT GTGTATTTAT ATAATAAAAA AAAAAAAAAA AAA 

Seq ID NO: 55 Protein sequence: 
Protein Accession #: NP_062564 

1 11 21 31 41 51 
| | | | | | 
MRGTPGDADG GGRAVYQSMC KPITGTINDL NQQVWTLQGQ NLVAVPRSDS VTPVTVAVIT 60 
CKYPEALEQG RGDPIYLGIQ NPEMCLYCEK VGEQPTLQLK EQKIMDLYGQ PEPVKPFLFY 120 
RAKTGRTSTL ESVAFPDWFI ASSKRDQPII LTSELGKSYN TAFELNIND 

Seq ID NO: 56 DNA sequence 
Nucleic Acid Accession #: NM_003125 
Coding sequence: 65-334 



1 11 21 31 41 51 
| | | | | | 

AGCAGTTCTA AGGGACCATA CAGAGTATTC CTCTCTTCAC ACCAGGACCA GCCACTGTTG 60 

CAGCATGAGT TCCCAGCAGC AGAAGCAGCC CTGCATCCCA CCCCCTCAGC TTCAGCAGCA 120 

GCAGGTGAAA CAGCCTTGCC AGCCTCCACC TCAGGAACCA TGCATCCCCA AAACCAAGGA 180 
GCCCTGCCAC CCCAAGGTGC CTGAGCCCTG CCACCCCAAA GTGCCTGAGC CCTGCCAGCC 240 

CAAGCTTCCA GAGCCATGCC ACCCCAAGGT GCCTGAGCCC TGCCCTTCAA TAGTCACTCC 300 

AGCACCAGCC CAGCAGAAGA CCAAGCAGAA GTAATGTGGT CCACAGCCAT GCCCTTGAGG 360 

AGCOGGCCAC CAGATGCTGA ATCCCCTATC CCATTCTGTG TATGAGTCCC ATTTGCCTTG 420 

CAATTAGCAT TCTGTCTCCC CCAAAAAAGA ATGTGCTATG AAGCTTTCTT TCCTACACAC 480 

TCTGAGTCTC TGAATGAAGC TGAAGGTCTT AGTACCAGAG CTAGTTTTCA GCTGCTCAGA 540 

ATTCATCTGA AGAGAGACTT AAGATGAAAG CAAATGATTC AGCTCCCTTA TACCCCCATT 600 
AAATTCACTT TCAATTCCA 



Seq ID NO: 57 Protein sequence: 
Protein Accession #: NP_003116 

1 11 21 31 41 51 
| | | | | | 
MSSQQQKQPC IPPPQLQQQQ VKQPCQPPPQ EPCIPKTKEP CHPKVPEPCH PKVPEPCQPK 60 
LPEPCHPKVP EPCPSIVTPA PAQQKTKQK 

Seq ID NO: 58 DNA sequence 

Nucleic Acid Accession #: NM_001793.2 

Coding sequence: 71-2560 

1 11 21 31 41 51 

| | | | | | 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGAGG GGGACCATTT 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTGAT GTCAATGACC ATGGCCCAGT GCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC OGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 

GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 

TTTTTTTAAT GCTATCTTCA AAAOGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAGAA AAAAA 

Seq ID NO: 59 Protein sequence: 
Protein Accession #: NP_001784.2 

1 11 21 31 41 51 
| | | | | | 
MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 
VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



Seq ID NO: 60 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 162-428 



CATACGGACC 
AGCACCTOGG 
TCTCCCAGAG 
AGCGAAAGAA 
GTTTACTGTT 
GTAGAGTCAT 
GAGGTTAGAA 
AGATCATAAA 



11 

I 

GGCGGCGGAT 
GGATTGTTTT 
AAGCTGAGGC 
GAAGCAGATA 
GCCTCAACTT 
TGTTCATCGA 
TAACAAGGAG 
GTCAAAGAAC 
GACATTTTTT 



21 
I 

TCGAACGTTC 
CGCTGGCCCA 
AGCTGGTACT 
AAGCGGAAGG 
CGTCTGGAGA 
TTAGCAGAAG 
CATGTACTGG 
ATATTCTTGA 
ACACATCAGT 



31 
I 

GGACTGAGGT 
GTGTCCCCGG 
TGACAGAGAG 
CTCCCCGTGG 
AAAGTGGTGA 
AGTCCAGGAC 
CCGCAGCAAA 
AAGTTATGAT 
TAATATGGGA 



41 

! 

TTTTCTGCCT 
AGCTTGTGTG 
GATGGCGCTG 
CTTTCTAAAG 
CTTATTGGTC 
AAACGCTTGT 
GGTAATTCTA 
GCATTCTTTT 
TTATTAAATA 



51 
I 

GAAGAAGCGT 
CGATACAGAG 
TCGACCATAG 
CGAGTCTTCA 
CATCTGAACT 
GOGAGTAAAT 
AAGAAGAGCA 
GGGTGGTAAC 
TTGG 



Seq ID NO: 61 Protein sequence: 
Protein Accession #: Eos sequence 



11 



21 



31 



41 51 
I I 

MALSTIVSQR KQIKRKAPRG FLKRVFKRKK PQLRLEKSGD LLVHLNCLLF VHRLAEESRT 
NACASKCRVI NKEHVLAAAK VILKKSRG 

Seq ID NO: 62 DNA sequence 

Nucleic Acid Accession #: NM_000094.2 

Coding sequence: 99-8933 



1- 
1 

GGGCTGGAGG 
GAGGCGGGGG 
CCGCGCTCTG 
GAGTGACCTG 
CCATTGGCCG 
TCTCTGGAGC 
CACGGACAGA 
GTGAGCTTAG 
ACCATGTCTT 
CAGACGGGAA 
TCAAGCTATT 
CACAGCCAAC 
TGCCCCTCGT 
CGGATGACTC 
TGAGAGTACA 
CTCTGACGGG 
GTGAGACCAG 
TTGCCCTCTA 
TAGAAGGGCC 
GGAGTGTGCC 
CACAGCAGCA 
GCACGGACTA 
CCCTGATGGC 
CCACATCCAT 
GGCGGCGTGA 
GCTACCAGTT 
TGGAGGGCCA 
TGAGCCCTGT 
GGAGCCCAGT 
AGCGGACCCT 
GGCTTAGCTA 
TCCTCACTGT 
TGTCAGATGC 
GGATTAGCTG 
CTGCCACAGA 
TGCGAGGCAG 
CAGTGAGGAC 
GGGTTCCTGG 



11 

I 

GGCGCTGGGC 
TCCTAGCTGA 
CGCCGGGATC 
CACGCGCCTT 
CAGCAATTTC 
AGCCAGTGCA 
GTTCGGCCTG 
CTACAAGGGG 
CCTGCCCCAG 
GTCCCAGGAC 
TGCTGTGGGG 
CTCCGACTTC 
TTCCCGGAGA 
GACCTCTGCT 
GTGGACAGCG 
GCTGGGACAG 
TGTGCGGCTG 
CGCCAACAGC 
GGAACTGACC 
AGGTGCCACT 
GGAGCTGGGC 
TGAGGTGACC 
TCGCACTGAC 
CCTCCTTTCC 
GACTGGCTTG 
GGATGGGCTG 
CGAGGTGGCC 
AACAGACCTG 
CCCTGGTGCC 
GGTGCTTCCT 
CACTGTGCGG 
CCGCCGGGAG 
AACGCGAGTG 
GAGCACAGGC 
CATCACAGGG 
AGAGGAGGGC 
GGTCCATGTG 
CGCCACAGGA 



21 
I 

TCGGACCTGC 
CGGCTTTTAC 
CTGGCAGAGG 
TACGCCGCTG 
CGCGAGGTCC 
CAGGGTGTGC 
GATGCACTTG 
GGCAACACTC 
CTGGCCCGAC 
CTGGTGGACA 
ATCAAGAATG 
TTCTTCTTCG 
GTGTGCACGA 
CCACGAGACC 
GCCAGTGGCC 
CCACTGCCGA 
CGGGGTCTCC 
ATCGGGGAGG 
ATCCAGAATA 
GGCTACCGTG 
CCTGGGCAGG 
GTGAGCACCC 
GCTTCTGTTG 
TGGAACTTGG 
GAGCCACCGC 
CAGCCGGGCA 
ACCCCTGCAA 
CAAGCCACCG 
ACCCAGTACC 
GGGAGTCAGA 
GTGTCTGCTC 
CCGGAAACTC 
AGGGTGGCCT 
AGTGGTCCGG 
CTGCAGCCTG 
CCTGCTGCAG 
ACTCAGGCCA 
TACAGGGTTT 



60 
120 
160 
240 
300 
360 
420 
'480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 





CCCAGTTGGT TTCTGGGGAG GCCAOGGTGG CTGAGCTGGA TGGACTGGAG CCAGATACTG 2340 

AGTATACGGT GCATGTGAGG GCCCATGTGG CTGGCGTGGA TGGGCCCCCT GCCTCTGTGG 2400 

TTGTGAGGAC TGCCCCTGAG CCTGTGGGTC GTGTGTCGAG GCTGCAGATC CTCAATGCTT 2460 

CCAGCGACGT TCTACGGATC ACCTGGGTAG GGGTCACTGG AGCCACAGCT TACAGACTGG 2520 

CCTGGGGCCG GAGTGAAGGC GGCCCCATGA GGCACCAGAT ACTCCCAGGA AACACAGACT 2580 

CTGCAGAGAT CCGGGGTCTC GAAGGTGGAG TCAGCTACTC AGTGCGAGTG ACTGCACTTG 2640 

TCGGGGACCG CGAGGGCACA CCTGTCTCCA TTGTTGTCAC TACGCCGCCT GAGGCTCCGC 2700 

CAGCCCTGGG GACGCTTCAC GTGGTGCAGC GCGGGGAGCA CTOGCTGAGG CTGCGCTGGG 2760 

AGCCGGTGCC CAGAGCGCAG GGCTTCCTTC TGCACTGGCA ACCTGAGGGT GGCCAGGAAC 2820 

AGTCCCGGGT CCTGGGGCCC GAGCTCAGCA GCTATCACCT GGACGGGCTG GAGCCAGCGA 2880 

CACAGTACCG CGTGAGGCTG AGTGTCCTAG GGCCGGCTGG AGAAGGGCCC TCTGCAGAGG 2940 

TGACTGCQOG CACTGAGTCA CCTCGTGTTC CAAGCATTGA ACTACGTGTG GTGGACACCT 3000 

CGATCGACTC GGTGACTTTG GCCTGGACTC CAGTGTCCAG GGCATCCAGC TACATCCTAT 3060 

CCTGGCGGCC ACTCAGAGGC CCTGGCCAGG AAGTGCCTGG GTCCCCGCAG ACACTTCCAG 3120 

GGATCTCAAG CTCCCAGCGG GTGACAGGGC TAGAGCCTGG CGTCTCTTAC ATCTTCTCCC 3180 

TGACGCCTGT CCTGGATGGT GTGCGGGGTC CTGAGGCATC TGTCACACAG ACGCCAGTGT 3240 

GCCCCCGTGG CCTGGCGGAT GTGGTGTTCC TACCACATGC CACTCAAGAC AATGCTCACC 3300 

GTGCGGAGGC TACGAGGAGG GTCCTGGAGC GTCTGGTGTT GGCACTTGGG CCTCTTGGGC 3360 

CACAGGCAGT TCAGGTTGGC CTGCTGTCTT ACAGTCATCG GCCCTCCCCA CTGTrCCCAC 3420 

TGAATGGCTC CCATGACCTT GGCATTATCT TGCAAAGGAT CCGTGACATG CCCTACATGG 3480 

ACCCAAGTGG GAACAACCTG GGCACAGCCG TGGTCACAGC TCACAGATAC ATGTTGGCAC 3540 

CAGATGCTCC TGGGCGCCGC CAGCACGTAC CAGGGGTGAT GGTTCTGCTA GTGGATGAAC 3600 

CCTTGAGAGG TGACATATTC AGCCCCATCC GTGAGGCOCA GGCTTCTGGG CTTAATGTGG 3660 

TGATGTTGGG AATGGCTGGA GCGGACCCAG AGCAGCTGOG TCGCTTGGCG CCGGGTATGG 3720 

ACTCTGTCCA GACCTTCTTC GCCGTGGATG ATGGGCCAAG CCTGGACCAG GCAGTCAGTG 3780 

GTCTQGCCAC AGCCCTGTGT CAGGCATCCT TCACTACTCA GCCCCGGCCA QhGCCCTGCC 3840 

CAGTGTATTG TCCAAAGGGC CAGAAGGGGG AACCTGGAGA GATGGGCCTG AGAGGACAAG 3900 

TTGGGCCTCC TGGCGACCCT GGCCTCCCGG GCAGGACCGG TGCTCCCGGC CCCCAGGGGC 3960 

CCCCTGGAAG TGCCACTGCC AAGGGCGAGA GGGGCTTCCC TGGAGCAGAT GGGCGTCCAG 4020 

GCAGCCCTGG CCGOGCCGGG AATCCTGGGA CCCCTGGAGC CCCTGGCCTA AAGGGCTCTC 4080 

CAGGGTTGCC TGGCCCTCGT GGGGACCCGG GAGAGCGAGG ACCTCGAGGC CCAAAGGGGG 4140 

AGCCGGGGGC TCCGGGACAA GTCATCGGAG GTGAAGGACC TGGGCTTCCT GGGCGGAAAG 4200 

GGGACCCTGG ACCATCGGGC CCCCCTGGAC CTOGTGGACC ACTGGGGGAC CCAGGACCCC 4260 

GTGGCCCCCC AGGGCTTCCT GGAACAGCCA TGAAGGGTGA CAAAGGCGAT CGTGGGGAGC 4320 

GGGGTCCCCC TGGACCAGGT GAAGGTGGCA TTGCTCCTGG GGAGCCTGGG CTGCCGGGTC 4380 

TTCCCGGAAG CCCTGGACCC CAAGGCCCGG TTGGCCCCCC TGGAAAGAAA GGAGAAAAAG 4440 

GTGACTCTGA GGATGGAGCT CCAGGCCTCC CAGGACAACC TGGGTCTCCG GGTGAGCAGG 4500 

GCCCACGGGG ACCTCCTGGA GCTATTGGCC CCAAAGGTGA CCGGGGCTTT CCAGGGCCCC 4560 

TGGGTGAGGC TGGAGAGAAG GGCGAACGTG GACCCCCAGG CCCAGCGGGA TCCCGGGGGC 4620 

TGCCAGGGGT TGCTGGACGT CCTGGAGCCA AGGGTCCTGA AGGGCCACCA GGACCCACTG 4680 

GCCGCCAAGG AGAGAAGGGG GAGCCTGGTC GCCCTGGGGA CCCTGCAGTG GTGGGACCTG 4740 

CTGTTGCTGG ACCCAAAGGA GAAAAGGGAG ATGTGGGGCC CGCTGGGCCC AGAGGAGCTA 4800 

CCGGAGTCCA AGGGGAACGG GGCCCACCCG GCTTGGTTCT TCCTGGAGAC CCTGGCCCCA 4860 

AGGGAGACCC TGGAGACCGG GGTCCCATTG GCCTTACTGG CAGAGCAGGA CCCCCAGGTG 4920 

ACTCAGGGCC TCCTGGAGAG AAGGGAGACC CTGGGOGGCC TGGCCCCCCA GGACCTGTTG 4980 

GCCCCCGAGG AOGAGATGGT GAAGTTGGAG AGAAAGGTGA CGAGGGTCCT CCGGGTGACC 5040 

CGGGTTTGCC TGGAAAAGCA GGCGAGCGTG GCCTTCGGGG GGCACCTGGA GTTCGGGGGC 5100 

CTGTGGGTGA AAAGGGAGAC CAGGGAGATC CTGGAGAGGA TGGACGAAAT GGCAGCCCTG 5160 

GATCATCTGG ACCCAAGGGT GACCGTGGGG AGCCGGGTCC CCCAGGACCC CCGGGACGGC 5220 

TGGTAGACAC AGGACCTGGA GCCAGAGAGA AGGGAGAGCC TGGGGACCGC GGACAAGAGG 5280 

GTCCTOGAGG GCCCAAGGGT GATCCTGGCC TCCCTGGAGC CCCTGGGGAA AGGGGCATTG 5340 

AAGGGTTTOG GGGACCCCCA GGCCCACAGG GGGACCCAGG TGTCCGAGGC CCAGCAGGAG 5400 

AAAAGGGTGA COGGGGTCCC CCTGGGCTGG ATGGCCGGAG CGGACTGGAT GGGAAACCAG 5460 

GAGCCGCTGG GCCCTCTGGG CCGAATGGTG CTGCAGGCAA AGCTGGGGAC CCAGGGAGAG 5520 

ACGGGCTTCC AGGCCTCCGT GGAGAACAAG GCCTCCCTGG CCCCTCTGGT CCCCCTGGAT 5580 

TACCGGGAAA GCCAGGCGAG GATGGGAAAC CTGGCCTGAA TGGAAAAAAC GGAGAACCTG 5640 

GGGACCCTGG AGAAGACGGG AGGAAGGGAG AGAAAGGAGA TTCAGGCGCC TCTGGGAGAG 5700 

AAGGTCGTGA TGGCCCCAAG GGTGAGCGTG GAGCTCCTGG TATCCTTGGA CCCCAGGGGC 5760 

CTCCAGGCCT CCCAGGGCCA GTGGGCCCTC CTGGCCAGGG TTTTCCTGGT GTCCCAGGAG 5820 

GCACGGGCCC CAAGGGTGAC CGTGGGGAGA CTGGATCCAA AGGGGAGCAG GGCCTCCCTG 5880 

GAGAGCGTGG CCTGCGAGGA GAGCCTGGAA GTGTGCCGAA TGTGGATCGG TTGCTGGAAA 5940 

CTGCTGGCAT CAAGGCATCT GCCCTGCGGG AGATCGTGGA GACCTGGGAT GAGAGCTCTG 6000 

GTAGCTTCCT GOTGTGCCC GAACGGCGTC GAGGCCCCAA GGGGGACTCA GGCGAACAGG 6060 

GCCCCCCAGG CAAGGAGGGC CCCATCGGCT TTCCTGGAGA ACGCGGGCTG AAGGGCGACC 6120 

GTGGAGACCC TGGCCCTCAG GGGCCACCTG GTCTGGCCCT TGGGGAGAGG GGCCCCCCCG 6180 

GGCCTTCCGG CCTTGCCGGG GAGCCTGGAA AGCCTGGTAT TCCCGGGCTC CCAGGCAGGG 6240 

CTGGGGGTGT GGGAGAGGCA GGAAGGCCAG GAGAGAGGGG AGAACGGGGA GAGAAAGGAG 6300 

AACGTGGAGA ACAGGGCAGA GATGGCCCTC CTGGACTCCC TGGAACCCCT GGGCCCCCCG 6360 

GACCCCCTGG CCCCAAGGTG TCTGTGGATG AGCCAGGTCC TGGACTCTCT GGAGAACAGG 6420 

GACCCCCTGG ACTCAAGGGT GCTAAGGGGG AGCCGGGCAG CAATGGTGAC CAAGGTCCCA 6480 

AAGGAGACAG GGGTGTGCCA GGCATCAAAG GAGACCGGGG AGAGCCTGGA CCGAGGGGTC 6540 

AGGACGGCAA CCCGGGTCTA CCAGGAGAGC GTGGTATGGC TGGGCCTGAA GGGAAGCCGG 6600 

GTCTGCAGGG TCCAAGAGGC CCCCCTGGCC CAGTGGGTGG TCATGGAGAC CCTGGACCAC 6660 

CTGGTGCCCC GGGTCTTGCT GGCCCTGCAG GACCCCAAGG ACCTTCTGGC CTGAAGGGGG 6720 

AGCCTGGAGA GACAGGACCT CCAGGACGGG GCCTGACTGG ACCTACTGGA GCTGTGGGAC 6780 

TTCCTGGACC CCCCGGCCCT TCAGGCCTTG TGGGTCCACA GGGGTCTCCA GGTTTGCCTG 6840 

GACAAGTGGG GGAGACAGGG AAGCCGGGAG CCCCAGGTCG AGATGGTGCC AGTGGAAAAG 6900 

ATGGAGACAG AGGGAGCCCT GGTGTGCCAG GGTCACCAGG TCTGCCTGGC CCTGTCGGAC 6960 

CTAAAGGAGA ACCTGGCCCC ACGGGGGCCC CTGGACAGGC TGTGGTCGGG CTCCCTGGAG 7020 

CAAAGGGAGA GAAGGGAGCC CCTGGAGGCC TTGCTGGAGA CCTGGTGGGT GAGCCGGGAG 7080 

CCAAAGGTGA CCGAGGACTG CCAGGGCCGC GAGGCGAGAA GGGTGAAGCT GGCCGTGCAG 7140 

GGGAGCCCGG AGACCCTGGG GAAGATGGTC AGAAAGGGGC TCCAGGACCC AAAGGTTTCA 7200 

AGGGTGACCC AGGAGTCGGG GTCCCGGGCT CCCCTGGGCC TCCTGGCCCT CCAGGTGTGA 7260 

AGGGAGATCT GGGCCTCCCT GGCCTGCCCG GTGCTCCTGG TGTTGTTGGG TTCCCGGGTC 7320 

AGACAGGCCC TCGAGGAGAG ATGGGTCAGC CAGGCCCTAG TGGAGAGCGG GGTCTGGCAG 7380 

GCCCCCCAGG GAGAGAAGGA ATCCCAGGAC CCCTGGGGCC ACCTGGACCA CCGGGGTCAG 7440 

TGGGACCACC TGGGGCCTCT GGACTCAAAG GAGACAAGGG AGACCCTGGA GTAGGGCTGC 7500 
CTGGGCCCCG AGGCGAGCGT GGGGAGCCAG GCATCCGGGG TGAAGATGGC CGCCCCGGCC 7560 
AGGAGGGACC CCGAGGACTC ACGGGGCCCC CTGGCAGCAG GGGAGAGCGT GGGGAGAAGG 7620 
GTGATGTTGG GAGTGCAGGA CTAAAGGOTG ACAAGGGAGA CTCAGCTGTG ATCCTGGGGC 7680 
CTCCAGGCCC ACGGGGTGCC AAGGGGGACA TGGGTGAAOG AGGGCCTCGG GGCTTGGATG 7740 
GTGACAAAGG ACCTCGGGGA GACAATGGGG ACCCTGGTGA CAAGGGCAGC AAGGGAGAGC 7800 
CTGGTGACAA GGGCTCAGCC GGGTTGCCAG GACTGCGTGG ACTCCTGGGA CCCCAGGGTC 7860 
AACCTGGTGC AGCAGGGATC CCTGGTGACC CGGGATCCCC AGGAAAGGAT GGAGTGCCTG 7920 
GTATCCGAGG AGAAAAAGGA GATGTTGGCT TCATGGGTCC CCGGGGCCTC AAGGGTGAAC 7980 
GGGGAGTGAA GGGAGCCTGT GGCCTTGATG GAGAGAAGGG AGACAAGGGA GAAGCTGGTC 8040 
CCCCAGGCCG CCCCGGGCTG GCAGGACACA AAGGAGAGAT GGGGGAGCCT GGTGTGCOGG 8100 
GCCAGTOGGG GGCCCCTGGC AAGGAGGGCC TGATCGGTCC CAAGGGTGAC CGAGGCTTTG 8160 
ACGGGCAGCC AGGCCCCAAG GGTGACCAGG GCGAGAAAGG GGAGOGGGGA ACCCCAGGAA 8220 
TTCGGGGCTT CCCAGGCCCC AGTGGAAATG ATGGCTCTGC TGGTCCCCCA GGGCCACCTG 8280 
GCAGTGTTGG TCCCAGAGGC CCCGAAGGAC TTCAGGGCCA GAAGGGTGAG CGAGGTCCCC 8340 
CCGGAGAGAG AGTGGTGGGG GCTCCTGGGG TCCCTGGAGC TCCTGGCGAG AGAGGGGAGC 8400 
AGGGGCGGCC AGGGCCTGCC GGTCCTGGAG GCGAGAAGGG AGAAGCTGCA CTGACGGAGG 8460 
ATGACATCCG GGGCTTTGTG CGCCAAGAGA TGAGTCAGCA CTGTGCCTGC CAGGGCCAGT 8520 
TCATCGCATC TGGATCACGA CCCCTCCCTA GTTATGCTGC AGACACTGCC GGCTCCCAGC 8580 
TCCATGCTGT GCCTGTGCTC CGCGTCTCTC ATGCAGAGGA GGAAGAGCGG GTACCCCCTG 8640 
AGGATGATGA GTACTCTGAA TACTCCGAGT ATTCTGTGGA GGAGTACCAG GACOCTGAAG 8700 
CTCCTTGGGA TAGTGATGAC CCCTGTTCCC TGCCACTGGA TGAGGGCTCC            8760 
ACACCCTGCG CTGGTACCAT CGGGCTGTGA CAGGCAGCAC AGAGGCCTGT CACCCTTTTG 8820 
TCTATGGTGG CTGTGGAGGG AATGCCAACC GTTTTGGGAC CCGTGAGGCC TGCGAGCGCC 8880 
GCTGCCCACC CCGGGTGGTC CAGAGCCAGG GGACAGGTAC TGCCCAGGAC TGAGGCCCAG 8940 
ATAATGAGCT GAGATTCAGC ATCCCCTGGA GGAGTCGGGG TCTCAGCAGA ACCCCACTGT 9000 
CCCTCCCCTT GGTGCTAGAG GCTTGTGTGC ACGTGAGCGT GCGAGTGCAC GTCCGTTATT 9060 
TCAGTGACTT GGTCGCGTGG GTCTAGCCTT CCCCCCTGTG GACAAACCCC CATTGTGGCT 9120 
CCTGCCACCC TGGCAGATGA CTCACTGTGG GGGGGTGGCT GTGGGCAGTG AGCGGATGTG 9180 
ACTGGCGTCT GACCCGCCCC TTGACCCAAG CCTGTGATGA CATGGTGCTG ATTCTGGGGG 9240 
GCATTAAAGC TGCTGTTTTA AAAGGCAAAA AA 

Seq ID NO: 63 Protein sequence: 
Protein Accession #: NP_000085.1 






MTLRLLVAAL 
LEGLVLPPSG 
AAILHVADHV 
EELKRVASQP 
SEPSSQSLRV 
TEYQVTVIAL 
RVLSGGPTQQ 
LRPVILGPTS 
RLTLYTLLEG 
VRSTQGVERT 
VPGLRVWSD 
YQVAVSVLRG 
SAHGPEKSQL 
RLQILNASSD 
SVRVTALVGD 
QPEGGQEQSR 
ELRWDTSID 
GVSYIFSLTP 
LALGPLGPOA 
AHRYMLAPDA 
RRLAPGMDSV 
EMGLRGQVGP 
APGLKGSPGL 
PLGDPGPRGP 
PGKKGEKGDS 
GPAGSRGLPG 
PAGPRGATGV 
PGPPGPVGPR 
DGRNGSPGSS 
APGERGIEGF 
KAGDPGRDGL 



11 
I 

CAGILAEAPR 
AASAQGVRFA 
PLPQIARPGV 
TSDFPPPVND 
QWTAASGPVT 
YANSIGEAVS 
QELGPGQGSV 
ILLSWNLVPE 
HEVATPATW 
LVLPGSQTAF 
ATRVRVAWGP 
REEGPAAVIV 
VSGEATVAEL 
VLRITWVGVT 
REGTPVSIW 
VLGPELSSYH 
SVTLAWTPVS 
VLDGVRGPEA 
VQVGLLSYSH 
PGRRQHVPGV 
QTFFAVDDGP 
PGDPGLPGRT 



PGLPGTAMKG 
EDGAPGLPGQ 
VAGRPGAKGP 
QGERGPPGLV 



KGEQGLPGER 
KGBSGEQGPP 
IPGLPGRAGG 
PGLSGEQGPP 
AGPEGKPGLQ 
GPTGAVGLPG 
GLPGPVGPKG 
KGEAGRAGEP 
GWGFPGQTG 
GDPGVGI/PGP 
DSAVILGPPG 
GLIiGPQGQPG 
GDKGEAGPPG 
GERGTPGIGG 
APGERGEQGR 
ADTAGSQLHA 
DEGSCTAYTIi 
TAQD 



GPKGDRGEPG 
RGPPGPQGDP 
PGLRGEQGLP 
DGPKGERGAP 
GLRGEPGSVP 
GKEGPIGPPG 
VGEAGRPGER 
GLKGAKGEPG 
GPRGPPGPVG 
PPGPSGLVGP 
EPGPTGAPGQ 
GDPGEDGQKG 
PRGENGQPGP 



21 
I 

VRAQHRERVT 
TVQYSDDPRT 
PKVCILITDG 
FSILRTLLPL 
GYKVQYTPLT 
GTARTTALEG 
LLRDLEPGTD' 
ARGYRLEWRR 
PTGPELPV9P 
DLDDVQAGLS 
VPGASGFRIS 
ARTDPLGPVR 
DGLEPDTEYT 
GATAYRLAWG 
TTPPEAPPAL 
LDGLEPATQY 
RASSYILSWR 
SVTQTPVCPR 
RPSPLFPLNG 
MVLLVDEPLR 
SLDQAVSGLA 
GAPGPQGPPG 
GPRGPKGEPG 
DKGDRGERGP 
PGSPGEQGPR 
EGPPGPTGRQ 
LPGDPGPKGD 
DEGPPGDPGL 
PPGPPGRLVD 
GVRGPAGEKG 
GPSGPPGliPG 
GILGPQGPPG 
NVDRIiLETAG 



31 
I 

CTRLYAADIV 
EFGLDALGSG 
KSQDLVDTAA 
VSRRVCTTAG 
GLGQPLPSER 
PELTIQNTTA 
YEVTVSTLFG 
ETGLEPPQKV 
VTDI#QATELP 
YTVRVSARVG 
WSTGSGPESS 
TVHVTQASSS 
VHVRAHVAGV 
RSEGGPMRHQ 
GTLHWQRGE 
RVRLSVLGPA 
PLRGPGQEVP 
GLADWPLPH 
SHDLGIILQR 
GDIFSPIREA 
TALCQASPTT 
SATAKGERGF 
APGQVIGGEG 
PGPGEGGIAP 
GPPGAIGPKG 



PGDRGPIGLT 



GHGDPGPPGA 
QGSPGLPGQV 
AWGLPGAKG 
APGPKGFKGD 



PRGARGDMGE 
AAGIPGDPGS 
RPGLAGHKGE 
PPGPSGNDGS 



GEDGRPGOEG 
RGPRGLDGDK 
PGKDGVPGIR 
MGEPGVPGQS 
AGPPGPPGSV 
GEAALTEDDI 



TGPGAREKGB 
DRGPPGIiDGR 
KPGEDGKPGL 
LPGPVGPPGQ 
IKASALREIV 
PGPQGPPGLA 
EQGRDGPPGL 
RGVPGIKGDR 
PGLAGPAGPQ 
GETGKPGAPG 
EKGAPGGLAG 
PGVGVPGSPG 
GREGIPGPLG 
PRGLTGPPGS 



41 
! 

FLLDGSSSIG 
GDVIRAIREL 
QRLKGQGVKL 
GVPVTRPPDD 
QEVKVPAGET 
HSLLVAWRSV 
RSVGPATSLM 
VLPSDVTRYQ 
GQRVRVSWSP 
PREGSASVLT 
QTLPPDSTAT 
SVTITWTRVP 
DGPPASVWR 
ILPGNTDSAE 
HSLRLRWEPV 
GEGPSAEVTA 
GSPQTLPGIS 
ATQDNAHRAE 
IRDMPYMDPS 
QASGLNWML 
QPRPEPCPVY 
PGADGRPGSP 
PGLPGRKGDP 
GEPGLPGLPG 
DRGFPGPLGE 
DPAWGPAVA 
GRAGPPGDSG 
GAPGVRGPVG 
PGDRGQEGPR 
SGLDGKPGAA 



GPPGVPGGTG 



LGERGPPGPS 
PGTPGPPGPP 
GEPGPRGQDG 
GPSGLKGEPG 
RDGASGKDGD 
DLVGEPGAKG 
PPGPPGVKGD 
PPGPPGSVGP 



GEKGDVGFMG 
GAPGKEGLIG 
GPRGPEGLOG 



PRGLKGERGV 
PKGDRGFDGQ 



VPVLRVBHAE 
RWYHRAVTGS 



TEACHPFVYG 



EYSEYSEYSV 
GCGGNANRFG 



HCACQGQFIA 
EEYQDPEAPW 
TREACERRCP 



CGCCCCGGCC 


7560 


GGGGAGAAGG 


7620 


ATCCTGGGGC 


7680 


GGCTTGGATG 


7740 


AAGGGAGAGC 


7800 


CCCCAGGGTC 


7860 


GGAGTGCCTG 


7920 


AAGGGTGAAC 


7980 


GAAGCTGGTC 


8040 


GGTGTGCOGG 


B100 


CGAGGCTTTG 


8160 


ACCCCAGGAA 


6220 


GGGCCACCTG 


8280 


CGAGGTCCCC 


8340 


AGAGGGGAGC 


8400 


CTGACGGAGG 


8460 


CAGGGCCAGT 


8520 


GGCTCCCAGC 


8580 


GTACCCCCTG 


8640 


GACOCTGAAG 


8700 




8760 


CACCCTTTTG 


8820 


TGCGAGCGCC 


8880 


TGAGGCCCAG 


8940 


ACCCCACTGT 


9000 


GTCCGTTATT 


9060 


CATTGTGGCT 


9120 


AGCGGATGTG 


9180 


ATTCTGGGGG 


9240 


51 
i 




i 

RSNFREVRSF 


60 


SYKGGNTRTG 


120 


FAVGIKNADP 


180 


STSAPRDLVL 


240 


SVRLRGLRPL 


300 


PGATGYRVTW 


360 


ARTDASVEQT 


420 


LDGLQPGTEY 


480 


VPGATQYRII 


540 


VRREPETPtiA 


600 


DITGLQPGTT 


660 


GATGYRVSWH 


720 


TAPEPVGRVS 


780 


IRGLEGGVSY 


840 


PRAQGFLLHW 


900 


RTESPRVPSI 


960 


SSQRVTGLEP 


1020 


ATRRVLERLV 


1080 


GNNLGTAWT 


1140 


GMAGADPEQL 


1200 


CPKGQKGEPG 


1260 


GRAGNPGTPG 


1320 


GPSGPPGPRG 


1380 


SPGPQGPVGP 


1440 


AGEKGERGPP 


1500 


GPKGEKGDVG 


1560 


PPGEKGDPGR 


1620 


EKGDQGDPGE 


1680 


GPKGDPGLPG 


1740 


GPSGPNGAAG 


1800 


GEDGRKGEKG 


1860 


PKGDRGETGS 


1920 


LPVPERRRGP 


1980 


GLAGEPGKPG 


2040 


GPKVSVDEPG 


2100 


NPGLPGERGM 


2160 


ETGPPGRGLT 


2220 


RGSPGVPGSP 


2260 


DRGLPGPRGB 


2340 


LGLPGLPGAP 


2400 


PGASGLKGDK 


2460 


GSAGLKGDXG 


2520 


KGSAGLPGLR 


2560 


KGACGLDGEK 


2640 


PGPKGDQGEK 


2700 


RWGAPGVPG 


2760 


SGSRPLPSYA 


2820 


DSDDPCSLPL 


2880 


PRWQSQGTG 


2940 
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Seq ID NO: 64 DNA sequence 
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Nucleic Acid Accession 
Coding sequence: 1-219 



ft: NM 006945 
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1 11 21 31 41 51 

I II I I I 

ATGTCTTATC AACAGCAGCA GTGCAAGCAG CCCTGCCAGC CACCTCCTQT GTGCCCCACQ 
CCAAAGTGCC CAGAGCCATG TCCACCCCCG AAGTGCCCTG AGCCCTGCCC ACCACCAAAG 
TGTCCACAGC CCTGCCCACC TCAGCAGTGC CAGCAGAAAT ATCCTCCTGT GACACCTTCC 
CCACOCTQCC AGCCAAAGTA TOCACCGAAG AGCAAGTAA 

Seq ID NO: 65 Protein sequence: 
Protein Accession #: NP_008876 

1 11 21 31 41 51 

I I I I I I 

MSYQQQQCKQ PCQPPPVCPT PKCPEPCPPP KCPEPCPPPK CPQPCPPQQC QQKYPPVTPS 
PPCQPKYPPK SK 

Seq ID NO: 66 DNA sequence 

Nucleic Acid Accession #: NM_005629.1 

Coding sequence: 639-2546 



1 11 21 31 41 51 

I I I I I I 

TAGTOGGAGC GAGGTGGCGA GTCGCTGAGC CCGCCGCGGC CCCGAGAGCG GCTGCAGCCG 60 
CCGCCGCCGG GAAGGAGAGG GCGAGGCGCG CCCGAGCCGC CGCCGCCGCC GCCACCGCCG 120 
CCGCCGCCAC CACCGCCACC GGAGTCGCGG GCCAGCCGGG CAGCCTCCGC GGGCCCCGGC 180 
                      ACAGGCCCCT GCTCCGGCCG TCGTTTGCAG ACCGCGGGCG 240 
CCGATGTCGC CCGCGCCCCG TTAGGATGAG TCTCGGGTCG GGCGAGGAGC CGCCGCAGCC 300 
GCCGCCGCCC GAGCCGCGGG CAGGAGCCTC GGGAGCCGCC GCCGCCGCCG CCGCCGCCCG 360 
GCCGGGCCCC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 420 
CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCGCC            CGCACCCCGC 480 
CGTCCGCCCG CCGCCCCGTC CCCCGGCCCG GCCGCCCCCC GGCCCCCGGC CGGCCOGCGC 540 
CCTCGGGGCC CTCCCCGGTG CCGCCGGTGC CCCCCGCCTG ACCGCCGCCC CCCGTGAGGC 600 
GCCGCGACCC CGGCCCGGCC GTGCGGCCCG CCGGGGCCAT GGCGAAGAAG AGCGCCGAGA 660 
ACGGCATCTA TAGCGTGTCC GGCGACGAGA AGAAGGGCCC CCTCATCGOG CCCGGGCCCG 720 
ACGGGGCOCC GGCCAAGGGC GACGGCCCCG TGGGCCTGGG GACACCCGGC GGCCGCCTGG 780 
CCGTGCCGCC GCGCGAGACC TGGACGCGCC AGATGGACTT CATCATGTGG TGCGTGGGCT 840 
           CTTGGGCAAC GTGTGGOGCT TCCCCTACCT GTGCTACAAG AACGGCGGAG 900 
GTGTGTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGGAATCCCC ATTTTCTTCT 960 
TAGAGATCTC GCTGGGCCAG TTCATGAAGG CCGGCAGCAT CAATGTCTGG AACATCTGTC 1020 
GCCTGTTCAA AGGCCTGGGC TACGCCTCCA TGGTGATCGT CTTCTACTGC AACACCTACT 1080 
ACATCATGGT GCTGGCCTGG GGCTTCTATT ACCTGGTCAA GTCCTTTACC ACCACGCTGC 1140 
CCTGGGCCAC ATGTGGCCAC ACCTGGAACA CTCCCGACTG CGTGGAGATC TTCOGCCATG 1200 
AAGACTGTGC CAATGCCAGC CTGGCCAACC TCACCTGTGA CCAGCTTGCT GACCGCCGGT 1260 
CCCCTGTCAT CGAGTTCTGG GAGAACAAAG TCTTGAGGCT GTCTGGGGGA CTGGAGGTGC 1320 
CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 1380 
TCTGTGTCTG GAAGGGGGTC AAATCCACGG GAAAGATCGT GTACTTCACT GCTACATTCC 1440 
CCTACGTGGT CCTGGTCGTG CTGCTGGTGC GTGGAGTGCT GCTGCCTGGC GCCCTGGATG 1500 
GCATCATTTA CTATCTCAAG CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAG 1560 
ATGCGGGGAC CCAGATTTTC TTTTCTTACG CCATTGGCCT GGGGGCCCTC ACAGCCCTGG 1620 
GCAGCTACAA CCGCTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCATCA 1680 
ACAGTGGGAC CAGCTTCTTT GCTGGCTTCG TGGTCTTCTC CATCCTGGGC TTCATGGCTG 1740 
CAGAGCAGGG CGTGCACATC TCCAAGGTGG CAGAGTCAGG GCCGGGCCTG GCCTTCATCG 1800 
CCTACCCGCG GGCTGTCACG CTGATGCCAG TGGCCCCACT CTGGGCTGCC CTGTTCTTCT 1860 
TCATGCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGGAGGGC TTCATCACCG 1920 
GCCTCCTCGA CCTCCTCCCG GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 1980 
CCCTCTGTTG TGCCCTCTGC TTTGTCATOG ATCTCTCCAT GGTGACTGAT GGCGGGATGT 2040 
ACGTCTTCCA GCTGTTTGAC TACTACTCGG CCAGCGGCAC CACCCTGCTC TGGCAGGCCT 2100 
TTTGGGAGTG CGTGGTGGTG GCCTGGGTGT ACGGAGCTGA CCGCTTCATG GACGACATTG 2160 
CCTGTATGAT CGGGTACCGA CCTTGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCACCC 2220 
CGCTGGTCTG CATGGGCATC TTCATCTTCA ACGTTGTGTA CTACGAGCCG CTGGTCTACA 2280 
ACAACACCTA CGTGTACCCG TGGTGGGGTG AGGCCATGGG CTGGGCCTTC GCCCTGTCCT 2340 
CCATGCTGTG CGTGCCGCTG CACCTCCTGG GCTGCCTCCT CAGGGCCAAG GGCACCATGG 2400 
CTGAGCGCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CCACCACTTG GAGTACCGAG 2460 
CTCAGGACGC AGATGTCAGG GGCCTGACCA CCCTGACCCC AGTGTCCGAG AGCAGCAAGG 2520 
TCGTCGTGGT GGAGAGTGTC ATGTGACAAC TCAGCTCACA TCACCAGCTC ACCTCTGGTA 2580 
GCCATAGCAG CCCCTGCTTC AGCCCCACCG CACCCCTCCA GGGGGCCTGC CTTTCCCTGA 2640 
CACTTTTGGG GTCTGCCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 
ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACCAAAAA TAGATGCCTC 2760 
TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCCCGCC TAGTGCCCCA CCCCCACCCA 2820 
CAGTGCTGCA CTCCTCCTGC CCCTGCCACG CCCACCCCCT GCCCACCTCT CCAGGCTCTG 2880 
CTCTGCAGCA CACCCGTGGG TGACCCCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940 
TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAGAAG GGAGGCAGGG 3000 
GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCCCTCTC CCCATCCCTG 3060 
TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT GGTTCCTGCG CCAATCGCCA 3120 
CCAGTATCAA TTGTGTGAGC TTGGGTGCGA GTGCACGCGT GCGTGAGTAC GGAGAGTATA 3180 
TATAGATCTC TATCTCTTAG CAAAGGTGAA TGCCAGATGT AAATGGCGCC TCTGGGCAAA 3240 
GGAGGCTTGT ATTTTGCACA TTTTATAAAA ACTTGAGAGA ATGAGATTTC TGCTTGTATA 3300 
TTTCTAAAAA GAGGAAGGAG CCCAAACCAT CCTCTCCTTA CCACTCCCAT CCCTGTGAGC 3360 
CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAG 3420 
GCAAAACAAA AGCTTCGAGC TGTTGCGTGT GTGAGTCTGT TGTGTGGATG TGCGTGTGTG 3480 
GTCCCCAGCC CCAGACTGGA TTGGAAAAGT GCATGGTGGG GGCCTCGGGG CTGTCCCCAC 3540 
GCTGTCCCTT TGCCACAAGT CTGTGGGGCA AGAGGCTGCA ATATTCCGTC CTGGGTGTCT 3600 
GGGCTGCTAA CCTGGCCTGC TCAGGCTTCC CACCCTGTGC GGGGCACACC CCCAGGAAGG 3660 
GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTGGAT GCACTTCCCG CACCTCCAGT 3720 
CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAOTCCOGA GACGGCTGAG 3780 
TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 
GGGTGGCGGG CCTGCGGGGA CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA 3900 
AAAACATGTC ATTTTCC 



Seq ID NO: 67 Protein sequence: 
Protein Accession #: NP_005620.1 



1 11 21 31 41 51 

I I I I I I 

MAKKSAENGI YSVSGDEKKG PLIAPGPDGA PAKGDGPVGL GTPGGRLAVP PRETWTRQMD 60 
FIMSCVGFAV GLGNVWRFPY LCYKNGGGVF LIPYVLIALV GGIPIFFLEI SLGQFMKAGS 120 
INVWNICRLF KGLGYASMVI VFYCNTYYIM VLAWGFYYLV KSFTTTLPWA TCGHTWNTPD 180 
CVEIFRHEDC ANASLANLTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 240 
ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLVVLLVRGV LLPGALDGII YYLKPDWSKL 300 
GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNNNCYKD AIILALINSG TSFFAGFVVF 360 
SILGFMAAEQ GVHISKVAES GPGLAFIAYP RAVTLMPVAP LKAALFFFML LLLGLDSQFV 420 
GVEGFITGLL DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 480 
TTLLWQAFWE CVVVAWVYGA DRFMDDIACM IGYRPCPWMK WCWSFFTPLV CMGIFIFNVV 540 
YYEPLVYNNT YVYPWWGEAM GWAFALSSML CVPLHLLGCL LRAKGTMAER WQHLTQPIWG 600 
LHHLEYRAQD ADVRGLTTLT PVSESSKVVV VESVM 


Seq ID NO: 68 DNA sequence 

Nucleic Acid Accession #: NM_021953. 

Coding sequence: 178-2469 



1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCOGT CCGGGGCCCT GGCTCGGCCC 60 
CCAGGTTGGA GGAGCCCGGA GCCOGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 
CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAO ATTCATAATG 180 
AAAGCTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 
AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 
AATCAAGCAG AGGCCTCCAA GGAAGTGGCG GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 
ATGAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 
GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 
GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 
CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 
GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 
CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 
TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 
CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 
CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 
ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 
ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 
AAGAACTCCA TCCGCCACAA CCTTTCCCTG CAOGACATGT TTGTCCGGGA GACGTCTGCC 1080 
AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 
CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 1200 
CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 1260 
GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 
CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 
GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 
AAGGTGCTGC TAGCTGAGGA GGGGATAGCT CCTCTTTCTT CTGCAGGACC AGGGAAAGAG 1500 
GAGAAACTCC TGTTTGGAGA AGGGTTTTCT            CAGTTCAGAC TATCAAGGAG 1560 
GAAGAAATCC AGCCTGGGGA GGAAATGCCA CACTTAGCGA GACCCATCAA AGTGGAGAGC 1620 
CCTCCCTTGG AAGAGTGGCC CTCCCCGGCC CCATCTTTCA AAGAGGAATC ATCTCACTCC 1680 
TGGGAGGATT CGTCCCAATC TCCCACCCCA AGACCCAAGA AGTCCTACAG TGGGCTTAGG 1740 
TCCCCAACCC GGTGTGTCTC GGAAATGCTT GTGATTCAAC ACAGGGAGAG GAGGGAGAGG 1800 
AGCCGGTCTC GGAGGAAACA GCATCTACTG CCTCCCTGTG TGGATGAGCC GGAGCTGCTC 1860 
TTCTCAGAGG GGCCCAGTAC TTCCCGCTGG GCCGCAGAGC TCCCGTTCCC AGCAGACTCC 1920 
TCTGACCCTG CCTCCCAGCT CAGCTACTCC CAGGAAGTGG GAGGACCTTT TAAGACACCC 1980 
ATTAAGGAAA CGCTGCCCAT CTCCTCCACC CCGAGCAAAT CTGTCCTCCC CAGAACCCCT 2040 
GAATCCTGGA GGCTCACGCC CCCAGCCAAA GTAGGGGGAC TGGATTTCAG CCCAGTACAA 2100 
ACCTCCCAGG GTGCCTCTGA CCCCTTGCCT GACCCCCTGG GGCTGATGGA TCTCAGCACC 2160 
ACTCCCTTGC AAAGTGCTCC CCCCCTTGAA TCACCGCAAA GGCTCCTCAG TTCAGAACCC 2220 
TTAGACCTCA TCTCCGTCCC CTTTGGCAAC TCTTCTCCCT CAGATATAGA CGTCCCCAAG 2280 
CCAGGCTCCC CGGAGCCACA GGTTTCTGGC CTTGCAGCCA ATCGTTCTCT GACAGAAGGC 2340 
CTGGTCCTGG ACACAATGAA TGACAGCCTC AGCAAGATCC TGCTGGACAT CAGCTTTCCT 2400 
GGCCTGGACG AGGACCCACT GGGCCCTGAC AACATCAACT GGTCCCAGTT TATTCCTGAG 2460 
CTACAGTAGA GCCCTGCCCT TGCCCCTGTG CTCAAGCTGT CCACCATCCC GGGCACTCCA 2520 
AGGCTCAGTG CACCCCAAGC CTCTGAGTGA GGACAGCAGG CAGGGACTGT TCTGCTCCTC 2580 
ATAGCTCCCT GCTGCCTGAT TATGCAAAAG TAGCAGTCAC ACCCTAGCCA CTGCTGGGAC 2640 
CTTGTGTTCC CCAAGAGTAT CTGATTCCTC            GCCAGGAGCT GAAGGGTGGG 2700 
AACAACAAAG GCAATGGTGA AAAGAGATTA GGAACCCCCC AGCCTGTTTC CATTCTCTGC 2760 
CCAGCAGTCT CTTACCTTCC CTGATCTTTG CAGGGTGGTC CGTGTAAATA GTATAAATTC 2820 
TCCAAATTAT CCTCTAATTA TAAATGTAAG CTTATTTCCT TAGATCATTA TCCAGAGACT 2880 
GCCAGAAGGT GGGTAGGATG ACXTTGGGGTT TCAATTGACT TCTGTTCCTT GCTTTTAGTT 2940 
TTGATAGAAG GGAAGACCTG CAGTGCACGG TTTCTTCCAG GCTGAGGTAC CTGGATCTTG 3000 
GGTTCTTCAC TGCAGGGACC CAGACAAGTG GATCTGCTTG CCAGAGTCCT TTTTGCCXX^ 3060 
CCCTGCCACC TCCCOGTGTT TCCAAGTCAG CTTTCCTGCA AGAAGAAATC CTGGTTAAAA 3120 
AAGTCTTTTG TATTGGGTCA GGAGTTGAAT TTGGGGTGGG AGGATGGATG CAACTGAAGC 3180 
AGAGTGTGGG TGCCCAGATG TGCGCTATTA GATGTTTCTC TGATAATGTC CCCAATCATA 3240 
CCAGGGAGAC TGGCATTGAC GAGAACTCAG GTGGAGGCTT GAGAAGGCCG AAAGGGCCCC 3300 
TGACCTGCCT GGCTTCCTTA GCTTGCCCCT CAGCTTTGCA AAGAGCCACC CTAGGCCCCA 3360 
GCTGACCGCA TGGGTGTGAG CCAGCTTGAG AACACTAACT ACTCAATAAA AGCGAAGGTG 3420 
GACCNAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 69 Protein sequence: 
Protein Accession #: NP_068772.1 

1 11 21 31 41 51 

I I I I I I 

MKASPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

GIKIINHPTM PNTQVVAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HPSANRYLTL DQVFKPLDPG SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

LGARRKMKPL LPRVSSYLVP IQFPVNQSLV LQPSVKVPLP LAASLMSSEL ARHSKRVRIA 420 

PKVLLAEEGI APLSSAGPGK EEKLLFGEGF SPLLPVQTIK EEEIQPGEEM PHLARPIKVE 480 

SPPLEEWPSP APSFKEESSH SWEDSSQSPT PRPKKSYSGL RSPTRCVSEM LVIQHRERRE 540 

RSRSRRKQHL LPPCVDEPEL LFSEGPSTSR WAAELPFPAD SSDPASQLSY SQEVGGPFKT 600 

PIKETLPISS TPSKSVLPRT PESWRLTPPA KVGGLDFSPV QTSQGASDPL PDPLGLMDLS 660 

TTPLQSAPPL ESPQRLLSSE PLDLISVPFG NSSPSDIDVP KPGSPEPQVS GLAANRSLTE 720 
GLVLDTMNDS LSKILLDISF PGLDEDPLGP DNINWSQFIP ELQ 



Seq ID NO: 70 DNA sequence 

Nucleic Acid Accession #: BC006529.1 

Coding sequence: 178-2424 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCAGCAGAA ACGACCGAAT CCAGAGCTCC GCCGGAACAT GACCATCAAA 1200 

ACCGAACTCC CCCTGGGCGC ACGGCGGAAG ATGAAGCCAC TGCTACCACG GGTCAGCTCA 1260 

TACCTGGTAC CTATCCAGTT CCCGGTGAAC CAGTCACTGG TGTTGCAGCC CTCGGTGAAG 1320 

GTGCCATTGC CCCTGGCGGC TTCCCTCATG AGCTCAGAGC TTGCCCGCCA TAGCAAGCGA 1380 

GTCCGCATTG CCCCCAAGGT GCTGCTAGCT GAGGAGGGGA TAGCTCCTCT TTCTTCTGCA 1440 

GGACCAGGGA AAGAGGAGAA ACTCCTGTTT GGAGAAGGGT TTTCTCCTTT GCTTCCAGTT 1500 

CAGACTATCA AGGAGGAAGA AATCCAGCCT GGGGAGGAAA TGCCACACTT AGCGAGACCC 1560 

ATCAAAGTGG AGAGCCCTCC CTTGGAAGAG TGGCCCTCCC CGGCCCCATC TTTCAAAGAG 1620 

GAATCATCTC ACTCCTGGGA GGATTCGTCC CAATCTCCCA CCCCAAGACC CAAGAAGTCC 1680 

TACAGTGGGC TTAGGTCCCC AACCCGGTGT GTCTCGGAAA TGCTTGTGAT TCAACACAGG 1740 

GAGAGGAGGG AGAGGAGCCG GTCTCGGAGG AAACAGCATC TACTGCCTCC CTGTGTGGAT 1800 

GAGCCGGAGC TGCTCTTCTC AGAGGGGCCC AGTACTTCCC GCTGGGCCGC AGAGCTCCCG 1860 

TTCCCAGCAG ACTCCTCTGA CCCTGCCTCC CAGCTCAGCT ACTCCCAGGA AGTGGGAGGA 1920 

CCTTTTAAGA CACCCATTAA GGAAACGCTG CCCATCTCCT CCACCCCGAG CAAATCTGTC 1980 

CTCCCCAGAA CCCCTGAATC CTGGAGGCTC ACGCCCCCAG CCAAAGTAGG GGGACTGGAT 2040 

TTCAGCCCAG TACAAACCCC CCAGGGTGCC TCTGACCCCT TGCCTGACCC CCTGGGGCTG 2100 

ATGGATCTCA GCACCACTCC CTTGCAAAGT GCTCCCCCCC TTGAATCACC GCAAAGGCTC 2160 

CTCAGTTCAG AACCCTTAGA CCTCATCTCC GTCCCCTTTG GCAACTCTTC TCCCTCAGAT 2220 

ATAGACGTCC CCAAGCCAGG CTCCCCGGAG CCACAGGTTT CTGGCCTTGC AGCCAATCGT 2280 

TCTCTGACAG AAGGCCTGGT CCTGGACACA ATGAATGACA GCCTCAGCAA GATCCTGCTG 2340 

GACATCAGCT TTCCTGGCCT GGACGAGGAC CCACTGGGCC CTGACAACAT CAACTGGTCC 2400 

CAGTTTATTC CTGAGCTACA GTAGAGCCCT GCCCTTGCCC CTGTGCTCAA GCTGTCCACC 2460 

ATCCCGGGCA CTCCAAGGCT CAGTGCACCC CAAGCCTCTG AGTGAGGACA GCAGGCAGGG 2520 

ACTGTTCTGC TCCTCATAGC TCCCTGCTGC CTGATTATGC AAAAGTAGCA GTCACACCCT 2580 

AGCCACTGCT GGGACCTTGT GTTCCCCAAG AGTATCTGAT TCCTCTGCTG TCCCTGCCAG 2640 

GAGCTGAAGG GTGGGAACAA CAAAGGCAAT GGTGAAAAGA GATTAGGAAC CCCCCAGCCT 2700 

GTTTCCATTC TCTGCCCAGC AGTCTCTTAC CTTCCCTGAT CTTTGCAGGG TGGTCCGTGT 2760 

AAATAGTATA AATTCTCCAA ATTATCCTCT AATTATAAAT GTAAGCTTAT TTCCTTAGAT 2820 

CATTATCCAG AGACTGCCAG AAGGTGGGTA GGATGACCTG GGGTTTCAAT TGACTTCTGT 2880 

TCCTTGCTTT TAGTTTTGAT AGAAGGGAAG ACCTGCAGTG CACGGTTTCT TCCAGGCTGA 2940 

GGTACCTGGA TCTTGGGTTC TTCACTGCAG GGACCCAGAC AAGTGGATCT GCTTGCCAGA 3000 

GTCCTTTTTG CCCCTCCCTG CCACCTCCCC GTGTTTCCAA GTCAGCTTTC CTGCAAGAAG 3060 

AAATCCTGGT TAAAAAAGTC TTTTGTATTG GGTCAGGAGT TGAATTTGGG GTGGGAGGAT 3120 

GGATGCAACT GAAGCAGAGT GTGGGTGCCC AGATGTGCGC TATTAGATGT TTCTCTGATA 3180 

ATGTCCCCAA TCATACCAGG GAGACTGGCA TTGACGAGAA CTCAGGTGGA GGCTTGAGAA 3240 

GGCCGAAAGG GCCCCTGACC TGCCTGGCTT CCTTAGCTTG CCCCTCAGCT TTGCAAAGAG 3300 

CCACCCTAGG CCCCAGCTGA CCGCATGGGT GTGAGCCAGC TTGAGAACAC TAACTACTCA 3360 
ATAAAAGCGA AGGTGGAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 71 Protein sequence: 
Protein Accession #: AAH06529.1 

1 11 21 31 41 51 

I I I I I I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

GIKIINHPTM PNTQVVAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HPSANRYLTL DQVFKQQKRP NPELRRNMTI KTELPLGARR KMKPLLPRVS 360 

SYLVPIQFPV NQSLVLQPSV KVPLPLAASL MSSELARHSK RVRIAPKVLL AEEGIAPLSS 420 

AGPGKEEKLL FGEGFSPLLP VQTIKEEEIQ PGEEMPHLAR PIKVESPPLE EWPSPAPSFK 480 

EESSHSWEDS SQSPTPRPKK SYSGLRSPTR CVSEMLVIQH RERRERSRSR RKQHLLPPCV 540 

DEPELLFSEG PSTSRWAAEL PFPADSSDPA SQLSYSQEVG GPFKTPIKET LPISSTPSKS 600 

VLPRTPESWR LTPPAKVGGL DFSPVQTPQG ASDPLPDPLG LMDLSTTPLQ SAPPLESPQR 660 

LLSSEPLDLI SVPFGNSSPS DIDVPKPGSP EPQVSGLAAN RSLTEGLVLD TMNDSLSKIL 720 
LDISFPGLDE DPLGPDNINW SQFIPELQ 



Seq ID NO: 72 DNA sequence 
Nucleic Acid Accession #: U74612. 
Coding sequence: 178-2583 



1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGCCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 1200 

CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 1260 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTTTTTG GGGAACAGGT GGTGTTTGGT TACATGAGTA AGTTCTTTAG TGGCGATCTG 1500 

CGAGATTTTG GTACACCCAT CACCAGCTTG TTTAATTTTA TCTTTCTTTG TTTATCAGTG 1560 

CTGCTAGCTG AGGAGGGGAT AGCTCCTCTT TCTTCTGCAG GACCAGGGAA AGAGGAGAAA 1620 

CTCCTGTTTG GAGAAGGGTT TTCTCCTTTG CTTCCAGTTC AGACTATCAA GGAGGAAGAA 1680 

ATCCAGCCTG GGGAGGAAAT GCCACACTTA GCGAGACCCA TCAAAGTGGA GAGCCCTCCC 1740 

TTGGAAGAGT GGCCCTCCCC GGCCCCATCT TTCAAAGAGG AATCATCTCA CTCCTGGGAG 1800 

GATTCGTCCC AATCTCCCAC CCCAAGACCC AAGAAGTCCT ACAGTGGGCT TAGGTCCCCA 1860 

ACCCGGTGTG TCTCGGAAAT GCTTGTGATT CAACACAGGG AGAGGAGGGA GAGGAGCCGG 1920 

TCTCGGAGGA AACAGCATCT ACTGCCTCCC TGTGTGGATG AGCCGGAGCT GCTCTTCTCA 1980 

GAGGGGCCCA GTACTTCCCG CTGGGCCGCA GAGCTCCCGT TCCCAGCAGA CTCCTCTGAC 2040 

CCTGCCTCCC AGCTCAGCTA CTCCCAGGAA GTGGGAGGAC CTTTTAAGAC ACCCATTAAG 2100 

GAAACGCTGC CCATCTCCTC CACCCCGAGC AAATCTGTCC TCCCCAGAAC CCCTGAATCC 2160 

TGGAGGCTCA CGCCCCCAGC CAAAGTAGGG GGACTGGATT TCAGCCCAGT ACAAACCTCC 2220 

CAGGGTGCCT CTGACCCCTT GCCTGACCCC CTGGGGCTGA TGGATCTCAG CACCACTCCC 2280 

TTGCAAAGTG CTCCCCCCCT TGAATCACCG CAAAGGCTCC TCAGTTCAGA ACCCTTAGAC 2340 

CTCATCTCCG TCCCCTTTGG CAACTCTTCT CCCTCAGATA TAGACGTCCC CAAGCCAGGC 2400 

TCCCCGGAGC CACAGGTTTC TGGCCTTGCA GCCAATCGTT CTCTGACAGA AGGCCTGGTC 2460 

CTGGACACAA TGAATGACAG CCTCAGCAAG ATCCTGCTGG ACATCAGCTT TCCTGGCCTG 2520 

GACGAGGACC CACTGGGCCC TGACAACATC AACTGGTCCC AGTTTATTCC TGAGCTACAG 2580 

TAGAGCCCTG CCCTTGCCCC TCTOCTCAAG CTGTCCACCA TCCCGGGCAC TCCAAGGCTC 2640 

AGTGCACCCC AAGCCTCTGA GTGAGGACAG CAGGCAGGGA CTGTTCTGCT CCTCATAGCT 2700 

CCCTGCTGCC TGATTATGCA AAAGTAGCAG TCACACCCTA GCCACTGCTG GGACCTTGTG 2760 

TTCCCCAAGA GTATCTGATT CCTCTGCTGT CCCTGCCAGG AGCTGAAGGG TGGGAACAAC 2820 

AAAGGCAATG GTGAAAAGAG ATTAGGAACC CCCCAGCCTG TTTCCATTCT CTGCCCAGCA 2880 

GTCTCTTACC TTCCCTGATC TTTGCAGGGT GGTCCGTGTA AATAGTATAA ATTCTCCAAA 2940 

TTATCCTCTA ATTATAAATG TAAGCTTATT TCCTTAGATC ATTATCCAGA GACTGCCAGA 3000 

AGGTGGGTAG GATGACCTGG GGTTTCAATT GACTTCTGTT CCTTGCTTTT AGTTTTGATA 3060 

GAAGGGAAGA CCTGCAGTGC ACGGTTTCTT CCAGGCTGAG GTACCTGGAT CTTGGGTTCT 3120 

TCACTGCAGG GACCCAGACA AGTGGATCTG CTTGCCAGAG TCCTTTTTGC CCCTCCCTGC 3180 

CACCTCCCCG TGTTTCCAAG TCAGCTTTCC TGCAAGAAGA AATCCTGGTT AAAAAAGTCT 3240 

TTTGTATTGG GTCAGGAGTT GAATTTGGGG TGGGAGGATG GATGCAACTG AAGCAGAGTO 3300 

TGGGTGCCCA GATGTGCGCT ATTAGATGTT TCTCTGATAA TGTCCCCAAT CATACCAGGG 3360 

AGACTGGCAT TGACGAGAAC TCAGGTGGAG GCTTGAGAAG GCCGAAAGGG CCCCTGACCT 3420 

GCCTGGCTTC CTTAGCTTGC CCCTCAGCTT TGCAAAGAGC CACCCTAGGC CCCAGCTGAC 3480 

CGCATGGGTG TGAGCCAGCT TGAGAACACT AACTACTCAA TAAAAGCGAA GGTGGACAAA 3540 
AAAAAAAAAA AAAAA 



Seq ID NO: 73 Protein sequence: 
Protein Accession #: AAC51128.1 

1 11 21 31 41 51 

I I I I I I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

GIKIINHPTM PNTQVVAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HPSANRYLTL DQVFKPLDPG SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

LGARRKMKPL LPRVSSYLVP IQFPVNQSLV LQPSVKVPLP LAASLMSSEL ARHSKRVRIA 420 

PKVFGEQVVF GYMSKFFSGD LRDFGTPITS LFKFIFLCLS VLLAEEGIAP LSSAGPGKEE 480 

KLLFGEGFSP LLPVQTIKEE EIQPGEEMPH LARPIKVESP PLEEWPSPAP SFKEESSHSW 540 

EDSSQSPTPR PKKSYSGLRS PTRCVSEMLV IQHRERRERS RSRRKQHLLP PCVDEPELLF 600 

SEGPSTSRWA AELPFPADSS DPASQLSYSQ EVGGPFKTPI KETLPISSTP SKSVLPRTPE 660 

SWRLTPPAKV GGLDFSPVQT SQGASDPLPD PLGLMDLSTT PLQSAPPLES PQRLLSSEPL 720 

DLISVPFGNS SPSDIDVPKP GSPEPQVSGL AANRSLTEGL VLDTMNDSLS KILLDISFPG 780 
LDEDPLGPDN INWSQFIPEL Q 

Seq ID NO: 74 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 111-416 

1 11 21 31 41 51 

I I I I I I 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 

TCATCCTTCT ACTCGTGACG CTTCCCAGCT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 

CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCAGACGTG 180 

ATGACAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAACTTCC 240 

TTAGTGCCTG TGACAAAAAG GGCACAAATT ACCTCGCCGA TGTCTTTGAG AAAAAGGACA 300 

AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCA 360 

CAGACTACCA CAAGCAGAGC CATGGAGCAG CGCCCTGTTC CGGGGGCAGC CAGTGACCCA 420 
GCCCCACCAA TGGGCCTCCA GAGACCCCAG GAACAATAAA ATGTCTTCTC CCACCAGA 

Seq ID NO: 75 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MSNTQAERSI IGMIDMFHKY TRRDDKIEKP SLLTMMKENF PNFLSACDKK GTNYLADVFE 60 
KKDKNEDKKI DFSEFLSLLG DIATDYHKQS HGAAPCSGGS Q 



Seq ID NO» 76 dna sequence 
Nucleic Acid Accession #: Eos sequence 
45 Coding sequence: 111-416 

1 11 21 31 41 51 

I 1 ! 1 I I 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 60 

50 . TCATCCTTCT ACTCGTGACA CTTCCCAGTT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 120 

CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCGGACGTG 180 

ATGGCAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAATTTCC 240 

TCAGTGCCTG TGACAAAAAG GGCATACATT ACCTCGCCAC TGTCTTTGAG AAAAAGGACA 300 

AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCG 360 

CAGACTACCA CAAGCAGAGC CATGGAGCGG CGCCCTGTTC TGGGGGAAGC CAGTGATCCA 420
GCCCCACCAA GGGGCCTCCA GAGACCCCAG GAACAATAAG TGTCTCCTCC CACCAGA 






Seq ID NO: 77 Protein sequence: 
Protein Accession #: XP_048124.1

1 11 21 31 41 51

I I I I I I

MSNTQAERSI IGMIDMFHKY TGRDGKIEKP SLLTMMKENF PNFLSACDKK GIHYLATVFE 60 
KKDKNEDKKI DFSEFLSLLG DIAADYHKQS HGAAPCSGGS Q 



Seq ID NO: 78 DNA sequence 
Nucleic Acid Accession #: Z73678. 
Coding sequence: 253-2433 



1 11 21 31 41 51 

I I I I I I 

GGGGTGGTGC AGGGCAGGGG TGGTATATCC TGTCTGACGG AGGGCGGGCC TCGCCAGTGC 60 

CAGAGAGGGA CGAACCAGGG TGGAAGCGCC AGGAGCAGCT GCAGGGAGCC CTCACGCGGA 120 

CCTCGCACTC TATGGCCGTA GGGAGCCGCT GAGAGCGAGA AGAGCACGCT CCTGCCCGCC 180

CGCTGCACCG CACCTCGCCT CGCCTCTCTG CTCTCCTAGG CCCCGGCCGC GCGCCACCCG 240 

CCTCCCGCCA CCATGAACCA CTCGCCGCTC AAGACCGCCT TGGCGTACGA ATGCTTCCAG 300 

GACCAGGACA ACTCCACGTT GGCTTTGCCG TCGGACCAAA AGATGAAAAC AGGCACGTCT 360 

GGCAGGCAGC GCGTGCAGGA GCAGGTGATG ATGACCGTCA AGCGGCAGAA GTCCAAGTCT 420

TCOCAGTQGT CCACCCTGAG CCACTCCAAT CGAGGTTCCA TGTATGATGG CTTGGCTGAC 480

AATTACAACT ATGGGACCAC CAGCAGGAGC AGCTACTACT CCAAGTTCCA GGCAGGGAAT 540 

GGCTCATGGG GATATCCGAT CTACAATGGA ACCCTCAAGC GGGAGCCTGA CAACAGGCGC 600 

TTCAGCTCCT ACAGCCAGAT GGAGAACTGG AGCCGGCACT ACCCCCGGGG CAGCTGTAAC 660 

ACCACCGGCG CAGGCAGCGA CATCTGCTTC ATGCAGAAAA TCAAGGCGAG CCGCAGTGAG 720 

CCCGACCTCT ACTGTGACCC ACGGGGCACC CTGCGCAAGG GCACGCTGGG CAGCAAGGGC 780

CAGAAGACCA CCCAGAACCG CTACAGCTTT TACAGCACCT GCAGTGGTCA GAAGGCCATA 840 

AAGAAGTGCC CTGTGCGCCC GCCCTCTTGT GCCTCCAAGC AGGACCCTGT GTATATCCCG 900 
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CCCATCTCCT GCAACAAGGA CCTGTCCTTT GGCCACTCTA GGGCCAGCTC CAAGATCTGC 960 

AGTGAGGACA TCGAGTGCAG TGGGCTGACC ATCCCCAAGG CTGTGCAGTA CCTGAGCTCC 1020

CAGGATGAGA AGTACCAGGC CATTGGGGCC TATTACATCC AGCATACCTG CTTCCAGGAT 1080 

GAATCTGCCA AGCAACAGGT CTATCAGCTG GGAGGCATCT GCAAGCTGGT GGACCTCCTC 1140 

CGCAGCCCCA ACCAGAACGT CCAGCAGGCC GCGGCAGGGG CCCTGCGCAA CCTGGTGTTC 1200 

AGGAGCACCA CCAACAAGCT GGAGACCCGG AGGCAGAATG GGATCCGCGA GGCAGTCAGC 1260 

CTCCTGAGGA GAACCGGGAA CGCCGAGATC CAGAAGCAGC TGACTGGGCT GCTCTGGAAC 1320 

CTGTCTTCCA CTGACGAGCT GAAGGAGGAA CTCATTGCOG ACGCCCTGCC TGTTCTGGCC 1380 

GACCGCGTCA TCATTCCCTT CTCTGGCTGG TGCGATGGCA ATAGCAACAT GTCCCGGGAA 1440 

GTGGTGGACC CTGAGGTCTT CTTCAATGCC ACAGGCTGCT TGAGGAACCT GAGCTCGGCC 1500 

GATGCAGGCC GCCAGACCAT GCGTAACTAC TCAGGGCTCA TTGATTCCCT CATGGCCTAT 1560 

GTCCAGAACT GTGTAGCGGC CAGCOGCTGT GACGACAAGT CTGTGGAAAA CTGCATGTGT 1620 

GTTCTGCACA ACCTCTCCTA CCGCCTGGAC GCGGAGGTGC CCACCCGCTA CCGCCAGCTG 1680 

GAGTATAACG CCCGCAACGC CTACACCGAG AAGTCCTCCA CTGGCTGCTT CAGCAACAAG 1740 

AGCGACAAGA TGATGAACAA CAACTATGAC TGCCCCCTGC CTGAGGAAGA GACCAACCCC 1800 

AAGGGCAGCG GCTGGTTGTA CCATTCAGAT GCCATCOGCA CCTACCTGAA CCTCATGGGC 1860 

AAGAGCAAGA AAGATGCTAC CCTGGAGGCC TGTGCTGGTG CCCTGCAGAA CCTGACAGCC 1920 

AGCAAGGGGC TGATGTCCAG TGGCATGAGC CAGTTGATTG GGCTGAAGGA AAAGGGCCTG 1980 

CCACAAATTG CCOGCCTCCT GCAATCTGGC AACTCTGATG TGGTGOGGTC CGGAGCCTCC 2040 

CTCCTGAGCA ACATGTCCCG CCACCCTCTG CTGCACAGAG TGATGGGGAA CCAGGTGTTC 2100 

CCGGAGGTGA CCAGGCTCCT CACCAGCCAC ACTGGCAATA CCAGCAACTC CGAAGACATC 2160 

TTGTCCTCGG CCTGCTACAC TGTGAGGAAC CTGATGGCCT CGCAGCCACA ACTGGCCAAG 2220 

CAGTACTTCT CCAGCAGCAT GCTCAACAAC ATCATCAACC TGTGCCGAAG CAGTGCCTCA 2280 

CCCAAGGCCG CAGAAGCTGC CCGGCTTCTC CTGTCTGACA TGTGGTCCAG CAAGGAACTG 2340 

CAGGGTGTCC TCAGACAGCA AGGTTTCGAT AGGAACATGC TGGGAACCTT AGCTGGGGCC 2400 

AACAGCCTCA GGAACTTCAC CTCCCGATTC TAAGAAGAGA CTGTCCAAGC AAGTTAGGCT 2460 

TGCAGGAAGA TATGACCCAG CTGAGAAGCC CTCAGGCCTC GCTGGATGGG GTTTTCTGTC 2520

CATCCTGTGC AGTATTTGGG AAAGTTCACA AGAAACTGAG AAGAAACCTA AAAACTGTGG 2580 

ATAGTGGAAA GATTTTTAGA TTTTTTTTTT CCTTGGGGAA ACTGGCAGGC AATGGGGGTT 2640 

AGGGAGGTTG GGGCGGGGGG GGCTTTCTTG AGTTAAAGGG GCTTATATGT GATGTCAATA 2700 

TTTCTTCCTC TGAGAAATGG TATATATATG TGTCTAATGT AAGTGTGTGC ATGCATGTGC 2760 

GCGTGCATGT GTGTGTGTGT GAGTGTCTTA AAGCATAACC ACAAACTGCA AAAAGCTAGG 2820 

TAAGCTATTT TGTTGCAGCT CATAAGGTGG TGAAAAGGAC TCTCCTGTGT TTCTTACTCA 2880 

TAGGCAAGGA CAACATGTGC TTTTTGGTGA GCTGCTCATA ATTCCTGAAA TGTGTGGTGC 2940 

CAGGGCAAGG GGGCCATCAC TGCAGTCAGG CCCTCAGhGG AGTCCTGCAG GCTTCCTACC 3000 

AGTGGTCTCC AAGGGTGCAG GAGTAACTGG GGCTGGGCCA GCCTCCCCCC TTACAAGGCT 3060 

GCTTTCCACG AAGGGAGGTC TGGTGTATCT CATGGGAGAA TCTGGGGTGT CTGTAGTGTC 3120 

ACCCCTCCAG CAGCGCCACA AGGACTGAGG TTGGGTAGGT GTGAGGTTCC AGAGGACAGC 3180 

AGGACACTCT CGCATACTTT GCCAAATGAG GCCTGCTCAG AGGAGTAGGA GCTGAAAGAT 3240 

GGTGCCTTCC ACCCTCTTGG GCTGTGTGCC CATCAGAGCA GGCTCAGCCT GCAAAGGCCC 3300 

TGCATTCAGA GGTCTTGTAA TCTACTTGTT GCAGGAGAAA GAAGGTAAAA AATGATTTTT 3360 

TTAAGAAAAG CTATTTTATT GCAGCTCTTT CCCAAGAGCT GTTCTGGGAA TGGCTGGTCT 3420 

TCATATTCCC AGTGGAGAGG GGAACAAGTG GGGCTGGGCA TATACCTATT CCGGCTTCTA 3480 

GTGGGATGGA GTTGGGGTAT AGAAATTAAC CAGGAAGATG TTTCCACCAA GCCTGCTGTG 3540

AGTCAATTGA GGGAGTGTTT GGGTCCCAGG AGACTTGGAC GGGGGGAGTT TGGGTAGACT 3600 

AGGAAAGGAA AGTGCCATAT CAGGGTACCG GTACCGGCAA GCTCACATCT CAGCCAGGGG 3660 

CCATGCCCCA CTTCCCCTGA CCCCAGCTGT CTTGTCTCCA CTCTGTGAAA CCCACAGGGG 3720 

ATGTGATAAA CAGGGCTATT AGGGGTATCA GCCACGTCGA GCCCCCAGAC TCTGTGCACT 3780 

TCAGACCAGC AGCAGCAGGA GGGCTCCCGA GGGCCTTATG AGAAAACCTG TGTGGACATC 3840 

CCTTGGTGTA CACTAAGACA GAGCAGAGCC CAGCGCTCCC AAGCCTTCCT CCTTCCAGCT 3900 

TCTACCTCCA TGCTAGCATT GCTGGTGTTA GAGAGGAATT AACTTCCTGG TCTGTGCCCT 3960 

TCTCTAGAAG AATATAAGAT GCTCCTCCTC CTCACCCCTT CTCAGCCTCC TCCCAAGTCT 4020 

TCCTCTTCTG CACCACCCCC GAGTCCAAAC CCACCTCTTG CCCCAGCATT CAGGCTGGAA 4080 

AACACTGATG TGGACTCAGT ATGACAACTG AGATGGGGGA AGCCAGACAT GTGAGGAOGC 4140 

TGTCCTCCGA GAGGTGTCCC CGGCTGTTAG CCAGCTGTGC TGTGGTGCTG TGGGTCTGTC 4200 

ATACCCTCCC TTGCTTCTGT TCACACTGGG AGGCCCACTC CTGGCTCACC TCTCCCTCTC 4260 

AGGGACCCAC GTGGGAGCCT GGATCCCTGG ACTGTCCTGG GCATAGGTTT CAGGGGCCTC 4320 

CTTTGTTGTC ATCAGAACCC AGAGGAATTC TTCTCCTAAA AAATACGTAT GGCATACCAA 4380 

TCTGTOCGGG GCAGTGTCCT AAGCACTTAG ACTACATCAG GGAAGAACAC AGACCACATC 4440 

CCCGTCCTCA TGCGGCTTAT GTTTTCTGGA GGAAAGTGGA GACACAAGTC CTTGGCTTTA 4500 

GGGCTCCCCC GGCTGGGGGC TGTGCAGTCC GGTCAGGGCG GGAGGGGAAA TGCACCGCTG 4560 

CATGTGAACC TTACCAGCCC AGGCGGATGC CCCTTCCCCT TAGCACTACC CTGGCCTCCT 4620 

GCATCCCCTC GCCTCATGTT CCTCCCACCT TCAAAGAATG AAGAGCCCCA TGGGCCCAGC 4680 

CCCTGCCCTG GGAACCAGGC AGCCTTCCAG ACCTCAGGGG CTGAGGCAGA CTATTAGGGC 4740 

AGGGCTGACT TTGGTGACAC TGCCCATTCC CTCTCAGGCC AGCTCAGGTC AOCOGOGCCT 4800 

CTGACCCAGG CCTGTCACTT TGAGAGGGGC AAAACTGAGA GGGGCTTTTC CTAGAGAAAG 4860 

AGAACAAGGA GCTTGCCAGG CTTCATGTAG CCGACACACG TCTCAGGATT TTAAGTCCAC 4920 

ATTGGCCTCA CACTAGCCTA GGCCAATGCC CAAAATAAGG AGTTCCAATT TGGGGCCAAA 4980 

TGAGGAAGGA CACAGACTCT GCCCTGGGAT CTCCTGTGCT AGCGGCCAAT GACAAATCCA 5040 

GTCATTGGCC ACCAGCCACC TCTGCAGTGG GGACCACACT AGCAGCCCTG ACTCCACACT 5100 

CCTCCTGGGG ACCCAAGAGG CAGTGTTGCT GTCTGCGTGT CCACCTTGGA ATCTGGCTGA 5160 

ACTGGCTGGG AGGACCAAGA CTGCGGCTGG GGTGGGCAGG GAAGGGAAGC OGGGGGCTGC 5220 

TGTGAGGGAT CTTGGAGCTT CCCTGTAGCC CACCTTCCCC TTGCTTCATG TTTGTAGAGG 5280

AACCTTGTGC CGGCCAGGCC CAGTTTCCTT GTGTGArACA CTAATGTATT TGCTTTTTTT 5340 
GGAAATAGAG AAAATCAATA AATTGCTAGT GTTTCTTTGA AAAAAAAAA 

Seq ID NO: 79 Protein sequence: 
Protein Accession #: CAA98022.1 

1 11 21 31 41 51 

I I I I I I

MNHSPLKTAL AYECFQDQDN STLALPSDQK MKTGTSGRQR VQEQVMMTVK RQKSKSSQSS 60

TLSHSNRGSM YDGLADNYNY GTTSRSSYYS KFQAGNGSWG YPIYNGTLKR EPDNRRFSSY 120

SQMENWSRHY PRGSCNTTGA GSDICFMQKI KASRSEPDLY CDPRGTLRKG TLGSKGQKTT 180

QNRYSFYSTC SGQKAIKKCP VRPPSCASKQ DPVYIPPISC NKDLSFGHSR ASSKICSEDI 240

ECSGLTIPKA VQYLSSQDEK YQAIGAYYIQ HTCFQDESAK QQVYQLGGIC KLVDLLRSPN 300

QNVQQAAAGA LRNLVFRSTT NKLETRRQNG IREAVSLLRR TGNAEIQKQL TGLLWNLSST 360

DELKEELIAD ALPVLADRVI IPFSGWCDGN SNMSREVVDP EVFFNATGCL RNLSSADAGR 420

QTMRNYSGLI DSLMAYVQNC VAASRCDDKS VENCMCVLHN LSYRLDAEVP TRYRQLEYNA 480

RNAYTEKSST GCFSNKSDKM MNNNYDCPLP EEETNPKGSG WLYHSDAIRT YLNLMGKSKK 540

DATLEACAGA LQNLTASKGL MSSGMSQLIG LKEKGLPQIA RLLQSGNSDV VRSGASLLSN 600

MSRHPLLHRV MGNQVFPEVT RLLTSHTGNT SNSEDILSSA CYTVRNLMAS QPQLAKQYFS 660

SSMLNNIINL CRSSASPKAA EAARLLLSDM WSSKELQGVL RQQGFDRNML GTLAGANSLR 720
NFTSRF



PCT/US02/12476



Seq ID NO: 80 DNA sequence

Nucleic Acid Accession #: NM_006516.1 

Coding sequence: 180-1658 



TAGTCGCGGG 
QTCAGAGTCG 
CGCACGCCCG 
TGGAGCCCAG 
TTGGCTCCCT 
AGGAGTTCTA 
TCACCAOOCT 
TCTCTGTGGG 
TGCTGGCCTT 
TGCTGATCCT 
CCATGTATGT 
AGCTGGGCAT 
GCAACAAGGA 
GCATCGTGCT 
AGAACCGGGC 
TGCAGGAGAT 
AGCTGTTCCG 
CCCAGCAGCT 
CGGGGGTGCA 
CTGTCGTGTC 
TCGCTGGCAT 
TACCCTGGAT 
TGGGTCCTGG 
CAGCTGCCAT 
GCTTCCAGTA 
TGGTTCTGTT 
ATGAGATCGC 
AGCTGTTCCA 
GCCTGCTCCC 
AACCTGACAG 
CCAGAAGAAT
AAATCTATTC 
ATATCAGCCT 
GAGGGTGGAG 
CTGGACCTAT 
GAGGTGGCTA 
CATTAGGATT 
CCTGAGACCA 
OCOSOGTTCT 
GGGAGCCTGC 
TGCAAGATAT 
ATATCTGGAC 
TATAAATGGC 
TTTGGATGGG 
GACTCAGGAT 
TTTGATCCCT 
ATCACATATT 
AGGCTTGAAA 



11 
I 

TCCCCGAGTG 
CAGTGGGAGT 
TCGCCACCCG 
CAGCAAGAAG 
GCAGTTTGGC 
CAACCAGACA 
CTGGTCCCTC 
CCTTTTCGTT 
CGTGTCCGCC 
GGGCCGCTTC 
GGGTGAAGTG 

CCTGTGGCCC 
GCCCTTCTGC 
CAAGAGTGTG 
GAAGGAAGAG 
CTCCCCCGCC 
GTCTGGCATC 
GCAGCCTGTG 
GCTGTTTGTG 
GGCGGGTTGT 
GTCCTATCTG 
CCCCATCCCA 
TGCCGTTGCA 
TGTGGAGCAA 
CTTCATCTTC 
TTCCGGCTTC 
TCCCCTGGGG 
AGCAGCCCTA 
ATGTCAGCCG 
ATTCAGGACT 
AGACAAGCAA 
GAGTCTCCTG 
ACTAAGCCCT 
GTCCTAAGGA 
TGGCCACCCG 
TGCCCCTTCC 
GTTGGGAGCA 
AOTCTCCTTT 
AAACTCACTG 
TTATATATAT 
AAGCCAACTT 
TGGTTTTTAG 
AGTGAGACAG 
CCAGTCCCTT 
GTTACCCAGA 
TGATAGTTGG 
TCGCATTATT 



21 
I 

AGCACGCCAG 
CCCCGGACCG 
CGTACCCGGC 
CTGACGGGTC 
TACAACACTG 
TGGGTCCACC 
TCAGTGGCCA 
AACCGCTTTG 
GTGCTCATGG 
ATCATCGGTG 
TCACCCACAG 
ATCCTCATCG 
CTGCTGCTGA 
CCCGAGAGTC 
CTAAAGAAGC 
AGTCGGCAGA 
TACCGCCAGC 
AACGCTGTCT 
TATGCCACCA 
GTGGAGCGAG 
GCCATACTCA 
AGCATCGTGG 
TGGTTCATCG 
GGCTTCTCCA 
CTGTGTGGTC 
ACCTACTTCA 
CGGCAGGGGG
GCTGATTCCC 
AGGATCTCTC 
AGCCGGGCCT 
TAACGGCTCC 
CAGGTTTTAT 
TGCCCACATC 
GTCGAGACAC 
CACACTAATC 
TTCTGCTGGC 
CATCTCTTCC 
CTGGAGTGCA 
GCACTGAGGG 
CTCAAGAAGA 
TTTTGGTTGT 
GTAAATACAC 
AAACATGGTT 
AAGTAAGTGG 
ACACGTACCT 
GAATATATAC 
TGTTCAAAAA 
TTGAATGTGA 




GAACTATGAA 
CTGGATCTCC 
TACCCAACCA 
GGGAGGAGAG 
CCACACTATT 
CATGGAGACT 
CAATATTAAA 
CACCTCACTC 
TTGAAATGCT 
GGTTGCAACC 
CTCATCAGTG 
ATTCTTTATC 
AACACTAGTT 
AGGGAA 



Seq ID NO: 81 Protein sequence:
Protein Accession #: NP_006507.1



1 11 21 31 41 51

I I I I I I

MEPSSKKLTG RLMLAVGGAV LGSLQFGYNT GVINAPQKVI EEFYNQTWVH RYGESILPTT 60
LTTLWSLSVA IFSVGGMIGS FSVGLFVNRF GRKNSMLMMN LLAFVSAVLM GPSKLGKSFE 120
MLILGRFIIG VYCGLTTGFV PMYVGEVSPT APRGALGTLH QLGIVVGILI ACVFGLDSIM 180
GNKDLWPLLL SIIPIPALLQ CIVLPFCPES PRFLLINRNE ENRAKSVLKK LRGTADVTHD 240
LQEMKEESRQ MMREKKVTIL ELFRSPAYRQ PILIAVVLQL SQQLSGINAV FYYSTSIFEK 300
AGVQQPVYAT IGSGIVNTAP TVVSLFVVER AGRRTLHLIG LAGMAGCAIL MTIALALLEQ 360
LPWMSYLSIV AIFGPVAFFE VGPGPIPWFI VAELFSQGPR PAAIAVAGFS NWTSNFIVGM 420
CFQYVEQLCG PYVFIIFTVL LVLFFIFTYF KVPETKGRTF DEIASGFRQG GASQSDKTPE 480
ELFHPLGADS QV



Seq ID NO: 82 DNA sequence
Nucleic Acid Accession #: BC001291
Coding sequence: 44-541

1 11 21 31 41 51

I I I I I I

GGGGGCGCCG CGCGCTGACC CTCCCTGGGC ACCGCTGGGG ACGATGGCGC TGCTCGCCTT 60
GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 120
AGATCCAGAG GACTCCCAGC GAACGGACGA GGGTGACAAT AGAGTGTGGT GTCATGTTTG 180
TGAGAGAGAA AACACTTTCG AGTGCCAGAA CCCAAGGAGG TGCAAATGGA CAGAGCCATA 240
CTGCGTTATA GCGGCCGTGA AAATATTTCC ACGTTTTTTC ATGGTTGCGA AGCAGTGCTC 300
CGCTGGTTGT GCAGCGATGG AGAGACCCAA GCCAGAGGAG AAGCGGTTTC TCCTGGAAGA 360
GCCCATGCCC TTCTTTTACC TCAAGTGTTG TAAAATTCGC TACTGCAATT TAGAGGGGCC 420
ACCTATCAAC TCATCAGTGT TCAAAGAATA TGCTGGGAGC ATGGGTGAGA GCTGTGGTGG 480
GCTGTGGCTG GCCATCCTCC TGCTGCTGGC CTCCATTGCA GCCGGCCTCA GCCTGTCTTG 540
AGCCACGGGA CTGCCACAGA CTGAGCCTTC CGGAGCATGG ACTCGCTCCA GACCGTTGTC 600
ACCTGTTGCA TTAAACTTGT TTTCTGTTGA TTACCTCTTQ GTTTGACTTC CCAGGGTCTT 660
GGGATGGGAG AGTGGGGATC AGGTGCAGTT GGCTCTTAAC CCTCAAGGGT TCTTTAACTC 720
ACATTCAGAG GAAGTCCAGA TCTCCTGAGT AGTGATTTTG GTGACAAGTT TTTCTCTTTG 780
AAATCAAACC TTGTAACTCA TTTATTGCTG ATGGCCACTC TTTTCCTTGA CTCCCCTCTG 840
CCTCTGAGGG CTTCAGTATT GATGGGGAGG GAGGCCTAAG TACCACTCAT GGAGAGTATG 900
TGCTGAGATG CTTCCGACCT TTCAGGTGAC GCAGGAACAC TGGGGGAGTC TGAATGATTG 960
GGGTGAAGAC ATCCCTGGAG TGAAGGACTC CTCAGCATGG GGGGCAGTGG GGCACACGTT 1020
AGGGCTGCCC CCATTCCAGT GGTGGAGGCG CTGTGGATGG CTGCTTTTCC TCAACCTTTC 1080
CTACCAGATT CCAGGAGGCA GAAGATAACT AATTGTGTTG AAGAAACTTA GACTTCACCC 1140
ACCAGCTGGC ACAGGTGCAC AGATTCATAA ATTCCCACAC GTGTGTGTTC AACATCTGAA 1200
ACTTAGGCCA AGTAGAGAGC ATCAGGGTAA ATGGCGTTCA TTTCTCTGTT AAGATGCAGC 1260
CATCCATGGG GAGCTGAGAA ATCAGACTCA AAGTTCCACC AAAAACAAAT ACAAGGGGAC 1320
TTCAAAAGTT CACGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA

WO 02/086443

Seq ID NO: 83 Protein sequence:
Protein Accession #: AAH01291



TTTTTTTTTT 
TGCGCCATCT 
TTTTCTCTGG 

CGCCCGCCGC
AAGCAAGGCA

ATTCTTACAG 
CTCCTCACCT 
GAGCACAAAC 

GTCACGCCAG 
GAACACATAG 
GGAGCTCTAA 
GATGAGCCCA 
CTCTTGCAAC 
AGTCCCCTGA 
CCACCTCTCC 
GGATCAGTAT 
CTGTTTAGTC 
GAAGAAATGG 
CCAATGGCTA 
AACACGTCTA 
CCATTCCAGC 
TCCGCCCCTC 
ACGTTCAAAT 
TACAAGTGCA 
AAGACGCACA 
GCCAGCTCCC 
TCOGTGGTGG 
GAGGAGGAAG 
CTGACGGAGA 
CACGAGAACA 
GACGTCATGC 
GTCCTGGGCG 
TGCGACGAAG 
CGCGGCTGCT 
AGCCCCAGCT 
CCCCCGGCCA 
GCCTCCAGGC 
GCCTCCTCGT 
GAGCTGGACG 
ATTAGTGGTC 
GAGTACTGTG 
ACGGGCGAAA 
CTCACCAGGC 
TGTAAGATGC 
GATCGAGTGT 
CTCCCACCTG 
CCTGTAGGAT 
ACGAAGCTAA 
TTCTTTTTTC 




ATGGGATTCA 
CGAGAGAGGC 
CACCACCGAG 
CCCTGGCCAC 
TGGAGCCTCC 
GCCCACCGCT 
CAGGTAGCAA 
CTCCCTCCCA 
TTCAGAGCAA 
ACCTGTGCGA 
TGCACAAATC 
CGGAACCCGG 
CCAAGTTCAA 
AGGAGGACGA 
GCGAGAGGGT 
GCTCGCGGGG 
AGGGCATGGT 
AGAAGCATAA 
ACTOGGTGGC 
CCCOGGGCGA 
OGCTGAGCCC 
CGATGCCCAA 
AGCTCAAAGA 
CGGAGCACTC 
GAGGGATCTC 
CGGGCACGGG 
GGAAAGTCTT 
GGCCTTATAA 
ACATGAAAAC 
CTTTTAGCGT 
TGAATAATGA 
ACACCCCCTT 
TTTTTTCTAG 
GAATATGAGA 
TTTTTCCTTT 



21 
I 

AAAAAAAAGC 
TCTAATTTAT 
TTTCTAACCC 
GCCGCCGCCG 
CTTAAGCAAA 
AGACCACGGC 
CCAGATGAAC 
CAATGGCAGC 
GATGAAAAAA 
TTGTTTATCA 
TCTGCACTGG 
TGGGATGAGT 
ATGTACAACT 
CACTCATGGA 
TGGTATCCCT 
TATTGCAGAC 
TTCCGGCCTG 
ACATCACTTG 
CCATCACCCG 
CGCCATGGAT 
GTCCCCAGGC 
GCCGCCCTTC 
GCCCCCGGTC 
CCTGGTGGTG 
CCACGCGTGC 
GTCCCCCATG 
CACCAGCGAC 
GAGCGAGAAC 
CGAGGAAGAG 
GGACTACGGC 
CGCGGTCGTG 
GCTCAGCTCC 
GCGCGGCCAC 
CGGCGAGTCG 
GTOGGCCTCG 
CTTCTCTAAG 
CACGGAGAAC 
TCCCTTCCTT 
CTCGGAGAAC 
GGGGCGCAGC 
CAGGCCCAGC 
CAAGAACTGT 
ATGCGAGCTG 
GCATGGCCAG 
GTACAGTACC 
TATAAAAACT 
TTTCACCACT 
TCCCATGTGA 
GTGCTTGTCA 
TTTTTTTTTT 



31 

I 

CATGACGGCT 
TTTGGATGTC 
GGCTCTCCCG 
CCCGCCCCGC 
CGGGAATTCT 
CCGTTGGGAG 
TTCCCATTGG 
CTCTGCTTAG 
GCATCCAATC 
ACGTCATCTA 
AGGGGCCTCT 
GCAGAATATG 
TGCAAACAGC 
TTAAGAATCT 
TCAGGACTAG 
AATAACCCCT 
GCAGAAGGGC 
GACCCCCACC 
AGTGCCTTTG 
TTCTCTAGGA 
CGGCCCAGCC 
CTGGCGACGC 
AAGTCCAAGT 
CACCGGCGCA 
ACCCAGGCCA 
ACGGTCAAGT 
TTGGTGGGCA 
GACCCCAACC 
GAAGAAGAGG 
TTCGGGCTGA 
GGCGTGGGCG 
ATGCAGCACT 
CTGGCCGAGG 
GACCGCATAG 
GGGGGCCTGT 
CGCATCAAGC 
GTGTACTCGC 
AGCTTCGGAG 
GGGAGCTTGC 
GGCACGGGAA 
TCAAAAGAGG 
AGCAATCTCA 
TGCAACTATG 
GTGGGGAAGG 
CTGGAGAAAC 
GAATAGAGGT 



TTTAAACAAA 
CCAGCACACC 
TCCTTTATGT 



41 
I 

CTCCCACAAT 
AAAAGGCACT 
ATGTGAACCG 
AGCCCACCAT 
CGCCCGAGCC 
CTCCAGAAGG 
GGGACATTCT 
AAAAAGCTGT 
CCGTGGAGGT 
GAAGAATTTG 
CCTCCCCTOG 
CCCCGCAGGG 
CATTCACCAG 
ACTTAGAAAG 
GTGCAGAATG 
TTAACCTGCT 
GCTTTCCACC 
GCATAGAGCG 
ACAGGGTGCT 
GACTTAGAGA 
CTATGCAAAG 
CCCCCCTCCC 
CATGCGAGTT 
GCCACACGGG 
GCAAGCTGAA 
CCGACGACGG 
GCGCCAGCAG 
TGATCCCGGA 
AGGAAGAGGA 
GCCTGGAGGC 
ACGAGAGCCQ 
TCAGCGAGGC 
CCGAGGGCCA 
ACGATGGCAC 
CCAAAAAGCT 
TCGAGAAGGA 
AGTGGCTCGC 
ACTCCAGACA 
GCTTCTCCAC 
GTGGAGGGAG 
GCAGACGCAG 
CTGTCCACAG 
CCTGTGCCCA 
ACGTTTACAA 
ACATGAAAAA 
ATATTAATAC 
ATCGCCCTCC 
CAAACAAACA 
TGTTTTTTTT 
TCTCACCGTT 



51 
I 

TCATCTTCCC 

GATGAAGATA 

AGCCGTCGTC 

GTCTCGCCGC 

TCTTGAAGCC 

GGATCATGAC 

TATTTTTATC 

GGATAAGCCA 

TGGCATCCAG

CCCCAAACAG 

TTCTGCACAT 

TATTTGTAAA 

TGCATGGTTT 

CGAACACGGA 

TCCTTCCCAG 

AAGAATACCA 

CACTCCCCCC 

CCTGGGGGCG 

GCGGTTGAAT 

GCTGGCAGGG 

GTTACTGCAA 

TCCTCTGCAA 

CTGCGGCAAG 

CGAGAAGCCC 

GCGCCACATG 

TCTCTCCACC 

CGCGCTCAAG 

GAACGGGGAC 

GGAGGAGGAG 

GGCGCGCCAC 

CGCCCTGCCC 

CTTCCACCAG 

CAGGGACACT 

TGTTAATGGC 

GCTGCTGGGC 

GTTOGAOCTG 

CGGCTACGCG 

ATCQCCTTTT 

ACCGCCCGGG 

CACGCCCCAT 

CGACACTTGT 

GAGAAGCCAC 

GAGTAGCAAG 

ATGTGAAATT 

ATGGCACAGT 

CCCTCOCTCA 

AGCCCCACTC 

AACAGAAGTA 

CTTTTTCTTT 

TGAATGCATG 






1 11 21 31 41 51

I I I I I I

MALLALLLVV ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TFECQNPRRC 60
KWTEPYCVIA AVKIFPRFFM VAKQCSAGCA AMERPKPEEK RFLLEEPMPF FYLKCCKIRY 120
CNLEGPPINS SVFKEYAGSM GESCGGLWLA ILLLLASIAA GLSLS

Seq ID NO: 84 DNA sequence 

Nucleic Acid Accession #: NM_022893.1

Coding sequence: 229-2726 






60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 






ATCTGTATGG 
AATTTACATG 
TATTTTAAAT 
TGAGAGAAAA 
CAT6TACAGT 
AGACAGAATA 
TTAAATGTAT 

CAGAACAAGT
CACATCGATG
TAATAAACCT
TAAAAAGATG

AACAAAACTG 
TGGGCTGTTT 
TGCAAAAGCC 
TGCTTTAATA 
GGTTGTCAAG 
AGAGGCTGAC 
CAACGTGGTA 
TGATGATTAA 
GAGAAAGAAT 
TTCTTTTCCA 
TTTTATTTTT 
ATAGCACTTG 
GAGTATAAAA 
TACAGGTCTA 
CATTGAGGAG 
TATTGAGCTT 
TTAACATAGA 
CTTTTTTAAA 
GGGGAACTGC 
AAAAATGGTA 
AAAAACATAC 
ACTCATGTTG 
GTATTTGAAT 
TTTTTAACTG 
AAAAAAGACA 
ATTTGTATGC 
GAAGCCATAT 
GGGGTTGTAC 
AAGCTAATAA 
ACATTCCAGC 
AAAAAAAAAC 
TTAGATTGGA 
ACTTTTTTGT 
AAAGCTGTTT 
GCTTATGTGC 
TATTTTTTTT 
TTTCTGGTCT 
CTGCATTAAC 
A 



GGCAATACTA 
TTGTGTATGT 
TGTCCCTAAT 
AGAGAGAGAG 
TTTATTTTAT 
GATAGCACTG 
CAATTGGAAA 
TTATTCTATC 
GTTCTTTCTA 
TAGGAACTAG 
ACCCATTATG 
ATTATACCAG 
TGCCCAAAGT 
CTGGAACGCA 
ATGTCTTTTT 
TGGACAATCA 
AATAAGGTTT 
CTATTTGCCA 
CTATGAATTC 
TTTAAGAGTA 
TACACTGTGT 
TTAATTTTTT 
ACTCTGCCTO 
TAAACCTGCT 
TACTTAAGGG 
CATTTTTTAA 
ACTTACTTGG 
AATGAATGAT 
TATAAATGTT 
ATTTTAGGGT 
GTGGAAATTC 
ATTGGGGAAA 
ATTCCTATGC 
TAAATGTTCA 
TTGCTTGTTC 
GGCTGCCACA 
TTCAAAAAAA 
AATGGCGGTT 
ATATCCTTTT 
CGTTGGTTTG 
ATCTTACCTT 
CAATGTTTTG 
AAGAATTTCA 
AAATGGCAAT 
GTOGTAACTT 
CCTGTTCAAA 
TTTATTTAGA 
TTGTTAAGTT 
CTGTTCATAC 



TTGCATTTTA 
TTTTTTTTAA 
TCGTTGCTGA 
AGAGAAAAAG 
AACGTGAGGA 
AAAAAAAATC 
GGAAGAAAAA 
ATTTGGTTTT 
CTTGTTATAA 
CATATAATTC 
TCCTAGTTAA 
TATAAAAGCT 
TTTATTTTTT 
ATTAAATACA 
AAAAATACTG 
AATGATAAAC 
GACAGAGGGG 
TTTAAAACTA 
TTAAGACTTG 
GCAGTATATA 
GCTATTTGTG 
CTTTTTTATT 
TGATATCTGT 
CAGATAGGAC 
AAAAGTTGCA 
AAACTAAAAA 
ACGCAACATT 
TGCTTTCGCT 
AAGAAAAATT 
TCCATTGTCT 
TATGCCTTGG 
GGTTTAAGAT 
TAAAATACAT 
TCTAGTGTTA 
TCTTAAAAGG 
GTATATTTTT 
AAAAAAAGAG 
TGGGGGAGCC 
TTGTTCCTTT 
TTATGTAGTG 
CATATGCAGT 
CAGTTTTTTT 
TATGCAAAGC 
GCAGAATATT 
GAAATTTTAT 
ACAGAGGCAC 
TGACCAAAGG 
CTATTGGAAA 
ACCCATTTTG 



GGCAAACTTT 
ACTTAGACAG 
GCAAACATGT 
ACCATGCTGC 
GGAAAAACAG 
TCTATGAGCT 
AGGCCTTGAA 
AAAATATGAG 
ACTTGTAGCT 
TATATTGTAT 
TCATCATTTT 
ACTTTGCTCC 
TTAAACAATG 
CTAGTAAGGA 
GCACCAAAAG 
TTTAAGACCT 
AACAGAAGAA 
GAACAGGTAT 
CATTTAAATG 
TGTCTGTGCT 
TTAACATGGA 
AAGCTAGCAT 
ATCTTTTCTC 
AATTAAGTGC 
AGAATGCTGA 
AAAAAAAACT 
GCAAGCGCTG 
TCTACAGTGC 
TTTTTTAAAA 

ATACACACCG 
TATATAGTAC 
TTATGGTCTT 
GGCACTATAQ 
TATCAATGTA 
TTAATTTGGC 
AGAAACAAAA 
TGCTAGAATG 
TTCCTGCTGC 
TGCTTTTTGT 
AAAAGAAAGA 
CATTGCCAAA 
ATATTAAAGA 
TTGTTATTGG 
CTTTTACTAT 
TTAATTTGAT 
TCATTACAAC 
AACCACTGTC 
TCCCTTTATT 



GAGCCTTTCT 
CATGTATGGT 
TGCTGTTTCC 
ATACATTCTG 
TCTTTGGATT 
AAATGTCTGT 
TTGACAAATT 
TGCCTTGGAT 
TAATTCAGCA 
TTCTCACAAC 
TCCTTTAGTT 
TGGTGAGAGC 
ATTAAATTGA 
GTTCATTTTA 
AAATAGATCC 
TGTATACCAT 
AATAATATGA 
ATAAGCTAAT 
TGACATTCTT 
GCCTAAAAGT 
AGAGGATTCA 
CTGCCCCAGT 
TAATCAGAGA 
ACTGTACAAT 
AAAAAAATTG 
TTGCCAGCCA 
TGAATGGAAA 
AAGGATTTTT 
AACACTTCAT 
ACAAGACTTG 
CTCTTCAGGT 
TTAAATATAG 
TTTTCTGTAT 
TATTTATATT 
CCTTTTTTGG 
AGGATAATAT 
AAGTGTGACA 
TCACATGGAT 
CATACTGTAT 
CCCTTTCCTT 
AAGAAAAAAA 
AACTAAATGG 
GAAAGCCCGC 
CCTTTTCTAT 
GGGAGTCACT 
CTTTTATTTT 
CTGGCTTTTT 
TGTGTTTTTT 
QAAAAAATAA 



CTTGTGCAAT 
ATGTTATGGC 
AGTTCOGTTC 
TAATACATAT 
AACCCTCTAT 
CTCTAAAGGG 
AACAGAAAAA 
CTATTAAAAC 
TTGGGTGAGG 
AATGGCTACC 
TAATTTTATA 
TTAAAAGAAA 
ATGTGTAATG 
TGAAGATATT 
AGATCTACTT 
ATTGAAAGGA 
TTTATTAGCA 
ATTGATACAA 
AAAAAAAGAA 
TGTACTTCAT 
TTGTTTTTAT 
TGGTGTTCAA 
TACAGAGGTT 
TTTCGCAGTT 
AACACAATCT 
TTTACTTGAC 
CAGAATACAC 
TTGTACAAAA 
TATGTTTAGG 
TTATCCATTT 
TGTAAAAAAA 
GAAAATOCAC 
TTCTAGAATG 
GAAGCTTGTA 
TAGTGGAAAA 
AGTGCAAATT 
TTACAGATGA 
GGCTGTCATA 
GCAGTACTGC 
CTATCACCCT 
AAGGAAAAAA 
TGCTTTATAT 
TTTAGTCAAT 
TCCTGTAATG 
ATTTATTATT 
TCTTTGTTTT 
ATTGTATTTG 
TGGCAGTTGT 
AAAAAATTAA 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580
5640 
5700 
5760 
5820 
5880 
5940 



Seq ID NO: 85 Protein sequence: 
Protein Accession #: NP_075044.1



I 

MSRRKQGKPQ 
blFIEHKRXQ 
CPKQERIADK 
SAWFLLQHAQ 
LRIPGSVSRE 
LRLNPMAMEP 
PPLQSAPPPS 
KRHMKTHMHK 



RALPDVMQGM 
TVNGRGCSPG 
AGYAASRQLK 
STPHISGPGT 
QSSKLTRHMK 



11 

I 

HLSKREPSPE 
CNGSIiCLEKA 
LLHWRGLSSP 
NTHGLRIYLE 
ASGLAEGRFP 
PAMDPSRRI»R 
QPPVKSKSCE 
SSPMTVKSDD 
DEEEEEEEEE 
VLSSMQHFSB 
ESASGGLSKK 
DPFLSFGDSR 
GRPSSKEGRR 
THGQVGKDVY 



21 
I 

PLEAILTDDE 
VDKPPSPSPI 
RSAHGALIPT 
SEHGSPLTPR 
PTPPLFSPPP 
ELAGNTSSPP 
PCGKTFKFQS 
GLSTASSPEP 



AFHQVLGEKE 
LLLGSPSSIiS 
QSPPASSSEH 
SDTCEYCGKV 
KCEICKMPPS 



31 
I 

PDHGPLGAPE 
EMKKASNPVE 
PGMSAEYAPQ 
VGIPSGLGAE 
RHHLDPHRIE 
LSPGRPSPMQ 
NLWHRRSHT 
GTSDLVGSAS 
VDYGFGLSLE 
KRGHLABAEG 
PFSKRIKLBK 
SSENGSLRFS 
PKNCSNLTVH 
VYSTLEKHMK 



41 

I 

GDHDLLTCGQ 
VGIQVTPEDD 
GICKDEPSSY 
CPSQPPLHGI 
RLGAEEMALA 
RIjLQPFQPGS 
GEKPYKCNLC 
SALKSWAKF 
AARHHENSSR 
HRDTCDEDSV 
EPDLPPATMP 
TPPGELDGGI 
RRSHTGERPY 
KWHSDRVLNN 



51 



I 

CQMNFPLGDI 
DCLSTSSRRI 
TCTTCKQPFT 
HIADNNPFNL 
THHPSAPDRV 
KPPFLATPPL 
DHACTQASKL 
KSENDPNLIP 
GAWGVGDES 
AGESORIDDG 
NTENVYSQWL 
SGRSGTGSGG 
KCELCNYACA 
DIKTE 



Seq ID NO: 86 DNA sequence 

Nucleic Acid Accession #: XM_035292.2

Coding sequence: 53-1576 



GCTCGCTGGG 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATCGCAGTAC 



11 
I 

CCGCGGCTCC 
AAGCGGCGCG 
CTGGCCGCCA 
CAGCGGAACA 
GGCATCTTCG 
GTGGTGTGGG 
GGCACCACCA 
CTGCCCGCCT 
RTCGTGGCCC 



21 
I 

CGGGTGTCCC 
CGCTAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCCCAC 
CCGCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 



31 

I 

AGGCCCGGCC 
GCCGGCGGCC 
CGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTGGATCGAG 
CACCTACCTG 



41 
I 

GGTGCGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GCCATCATCG 
AAGGAGGCAG 
ATCGTGGGCG 
TACGCCTAGA 
CTGCTCATCA 
CTCAAGCCGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



51 
I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 
GCTCGCCGGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 






CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 
CCTCTTCCTG 
CATCATCCTC 
GTGGCTCCTC 
CCCCCAGGAG 



CCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATCAACC 
CTGGTGTACG 
TCGTCCGAGG 
ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCTTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCCGTCT 
AGCGGGCTGC 
CAGGGCATCT 
ACATAGCCAG 



CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
COGTGGCCGT 
TCGTGGGCCT 
TCTTCGTGGG 
TCCTCACCCC 
CCAAGGACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 
GAGGCCGAGT 



CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 
GTCCTGCTTC 
GTCCCGGGAA 
CGTGCCGTCC 
CTTCTCCGTC 
CGGCATGATC 
GGCCCTGCCT 
GACACCCGTG 
CTTCGGGGTC 
CGTCCTGTGT 
GGCTGCCGGA 



CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT 
GCCATCATCA 
TTCACCACCC 
AACTATCACC 
GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGCGCC 
GTGTTCTTCA 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 
GGAGCATGC 



TGCTGCTCAC 
TTGCCGCCGC 
GGAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



Seq ID NO: 87 Protein sequence: 
Protein Accession #: XP_035292.2



MAGAGPKRRA 
GTIIGSGIFV 
LEVYGSLPAF 
LLTAVNCYSV 
GNIVLALYSG 
STEQMLSSEA 
SILSMIHPQL 
RKPELERPIK 
KPKWLLQGIF 



11 
I 

LAAPAAEEKE 
TPTGVLKEAG 
LKLWIELLII 
KAATRVQDAF 
LFAYGGWNYL 
VAVDFGNYHL 
LTPVPSLVFT 
VNLALPVFFI 
STTVLCQKLM 



21 

I 

EAREKMLAAK 
SPGLALWWA 
RPSSQYIVAL 
AAAKLLALAL 
NFVTEEMINP 
GVMSWIIPVF 
CVMTLLYAFS 
LACLFLIAVS 
QWPQBT 



31 
I 

SADGSAPAGB 
ACGVFSIVGA 
VFATYLLKPL 
IILLGFVQIG 
YRKLPLAIII 
VGLSCFGSVN 
KDIFSVINFF 
FWKTPVECGI 



41 

I 

GEGVTLQRNI 
LCYAELGTTI 
FPTCPVPEBA 
KGDVSNLDPN 
SLPIVTLVYV 
GSLFTSSRLF 
SFFNWLCVAL 
GFTIILSGLP 



51 
I 

TLLNGVAI IV 
SKSGGDYAYM 
AKLVACLCVL 
FSFEGTKLDV 
LTNLAYFTTL 
FVGSREGHLP 
AIIGMIWLRH 
VYFFGVWWXN 



Seq ID NO: 88 DNA sequence

Nucleic Acid Accession #: NM_005268.

Coding sequence: 168-989 



TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTGGGTTTC 



11 

I 

AAGAATTCGC 
AAATTCAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG 
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGGCAA 
CCACCTGCCC 
CTTTTCTAGA 



21 
I 

GGCCGCGTCG 
TGCTTGCTGA 
CAGTAGCAGC 
GGGGTCAACA 
CGCGTGCTGG 
GACTGCAATA 
TCCCATGTGC 
GTCATGCACG 
AACAGTGGGC 
GTCTGCAGCC 
TTCTACCCCA 
ATAGTGGACT 
GCCACAGCTG 
AGATGCCACG 
CACGGTACCA 
GGCTCAGACA 
ATCTTGTGAG 
CATCTCTCAT 
GAGAGAGGAT 
CAGCTCGACG 
ATGGAAATAG 



31 
I 

ACACGGGCTT 
GTCCTATTGC 
TGACGCGTGG 
AGTACTCCAC 
TGTACCTGGT 
CTCGCCAGCC 
GCXTTCrOGGC 
TGGCCTACCG 
GCCTCTACCT 
TAGTGTTCAA 
AATATATCCT 
GCTTCATCTC 
CCATCTGCAT 
AGTGCCTGGC 
CCTCTTCCTG 
GTCATCCTCC 
GGGCTGCCTG 
AGGTGCAACC 
TCAGACGCTC 
GCACTGGGCC 
TGAGGGCCAA 



41 

I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTTCAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCCCTCA 
CCTGCTCAAC 
AGCAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



51 
I 

GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAGAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
GACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



Seq ID NO: 89 Protein sequence:
Protein Accession #: NP_005259.1



CGGGCGAAGC 
CGCCCTGCTG 
CCCGGGGAGC 
CGGCGTGGGT 
GCCCTGCAAC 
TGCGTGTGAT 



11 
I 

AGCGCGGGCA 
GCGCTCACCT 
GAGTGCGCTG 
TTCCGCGAGG 
TGGAAGAAGG 
GGGGGCACAG 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



1 11 21 31 41 51

I I I I I I

MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL VVMHVAYREV QEKRHREAHG ENSGRLYLNP 120
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP VVKCHADPCP NIVDCFISKP 180
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL



Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession #: NM_002391.

Coding sequence: 26-457 



60 
120 
180 
240 
300 
360 
420 
480



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180
240 



21 31 41 51

I I I I

GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 




CAATGCTCAO TGCCAGGAGA CCATCCGOGT CACCAAGCCC 
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG 
GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT 
CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA 
ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG 
TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCOGCTT 
ATTACTAAGA AACACATCAA ATAAACTGAC 
TAATAT 

Seq ID NO: 91 Protein sequence: 
Protein Accession #: NP_002382.1






TGCACCCCCA AGACCAAAGC 
CCAAGCCTGG ATGCCAAGGA 
CCCAGGCCCG AGATGTGACC 
TGCCCTGCCT TGTCCCTCTC 
ACAAGGGATT CTGGGAAGCT 
TTGTTCTTCC CCACAATTCC 
CAATAAAAGC TCTTCTTTTT 



1 11 21 31 41 51

I I I I I I

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120
RVTKPCTPKT KAKAKAKKGK GKD



Seq ID NO: 92 DNA sequence 

Nucleic Acid Accession #: NM_005130. 

Coding sequence: 98-802 



CTCTACCTGA 
CGTGTGCTCA 
GCTCTCCTTC 
GAATGGACTT 
TAAGCAGAAA 
CAGATGGGCT 
GGACCATGAA 
TGAGAGAGTC 
ATATTCCAAG
TAAGCTAGTC 
GTCCCCCAGG 
GACCATGGCC 
GACTGCCCTG 
AGTGCAGGAC 
TGTCGTAAGT 
TGTGCTTAGT 
TGGAATTTGC 
TTCCATGGCC 
GAGTGATAAT 
TTTTTCAAAA 



11 
I 

CACAGCTGCA 
GAACAAGGTG 
CTCCTACTGG 
CACAGCAAAG 
AGCAGGCCCG 
GCTACTGAGC 
TTTTCCTGTG 
TATTGGAAAC 
ACAGCTGTGA 
AGCTCCACTC 
GAGCACATCA 
ACCAAAGCTC 
GAGTTCTGTG 
ACGTCATGCT 
CCCTCTGTAT 
GAGTGCAACG 
CTTATTTTTC 
CACACAGCTA 
TTCAGTGCAA 
AAAAAAAAAA 



21 

I 

GCCTGCAATT 
AACGCCCAGC 
CTGCTCAGGT 
TGGTCTCAGA 
GGAACAAAGG 
AGGAGGAGGG 
TCTTTGCTGG 
AAGTTGCCCG 
AAACCAGAGT 
TATTTGGGAA 
AGGGCAAAGA 
CCGAGTGTGT 
GAGAGACTTG 
AATGAGGTCA 
ACTTTAAAGC 
AAATATTTAA 
TTGGATGCGA 
TGTGTTTGAG 
CGAACTTTCT 
AAA 



31 
I 

CACTCCCACT 
TGCAGCCATG 
GCTCCTGGTG 
ACAAAAGGAC 
CAAGTTTGTC 
CATCTCTCTC 
CAATCCAACC 
GAATCTGCGC 
GTGCAGAAAG 
CACAAAGCCC 
GACCACCCCC 
GGAGGACCCA 
GAGCTCTCTC 
AAAGAGAACG 
TCTCTACAGT 
ACAAGTTTTG 
TGTTCAGAGG 
CAGCGAAGAG 
GCTGAATTAA 



41 

I 

GCCTGGGATT 
AAGATCTGTA 
GAGGGGAAAA 
ACTCTGGGCA 
ACCAAAGACC 
AAGGTTGAGT 
TCATGCCTAA 
TCACAGAAAG 
GATTTTCCAG 
AGGAAGGAGA 
TCTAGCCTAG 
GATATGGCAA 
TGCACATTCT 
GGTTCCTTTA 
CCCCCCAAAA 
TATTTTTTGC 
CTGTTTCCTG 
TCTTTGAGCT 
TGGTAATAAA 



Seq ID NO: 93 Protein sequence: 
Protein Accession #: NP_005121.1 

1 11 21 31 41 51

I I I I I I

MKICSLTLLS FLLIAAQVLL VEGKKKVKNG LHSKVVSEQK DTLGNTQIKQ KSRPGNKGKP 60
VTKDQANCRW AATEQEEGIS LKVECTQLDH EFSCVFAGNP TSCLKLKDER VYWKQVARNL 120
RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REHIKGKETT 180
PSSLAVTQTM ATKAPECVED PDMANQRKTA LEFCGETWSS LCTFFLSIVQ DTSC

Seq ID NO: 94 DNA sequence 

Nucleic Acid Accession #: NM_012101 

Coding sequence: 125-1891 



1 
I 

CTCCTCACAG 
TGCCAGAAAG 
TGCGATGGAA 
CCGGAGCCCG 
TGCCAAGACC 
CCTGAAGCCA 
CATCCAGTTT 
GGAAGGCAAG 
TACCTTTGCC 
GGTGTCCATC 
CCTTTTTTCA 
CAAGCAGAAG 
CAAGCCCCAC 
CTTTGAGGCC 
CCAGACCTGC 
AGTGGAGGAG 
GCTCAAGATC 
CAAGAGCTTC 
GGACCTGGAG 
TGTGGACCAA 
GGACAAGCAG 
ATTTGGTGCA 
GCTGGAGGGG 
ATGCATGCGC 
GAACCACATG 



11 
I 

GTGTGTCTCT 
GTCACCTATC 
GCTGCAGATG 
TCGGGCCCCA 
ACCAACGGGC 
GGGGAAGGTA 
GTCGAGTCCG 
AGGTCGCCGT 
GAAAAGGGCG 
ATGGAGCCCG 
CGGTCCAAGT 
GCGGTCAAGT 
CTGGAGGGCG 
CGCAAGTGTC 
ATCTGCTACC 
GCCAAGGCCG 
ATTGAGATTG 
ACCACCAATG 
AAGCAAAAGG 
GTGAAGGTGA 
ACCCGGGAGC 
TTGATGAGCA 
GAGGGCCTGG 
CACGTTGAGA 
GAGAACGGTG 



21 

I 

AGTCCTCGTG 
CTGAACCCCA 
CCTCCAGGAG 
GTGGCAGCCT 
ACGGCGGGGA 
GGAGCGCCCT 
GGGACGACAA 
ACGCAGGGCT 
ACGTGCGCAA 
GGGAGACCCG 
CCGGCTCCGA 
CCTGCCTGGT 
CCGCCTTCCG 
CCGTGCATGG 
TTTGCATGTT 
AGAAGGAGAC 
AGGATGAAGC 
AGAAGGCCAT 
AGGAAGTGAG 
TCATGGATGC 
AGCTGCATAG 
ATTACTCTCT 
GACAGTCACT 
AGATGTGCAA 
GTGACCATCG 



31 
I 

GTTGCCTGCC 
GCAAGCCTGA 
CAACGGGTCG 
GGAGAATGGC 
GGCAGCTGAG 
GTTCGCGGGC 
GAACTCCAAC 
CCAGCTGGGG 
GTCCATTTTC 
GCGGAACAGC 
GGAGGTGCTG 
GTGCCAGGCC 
AGACCACCAG 
CAAGACGATG 
CCAGGAGCAC 
GGAGCTGTCA 
TGAGAAGTGG 
CCTGGAGCAG 
GGCTGCGCTG 
TCTGGATGAG 
CATCAGCGAC 
CCCCCCACCC 
AGGCAACTTC 
GGCGGACCTG 
CTATGTGAAC 



41 

I 

CCACTCCCTG 
AACAGCTCAG 
AGCCCAGAAG 
ACCAAGGCTG 
GGCAAGAGCC 
AATGAGTGGC 
TACTTCAGCA 
GCTGCCAAGA 
TCGGAGTCCC 
TACCCCCGGG 
TGOGACTCCT 
TCCTTCTGCG 
CTGCTCGAGC 
GAGCTCTTCT 
AAGAATCATA 
CTGCAAAAGG 
CAGAAGGAGA 
AACTTCCGGG 
GAGCAGCGGG 
AGAGCCAAGG 
TCTGTGTTGT 
CTGCCCACCT 
AAGGACGACC 
AGCCGTAACT 
AACTACACGA 



420 
480 
540 
600 
660 
720 
780 



60 
120 



51 
I 

GCACTGGATC 
GCCTCACCCT 
AAAAAGTGAA 
ACACCCAGAT 
AAGCCAACTG 
GCACTCAATT 
AGCTCAAGGA 
ACATCTGTAG 
AATCCAGTCT 
AAACAGAGAT 
CAGTGACCCA 
ACCAGAGGAA 
TCCTCAGCAT 
AGAGATGTCA 
TATGAACTTT 
TTTTGTGTTT 
CAGCATGTAT 
GAATGAGCCA 
ACTCTGGGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 



51 

I 

CCGAGACGCC 
CCAAGCACCC 
CCAGGGATGC 
ACGGCAAGGA 
TGGGCAGCGC 
GGCGACCCAT 
TGGACTCTAT 
AGCCACCCGT 
GGAAGCCCAC 
CCGACACGGG 
GCATCGGCAA 
AGCTGCATCT 
CCATCCGGGA 
GCCAGACCGA 
GCACCGTGAC 
AGCAGCTGCA 
AGGACCGCAT 
ACCTGGTGCG 
AGCAGGATGC 
TGCTGCATGA 
TTCTGCAGGA 
ATCATGTCCT 
TGCTCAATGT 
TCATTGAGAG 
ACAGCTTCGG 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 






GGGTGAGTGG 
TGGGGTCCGG 
GAAGAATTTC 
CTCCTCCAGC 
CTCCCTGAAA 
TTGGAAATCT 
CAACGGGATT 
CCCCTGCTCT 
TGGGAGGGAG 
GTTCCGGCCT 
TGACCTCAGA 
TAGGTTGGGG 
CAQTGAGTAC 
CTATAGACGT 
ACAGCCACCC 
GTGCTCTCTC 
CTGCAGATGG 
ACAGCCACTT 
TAGCCAAGAT 
TCCACCCATG 
TTCAGTCTAC 
GTGCCTTACA 
TGATTACCCC 
AGCAGCACAG 
GATGGGCCAG 
ATAAACCATT 



AGTGCACCGG 
ACATCATACC 
AACAATCTCT 
ATTCAGAACT 
GGCTATCCCT 
GGCAAGCAGA 
GGGTCCAACG 
TCCTCCTGAC 
CCTGGTCCTG 
CTCCGACTTC 
TGGTCACCAT 
CCTGCCCTAA 
CCGCATGGTA 
TTCTCTCCAA 
ATCTCCCATT 
TCGTCCTACC 
AAACCTCTCA 
TGAGTCTGTG 
ATTCCTCTGT 
CAAATAGCTA 
ACTTTGGCAT 
CACTGCCCCC 
CCATGTTGCA 
TGGGGACATC 
CTTGCAGGGG 
GGTCTGTC 



ACACCATGAA 
AGCCCTCGTC 
ATGGCACCAA 
CTGACAATGA 
CCCTCATGCG 
CTATGCTGTC 
AAGCCCCATG 
CCTGCTGCTC 
CACCTGCCCT 
OCCACTGGCC 
CATTCCTGTG 
CCCGCCAGCC 
TCAGCCTGCC 
GGCCCTATCC 
CACATGGCCC 
TATCAATGCC 
GTGTCTTGAC 
GTCCCTGGAG 
TCCCTCTGCT 
CTGGCCCAGC 
TCTCTCTOGC 
ACCCTCAGCC 
TATCAGGGTG 
TCCCGTCTCA 
TTGGGGAGGG 



GAGATACTCC 
TCCTGGCCGC 
AGGTAACTAC 
CCTGCCCGTC 
GAGCCAAAGC 
TCACTACCGG 
AGCTCCTGGC 
TTGCCTTCTA 
CTGCAGCCCT 
ACACTCCATT 
CTCAGAGGCC 
TCCTCCTCTC 
TCTCCCGCCC 
CCCAATGTTG 
ACCTCCTGCT 
CAGCATGGCA 
ATCACCCTAC 
GGTGGCTTCT 
GAGATAAAGA 
TACCATTTAC 
GATGGAGTGT 
GTTGCCCCAT 
CTCAAGGATT 
ACAGCCCCAG 
AGACATCCAG 



ATGTACCTGA 
TTCACCAAGG 
ACCTCCCGGG 
GTCCAAGGCA 
CCCAAGGCCC 
CCATTCTACG 
GGAAGGAACG 
AGCTACTGTG 
CTGCCAGCCT 
CAGACTCCTT 
AACCCATCAC 
GGGCTGGATC 
ACGCCCTGCT 
TCAGCAGATG 
TCCCAGAGGA 
GAACCTGCAG 
CCAGGCGGTG 
CCTGACTGGC 
ATTCCCTTAA 
CATTTGCCTA 
GGCTGGGCTG 
CAGAGGCTGC 
GGAGAGGAGA 
GCCTATGGGG 
CTTGGGCTTT 



CACCCAAAGG 
AGACCACCCA 
TCTGGGAGTA 
GCTCCTCCTT 
AGCCCCAGAC 
TCAACAAAGG 
AGGCGCCACA 
CTTGTCTGGG 
CTTGGGGGCA 
TCCTGCCTTG 
AGGGGTGAGA 
TGGGGGCTAG 
GTCTCCAGGC 
CCTGGACAGC 
CTGGCCCTAC 
TGGCCAAGGG 
GGTCTCCACC 
AGGATGACCT 
CATGATATAA 
CAGAATTTCA 
ACCGCAAAAG 
CTCCTCCTTC 
CAAAACCAGG 
GCTCTGGAAG 
CCCCTTTGGA 



1560 
1620 
1680
1740 
1800
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 






Seq ID NO: 95 Protein sequence: 
Protein Accession #: NP_036233.1 



1 

I

MEAADASRSN 
KPGEGRSALF 
FAEKGDVRKS 
QKAVKSCLVC 
TCICYLCMFQ 
SFTTNEKAIL 
KQTREQLHSI 
MRHVEKMCKA 
VRTSYQPSSP 
LKGYPSLMRS 



11 
I 

GSSPEARDAR 
AGNEWRRPII 
IFSESRKPTV 
QASFCKLHLK 
EHXNHSTVTV 
EQNFRDIiVRD 
SDSVLFLQEF 
DLSRNFIERN 
GRPTKETTQK 
QSPKAQPQTW 



21 
I 

SPSGPSGSLE 
QFVESGDDKN 
SIMEPGETRR 
PHLEGAAFRD 



LEKQKEEVRA 
GALMSNYSLP 
HMENGGDHRY 
NFNNLYGTKG 
KSGKQTMLSH 



31 
I 

NGTKADGKDA 
SNYFSMDSME 
NSYPRADTGL 
HQLLEPIROF 
LSLQKEQLQL 
ALEQREQDAV 
PPLPTYHVLL 
VNNYTNSFGG 
NYTSRVWEYS 
YRPFYVNKGN 



41 
I 

KTTNGHGGEA 
GKRSPYAGLQ 
FSRSKSGSEE 
EARKCPVHGK 
KIIEIEDEAE 
DQVKVIMDAL 
EGEGLGQSLG 
EWSAPDTMKR 
SSIQNSDNDLi 
GIGSNEAP 



51 

I 

AEGKSLGSAL 
LGAAKKPPVT 
VLCDSCIGNK 
TMELFCQTDQ 
KWQKEKDRIK 
DERAKVLHED 
NFKDDLLNVC 
YSMYLTPKGG 
PWQGSSSFS 



Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession #: NM_080668.1

Coding sequence: 83-841



GGCACGAGGG 
GAGCTCGAGA 
GOGCTCCGGG 
AGGCTCTGAA 
AGTCAGAAAG 
CCAATCACCT 
TGGCAGGGAG 
CAGCACTCCT 
CAGAGACTTG 
CTCTGCCTCT 
GGCAGAAGAC 
GGTTTGTGCA 
GAAACAGAAA 
GGCTGCGGCC 
AGATGCAGTG 
CCTGTGGAGA 
GCTGCGCATG 
TTAGGAAATG 
CCTTCCTATC 
GAATCAGTGG 
CTTGTTGGGA 
CCCTCCTGAG 
TTCAGTTCCG 
GAGGGCAATT 
AAGGATGTAG 
GTTTTCTAGT 
GCTGCTTGGA 
GCTCATGGCC 
TGCAGGGGTG 
ACCGAGTCTA 
CAACTATGCT 
ATAGCAATTT 
CTCATGATCT 
TTTGGATTTG 
TTTGATGTTT 
CAGAGAATGC 
AGGCGGGCCC 



11 
I 

CAGCGAGTGG 
CGGAGCCTAG 
CCAAGGGCCC 
CTCCCGAGCA 
CCCATCGTCT 
CQCAGGAGCC 
CTTACTAAGG 
GTGCCGAACC 
GAAATGTCTA 
ACCTCCACCC 
TTGTCCGGAG 
AAGCCCTGGG 
CGTAAGAAGA 
ATGAATGCCG 
GGGGGTGCAC 
GGACACTTAG 
AGGACTGTCT 
GGGCCGCCTG 
TCCCCAAAGT 
CCCAGCCAGA 
GGGGTGGCTG 
TTGCCTTCTG 
TGGGTTTCCC 
CTGTCTTGGA 
GGAGCCTTAG 
TCTGGTCTGC 
GCAAAGGGTG 
TCTGGGCTGG 
GAAGGTGGCC 
CTTGGCCCTA 
TGTAAAGTCC 
TAGTTTTTGG 
CTGGAGAATT 
AAGGCTGCCC 
AGAAGTTCGT 
CAGGGAAGAT 
TAATAAAACC 



21 
I 

CCTTCCCGGT 
TTATGTCTGG 
CATCTCCTAC 
TCCTCCCTGA 
TAAAGAGGAT 
CTAGGATTTC 
AGGACCTTTT 
CTGAGGCCGA 
AGAAAGTCAG 
CAGGCCGCCG 
TCTCGCCAGT 
CCCCAGACAT 
AGAAAATGCC 
AGTTTGAAGC 
CTGGOCAGAC 
GGTCCCCTCC 
GCCTTTGAGG 
GCCCAGCCAC 
ACCATAGCCA 
AGTTAAAGGG 
CTTGGAAATA 
TTGGTGGCTT 
CTTTGATTCT 
GGAAGAAGCT 
TTGGGTCTCA 
TGTTGAGATG 
CATTTCAGGG 
TCCCTTGCAC 
CCTCTTGTCA 
GAAGAGAAAG 
TCGGAAAGTT 
CCTCCCTATC 
TCCCCATCTT 
TGCCCCCGAC 
GGAAGTAGAC 
GAGTGCTGGG 
CTCTGCCAGG 



31 

I

TGGCGCGCGC 
GAGGCGAACG 
TAAGCCTCTG 
AATCTGGCCG 
CGTGGCCCAT 
CTTTTTCTTG 
CAAGACACAC 
GTCCAGCTCC 
GCGTTCCTAC 
GTCCTGCTTT 
GGTGTGCTCC 
GACTCTCCCT 
AGAGATCTTG 
TGCTGAGCAG 
TCTCCCTCCT 
CCTGGTCTTG 
GCTTGGGCAG 
TCACTGGTGT 
GTTTCCAGAT 
CTGAGGGTTG 
GGCCCAGGGG 
TCTTCTTGAA 
GTAAATAGTC 
GGACATTCAG 
GACCATAAGT 
TTTGGTAAAT 
TGTGGCCACC 
AGGGCCCACG 
CCCATACCCA 
TTGAAGAGTC 
TCCTCGCGTA 
CTCTCACATG 
TCTCTTCTTT 
TCTCCTGCCG 
GCTGAGGTGT 
TCAGGGTACT 
TCTGGGAGTC 



41 



51 



CCGGGGCGGC 
CGGTCOGGAG 
CGGAGGTCCC 
AAGACACCCA 
GCTGTAGAGG 
GAGAAAGAAA 
AGCGTCCCTG 
AAGGAAGGAG 
AGCCGGCTGG 
GGCTTCGAGG 
AAACTCACCG 
GGAATCTCCC 
AAAACGGAGC 
TTTGATCTCC 
GTCCTGTACA 
TTACCfGTGT 
CAGCGGCAGC 
CCTGTCTCTT 
GGGCCACAGA 
AGGTGAGAGG 
CTCTGCCAGC 
CCCACCTGTG 
CCAGAGAGAA 
CCTGTGGAGT 
GTGTACTACA 
GCCAGGTTGA 
AGGTGCTGTG 
CTGGAGTCTT 
TTTCTTACAA 
CCAGACCTAC 
CCAGACAGOG 
AGAACACTGC 
CCATCGTGTG 
CACCCCTGGC 
GCAGAGGAGC 
TGGATGAAAC 
CCAGGCCATC 



GGCGCTGGAG 
GAGCCGCTCA 
AGCGGAAATC 
GTGCGGCTGC 
TCCCAGCTGT 
ACGAGCCCCC 
CCACCCCCAC 
AGCTGGACGC 
AGACCCTGGG 
GGCTGCTGGG 
AGGTCCCCAG 
CACCACCCGA 
TGGATGAGTG 
TGGTTGAATG 
TAGCCACCTC 
GTGTGCTGGT 
CATCTTGGTT 
GTCGTCCTGT 
CTGGGGAGGA 
CACCTCTGCT 
CTCGGCCTCT 
TAAAGAGGTT 
TTCGTGGGCT 
CTGAGTTTTG 
CAGAAGCTGT 
TAGGGCGCTG 
AGTTTCTGTG 
ACCACTCTGC 
AATAAGTTAC 
TAGCATTTTG 
GCGGGGGCTG 
CTGGATGCAT 
GATTCAATAG 
CATTGTACCT 
TGGTGGATAA 
GGTGCAGGCC 
TGCTCAACGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 
2220 




TCTGTGGTTT GTCAGACCTG CAAGCAAGCC CCCTGCTGGG GAAGCCTAGQ TGTCCTTGAG 2280 

CTGAACCGCA CTGAAGAACT CTTGTCCTCA CTGGCTGATG CAGCAGAACT CTTGGGAAAT 2340 

GTCTTAGTCC TGCAGAATCA GGAGTCACCA GATGATGCAG AGTTGAGATC ATCATTGCAA 2400 

AGTTCTCTGT TCCTGAGGAA CTAAATTTAA GGAAAAAATG GGATTTTGTT TTAGAGTTGG 2460 
AAAAAAAGCC TGATTAAAGA GTTTCTGCCT GTTAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 97 Protein sequence: 
Protein Accession #: NP_542399.1

1 11 21 31 41 51

I I I I I I 

MSGRRTRSGG AAQRSGPRAP SPTKPLRRSQ RKSGSELPSI LPBIWPKTPS AAAVRKPIVL 60 
KHIVAHAVEV PAVQSPRRSP RISPPLEKEN EPPGRELTKE DLFKTHSVPA TPTSTPVPNP 120 
EAESSSKEGE LDARDLEMSK KVRRSYSRLE TLGSASTSTP GRRSCFGFEG LLGAEDIiSGV 180 
SPWCSKLTE VPRVCAKPWA PDMTLPGISP PPEKQKRKKK KMPEILKTEL DEWAAAMNAB 240 
FEAAEQFDIiL VE 

Seq ID NO: 98 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 58-12444 

1 11 21 31 41 51

I I I I I I 

GGGGCATTTC CGGGTCCGGG CCGAGCGGGC GCACGCGCGG GAGCGGGACT CGGCGGCATG 60 
GCGGGCTCCG GAGCCGGTGT GCGTTGCTCC CTGCTGCGGC TGCAGGAGAC CTTGTCCGCT 120 
GCGGACCGCT GCGGTGCTGC CCTGGCCGGT CATCAACTGA TCCGCGGCCT GGGGCAGGAA 180
TGCGTCCTGA GCAGCAGCCC CGCGGTGCTG GCATTACAGA CATCTTTAGT TTTTTCCAGA 240 
GATTTCGGTT TGCTTGTATT TGTCCGGAAG TCACTCAACA GTATTGAATT TCGTGAATGT 300 

AGAGAAGAAA TCCTAAAGTT TTTATGTATT TTCTTAGAAA AAATGGGCCA GAAGATCGCA 360
CCTTACTCTG TTGAAATTAA GAACACTTGT ACCAGTGTTT ATACAAAAGA TAGAGCTGCT 420 
AAATGTAAAA TTCCAGCCCT GGACCTTCTT ATTAAGTTAC TTCAGACTTT TAGAAGTTCT 480 
AGACTCATGG ATGAATTTAA AATTGGAGAA TTATTTAGTA AATTCTATGG AGAACTTGCA 540
TTGAAAAAAA AAATACCAGA TACAGTTTTA GAAAAAGTAT ATGAGCTCCT AGGATTATTG 600 
GGTGAAGTTC ATCCTAGTGA GATGATAAAT AATGCAGAAA ACCTGTTCCG CGCTTTTCTG 660 
GGTGAACTTA AGACCCAGAT GACATCAGCA GTAAGAGAGC CCAAACTACC TGTTCTGGCA 720 
GGATGTCTGA AGGGGTTGTC CTCACTTCTG TGCAACTTCA CTAAGTCCAT GGAAGAAGA? 780 
CCCCAGACTT CAAGGGAGAT TTTTAATTTT GTACTAAAGG CAATTCGTCC TCAGATTGAT 840 
CTGAAGAGAT ATGCTGTGCC CTCAGCTGGC TTGCGCCTAT TTGCCCTGCA TGCATCTCAG 900 
TTTAGCACCT GCCTTCTGGA CAACTACGTG TCTCTATTTG AAGTCTTGTT AAAGTGGTGT 960 

GCCCACACAA ATGTAGAATT GAAAAAAGCT GCACTTTCAG CCCTGGAATC CTTTCTGAAA 1020 

CAGGTTTCTA ATATGGTGGC GAAAAATGCA GAAATGCATA AAAATAAACT GCAGTACTTT 1080 

ATGGAGCAGT TTTATGGAAT CATCAGAAAT GTGGATTCGA ACAACAAGGA GTTATCTATT 1140 

GCTATCCGTG GATATGGACT TTTTGCAGGA CCGTGCAAGG TTATAAACGC AAAAGATGTT 1200 

GACTTCATGT ACGTTGAGCT CATTCAGCGC TGCAAGCAGA TGTTCCTCAC CCAGACAGAC 1260 

ACTGGTGACG ACCGTGTTTA TCAGATGCCA AGCTTCCTCC AGTCTGTTGC AAGCGTCTTG 1320 

CTGTACCTTG ACACAGTTCC TGAGGTGTAT ACTCCAGTTC TGGAGCACCT CGTGGTGATG 1380 

CAGATAGACA GTTTCCCACA GTACAGTCCA AAAATGCAGC TGGTGTGTTG CAGAGCCATA 1440 

GTGAAGGTGT TCCTAGCTTT GGCAGCAAAA GGGCCAGTTC TCAGGAATTG CATTAGTACT 1500

GTGGTGCATC AGGGTTTAAT CAGAATATGT TCTAAACCAG TGGTCCTTCC AAAGGGCCCT 1560 

GAGTCTGAAT CTGAAGACCA CCGTGCTTCA GGGGAAGTCA GAACTGGCAA ATGGAAGGTG 1620 

CCCACATACA AAGACTACGT GGATCTCTTC AGACATCTCC TGAGCTCTGA CCAGATGATG 1680 

GATTCTATTT TAGCAGATGA AGCATTTTTC TCTGTGAATT CCTCCAGTGA AAGTCTGAAT 1740 

CATTTACTTT ATGATGAATT TGTAAAATCC GTTTTGAAGA TTGTTGAGAA ATTGGATCTT 1800 

ACACTTGAAA TACAGACTGT TGGGGAACAA GAGAATGGAG ATGAGGCGCC TGGTGTTTGG 1860 

ATGATCCCAA CTTCAGATCC AGCGGCTAAC TTGCATCCAG CTAAACCTAA AGATTTTTCG 1920 

GCTTTCATTA ACCTGGTGGA ATTTTGCAGA GAGATTCTCC CTGAGAAACA AGCAGAATTT 1980 

TTTGAACCAT GGGTGTACTC ATTTTCATAT GAATTAATTT TGCAATCTAC AAGGTTGCCC 2040 

CTCATCAGTG GTTTCTACAA ATTGCTTTCT ATTACAGTAA GAAATGCCAA GAAAATAAAA 2100 

TATTTCGAGG GAGTTAGTCC AAAGAGTCTG AAACACTCTC CTGAAGACCC AGAAAAGTAT 2160 

TCTTGCTTTG CTTTATTTGT GAAATTTGGC AAAGAGGTGG CAGTTAAAAT GAAGCAGTAC 2220 

AAAGATGAAC TTTTGGCCTC TTGTTTGACC TTTCTTCTGT CCTTGCCACA CAACATCATT 2280 

GAACTCGATG TTAGAGCCTA CGTTCCTGCA CTGCAGATGG CTTTCAAACT GGGCCTGAGC 2340 

TATACCCCCT TGGCAGAAGT AGGCCTGAAT GCTCTAGAAG AATGGTCAAT TTATATTGAC 2400 

AGACATGTAA TGCAGCCTTA TTACAAAGAC ATTCTCCCCT GCCTGGATGG ATACCTGAAG 2460 

ACTTCAGCCT TGTCAGATGA GACCAAGAAT AACTGGGAAG TGTCAGCTCT TTCTCGGGCT 2520 

GCCCAGAAAG GATTTAATAA AGTGGTGTTA AAGCATCTGA AGAAGACAAA GAACCTTTCA 2580

TCAAACGAAG CAATATCCTT AGAAGAAATA AGAATTAGAG TAGTACAAAT GCTTGGATCT 2640 

CTAGGAGGAC AAATAAACAA AAATCTTCTG ACAGTCACGT CCTCAGATGA GATGATGAAG 2700 

AGCTATGTGG CCTGGGACAG AGAGAAGCGG CTGAGCTTTG CAGTGCCCTT TAGAGAGATG 2760 

AAACCTGTCA TTTTCCTGGA TGTGTTCCTG CCTCGAGTCA CAGAATTAGC GCTCACAGCC 2820 

AGTGACAGAC AAACTAAAGT TGCAGCCTGT GAACTTTTAC ATAGCATGGT TATGTTTATG 2880 

TTGGGCAAAG CCACGCAGAT GCCAGAAGGG GGACAGGGAG CCCCACCCAT GTACCAGCTC 2940 

TATAAGCGGA CGTTTCCTGT GCTGCTTCGA CTTGCGTGTG ATGTTGATCA GGTGACAAGG 3000 

CAACTGTATG AGCCACTAGT TATGCAGCTG ATTCACTGGT TCACTAACAA CAAGAAATTT 3060 

GAAAGTCAGG ATACTGTTGC CTTACTAGAA GCTATATTGG ATGGAATTGT GGACCCTGTT 3120 

GACAGTACTT TAAGAGATTT TTGTGGTCGG TGTATTCGAG AATTCCTTAA ATGGTCCATT 3180 

AAGCAAATAA CACCACAGCA GCAGGAGAAG AGTCCAGTAA ACACCAAATC GCTTTTCAAG 3240 

CGACTTTATA GCCTTGCGCT TCACCCCAAT GCTTTCAAGA GGCTGGGAGC ATCACTTGCC 3300 

TTTAATAATA TCTACAGGGA ATTCAGGGAA GAAGAGTCTC TGGTGGAACA GTTTGTGTTT 3360 

GAAGCCTTGG TGATATACAT GGAGAGTCTG GCCTTAGCAC ATGCAGATGA GAAGTCCTTA 3420 

GGTACAATTC AAGAGTGTTG TGATGCCATT GATCACCTAT GCCGCATCAT TGAAAAGAAG 3480

CATGTTTCTT TAAATAAAGC AAAGAAACGA CGTTTGCCGC GAGGATTTCC ACCTTCCGCA 3540

TCATTGTGTT TATTGGATCT GGTCAAGTGG CTTTTAGCTC ATTGTGGGAG GCCCCAGACA 3600 

GAATGTCGAC ACAAATCCAT TGAACTCTTT TATAAATTCG TTCCTTTATT GCCAGGCAAC 3660 

AGATCCCCTA ATTTGTGGCT GAAAGATGTT CTCAAGGAAG AAGGTGTCTC TTTTCTCATC 3720 

AACACCTTTG AGGGGGGTGG CTGTGGOCAG CCCTCGGGCA TCCTGGCCCA GCCCACCCTC 3780 

TTGTACCTTC GGGGGCCATT CAGCCTGCAG GCCACGCTAT GCTGGCTGGA CCTGCTCCTG 3840 
GCCGCGTTGG
CTAGGTACTG 
GCCATGCATG 
ACAAGCCCAC 
ATGGAGTTTA 
GACTTGTGTA 
ATAGGTTTCA 
CTGATGAAAG 
AAAATAACAO 
CAAGTGGACA 
GGGCTTCTGC 
GAACTTCTTT 
TCTCTAGACC 
GGAGGACTGT 
TCCTTGGGCA 
TTGTTCTCAG 
CTCATGCAGT 
GACCAGAGCT 
ATTCTGCAAC 
AAAATGGCAG 
AATACAAGTC 
ACAAAGCTGG 
CTCACTGGAG 
TTCCOCATGC 
TGCATGAAAA 
ATGACAGAAG 
TTCAGGAGGA 
TATGAAATGT 
CGCTCCCTCC 
ACAATTGTGG 
ACCTTTGATA 
TCTCGCCTTC 
GGCTCGTGTA 
GCATTTACAG 
TGTGCAGCAT 
TACCAAGGTT 
ATCGACCTGA 
AAAAAGTACA 
CCTTCCTATA 
CAATTTGATT 
GCCACTGGTC 
GAGCTGGAGA 
AAGCACATGC 
CTTCCTTCTT 
ATCCGTCTCT 
AAGCACTGGC 
ATTCACTACA 
CCAACAGGGG 
CATGTCTTTC 
GTCGAGTGCT 
GGTAAAGATC 
GCCAATGACC 
GCTTTGGTGA 
GTTCTAGGAC 
TGTGAACTGG 
GTGTGCTTGA 
GTGTTCTTTC 
CTTTGTCGTG 
CAAGTCATGA 
ATGATGCCAA 
GTTTCCCATC 
GATAATTACA 
GCAAAAGATG 
OGAAATTTCT 
CTAAATTCCT 
CTGCTCGAAA 
TCAGAATGCG 
CTCACTCCGA 
GAAGGGTCCC 
CATGACTTCA 
AGCAGCACTG 
CACAAGAGGA 
AAAAAAAGGC 
CGGACGGACC 
ATGTATGCCA 
AAAATGAAGC 
GACATTCAGA 
CCAATAATTG 
AAATTTAAGA 
CGTTTTCTTA 
AGCTGTCAGC 
GCCAGCCTAC 
CCTGCTGAGC 
AGATGGGTGG 
ATTTTTACCA 
AGAAGTGATT 
GTAGATGGTG 



AGTGCTACAA 
AAGCCCAGTC 
ACATTATAGC 
AAGAGGGAGA 
CCACGACTCT 
ATACACACCT 
ACATCGGAGA 
CTCTAAAGAT 
CACAGAGCAT 
GGAGCAGGCT 
ATAATATATT 
CCCTGGTTTA 
TCAGTTGTAA 
GTGAGCGCCT 
GCTCACAGGG 
AAACGATCAA 
CTTCAGTGGA 
TCAGGGAGCG 
ACTGGAAGAA 
TGCTGGCCTT 
ATGGTTCATT 
ATCTACATTT 
GCAGTCTGGA 
AGTCCAGGGA 
AGTTTCTAGA 
TTCTTTGTCG 
TTGCCAGAAG 
TCAGGAAGGA 
TCACTCTGCT 
TGGATGCCAT 
CTCAAATCAC 
CCAAAGATGA 
TTACAGAAGG 
AGAACATGGC 
ACAACTGCGC 
TTCTGTTTAG 
AGCGCCGCTA 
TTGAAATTAG 
TGTCTTCCCT 
TCTCAACCGG 
GTTTTCGGAG 
TGGACGAGCT 
ACAGAAGCCT 
GGATGAAATT 
TCTTAGCCAA 
TTAGCCCCTT 
TGGTGGTTGA 
TCCCTAAAGA 
ATCCAAAAAG 
GGAAGGATTG 
CTAATrCTAA 
TGCCTCCCTA 
ATAATATGTC 
TTATACTTCG 
TTGCGAAACA 
ACAAAGTGAC 
TGCTGCCAAA 
TGGAGGGAAT 
GACATAGAGA 
AGTTAAAACC 
CTTCTACAAC 
GAGATCCAGA 
TGCTGATTCA 
GGAGCCATGA 
TATATTCTCC 
TGACCAGCAT 
AATTTCAGGA 
TGTTTGTGGA 
TCTCAGCTCG 
CACTGACACA 
ACCCGCTGGT 
GTGAAAGGTT 
TGGGCCTTCC 
TACTAOGACT 
GAAAAGGCGT 
AGGATGCCCA 
TCAAGCACAG 
CAAAACAGCT 
CACTGTCTGA 
ATACCACCTT 
ACGCAGCCCT 
AGCAGCCCGT 
TGCCTGCCAA 
AGCTTGCTAA 
GTGAGATAGG 
ATTCTGAAGC 
AGCCCACAGA 



CACGTTCATT 
TTCACTTTTG 
AGCAGAAAAG 
AAGGTACAAC 
GCTAAACACC 
GATGAGAGTC 
CGTCCAGGTT 
GTCCCCATAC 
TGAGGAGCTT 
GGCTGCTGTT 
ACCGTCTCAG 
TAAAGGCATT 
GCAGCTGGCC 
TGTGAGTCTT 
CAGOGTCATC 
CACGGAATTA 
TAATACCAAA 
AGCAAACCAG 
GTGTGATTCA 
ACTGGCAAAA 
CCCTGAAGTC 
AAAGGGCCAA 
GGAACTTAGA 
ATTTCCTCCA 
TGCATTGGAA 
GGAACAGCAG 
GGGTTCATGT 
TGACCCCCGC 
GTGGCACTGT 
TGATGTGTTG 
CAAGAAGATG 
TGTTCATGCT 
AAATGAACTT 
AGGAGAGAAT 
CATATCTGTC 
TGAAAAACCA 
TAATTTTCCT 
GAAAGAAGCC 
GTCATATTTG 
AGTTCAGAGC 
ACGGGAGCAG 
CAATCGGCAT 
GGGCCCGCCT 
CCTCCATGGC 
GCTTGTTATT 
GCTGCAGCTG 
GATAGTGGCC 
TGAAGTGTTA 
AGCTGTGTTT 
TTTATCCATC 
AGACAACTCA 
TGACCCACAG 
CTTTGTAAGA 
ATATGTTATG 
ATTGAAGCAA 
CAAGAGCTTC 
ATTTCATGGA 
GACAGAGCTG 
TGATGAAAGA 
AGTAGAACTC 
ATGTAGGGAA 
AAGTGAGACA 
AGGATTGATC 
AACTAGGTTA 
TAAGATAGAA 
GAGCCCAGAT 
ATATACCATT 
GACCCAGGCC 
CTGGCCAGTG 
GACTGCAGAT 
CGACCACACC 
ACAGAGAGCA 
AGGGGAOGAG 
GCGCAGACGG 
TGCTGAGCAA 
GGTOGTTCTG 
CAGCCTCATC 
CTTTAGCAGC 
AAAAAACAAC 
CTCTTTCTTT 
GCTGAGCCTC 
GGGCATCOGC 
GCGAGTCCGT 
GCTGTATAGA 
AACAAAGCAA 
TGCTAAGCAG 
AGCCGAGAAG 



GGCGAGAGAA 
AAAGCAGTGG 
TGCTTTGGCA 
TACAGCAAAT 
TCCCCGGAAG 
CTGGTGCAGA 
ATGGCTCATC 
AAAGATATCC 
TGTGCCGTCA 
GTGTCTGCCT 
TCCACAGATT 
GCCCCTGGAG 
AGOGGACTTC 
CTCCTGAACC 
CACTTCTCCC 
TTGAAAAATC 
ATGGTGAGTG 
AAACACCAAG 
TGGTGGGCCA 
ATTTTACAGA 
TTTACAACAT 
GCTGTCACTC 
CGTGTTCTGG 
GGAACTCCGC 
TTATCTCAAA 
CATGTCATGG 
GTCACACAAG 
CTAAGTTTCA 
AGCCTGGATG 
AAGTCCAGGT 
GGCTACTATA 
AAGGAATCAA 
ACAAAGACAT 
CAGCTGCTGG 
ATCTGCTGTG 
GAAAAGAACT 
GTAGAAGTTG 
AGAGAAGCAG 
GCAGACAGTA 
TATTCATACA 
CGGGACCCCA 
GAGTGCATGG 
CAAGGAGAAG 
AAACTGGGAA 
AATACAGAAG 
GCTGCTTCTG 
ACTATTCTTT 
GCAAATCGAT 
AGACACAACC 
CCTTATAGGT 
GTAGGGATTC 
TGTGGCATCC 
TATAAAGAAG 
GAGAGAAAAA 
CATCAGAATA 
CCTCCTCTTG 
GTGTTGAAAA 
TACTTCCAGT 
CAAAAAGTAT 
CGAGAACTTC 
CAAATGTATA 
GATAATGACT 
GATGAGAACC 
CCTTCAAATA 
GTGCACTTTT 
TATCCAAACC 
GATTCTGATT 
TCCCAGGGCA 
GCAGGGCAGA 
GGAAGAAGCT 
AGTCCCTCAT 
CCCTTGAAGT 
GTGGATAACA 
TTTATGAGGG 
AAACGAGAGA 
TACAGAAGCT 
ACCCCGTTAC 
TTGTTTTCTG 
ATCACTCAAA 
CCACCCTTTG 
GACCCAGCGG 
CTGCTAGAGG 
GGGAAGGCCC 
TCAATTGGAG 
ATCACTCAGA 
TATGATGAGG 
GATTTTTGGG 



CTGTAGGAGC 
CTTTCTTCTT 
CTGGGGCAGC 
GCACCGTTGT 
GATGGAAGCT 
OGCTGTGTGA 
TTCCTGATGT 
TAGAGACCCA 
ACTTGTATGG 
GTAAACAGCT 
TGCATCATTC 
ATGAGAGACA 
TGGAGTTAGC 
CAGCGGTGCT 
ATGGGGAGTA 
TGGATCTTGC 
CCGTTTTGAA 
GACTGAAACT 
AAGATTCCCC 
TTGATTCATC 
ATATTAGTCT 
TTCTTCCATT 
AGCAGCTCAT 
GGTTCAATAA 
GCCCTATGTT 
AAGAATTATT 
TAGGCCTTCT 
CACGCCAGTC 
CTTTGAGAGA 
TTACAAAGCT 
AGATTCTAGA 
AAATTAATCA 
TGATTAAATT 
AGAGGAGAAG 
TCTTCAATGA 
TGCTTATTTT 
AGGTTCCTAT 
CAAATGGGGA 
CCCTGAGTGA 
GCTCCCAAGA 
CGGTGCATGA 
CGCCCCTGAC 
AGGATTCAGT 
ATCCAATAGT 
AGGTCTTTCG 
AAAACAATGG 
CATGGACAGG 
TGCTTAATTT 
TTGAAATTAT 
TAATATTTGA 
AATTGCTAGG 
AGAGTAGCGA 
TGTATGCCGC 
ACATACTGGA 
CTATGGAGGA 
CAGACAGGTT 
CACTCTGTCT 
TAAAGAGCAA
GTTTGGACAT 
TGAACCCCGT 
ATATTCTCAT 
CCCAGGAAAT 
CTGGACTTCA 
CCTTGGACCG 
TAAGTTTAGC 
CCATGTTCGA 
GGCGTTTCCG 
CTCTCCAGAC 
TAAGGGCCAC 
CATTTGATTG 
CTGACTCCTT 
CAGTGGGGCC 
AAGTGAAAGG 
ACCAGGAGAA 
AGGAAATCAA 
ACCGGCACGG 
AGGCCGTGGC 
GAATTTTGAA 
AGTTGCTTCA 
TCTCTTGTAT 
CTGTTAGCGC 
AGGCTCTGCT 
GCCTCCCTCC 
AATACGACGT 
GTGCATTATT 
CTCTCAATAA 
AACTTGCATC 



GCTCCAGGTC 
AGAAAGCATT 
AGGTAACAGA 
GGTCOGGATT 
CCTGAAGAAG 
GCCCGCAAGC 
TTGTGTGAAT 
TCTGAGAGAG 
CCCTGACGCG 
TCACAGAGCT 
TGTTGGCACA 
GTGTCTGCCT 
CTTTGCTTTT 
GTCCACGGCG 
TTTCTATAGC 
TGTATTGGAG 
OGGCATGTTA 
TGCGACTACA 
TCTCGAAACT 
TGTATCTTTT 
ACTTGCTGAC 
CTTCACCAGC 
CGTTGCTCAC 
TTATGTGGAC 
GTTGGAATTG 
TCAATCCAGT 
GGAAAGCGTG 
CTTTGTGGAC 
ATTCTTCAGC 
AAATGAATCT 
CGTGATGTAT 
AGTTTTCCAT 
GTGCTACGAT 
ACTTTACCAT 
GTTAAAATTT 
TGAAAATCTG 
GGAAAGAAAG 
TTCAGATGGT 
GGAAATGAGT 
CCCTAGACCT 
TGATGTGCTG 
GGCCCTGGTC 
GCCAAGAGAT 
ACCATTAAAT 
CCCTTACGCG 
AGGAGAAGGA 
CTTGGCCACT 
CCTAATGAAA 
AAAGACCCTT 
AAAGTTTTCC 
CATCGTGATC 
ATACTTCCAG 
TGCAGCAGAA 
GGAGTCTCTG 
CAAGTTTATT 
CATGAATGCT 
GGAGGTGGTA 
GGACTTCGTT 
AATTTATAAG 
TGTGGAATTC 
GTGGATTCAT 
ATTTAAGTTG 
ATTAATTATT 
GTTGCTGGCA 
AACAAATTTT 
GCATCCTCTG 
AAGTACTGTT 
CCGTACCCAG 
CCAGCAGCAG 
GCTGACCGGG 
GCTGTTTGCC 
TGATTTTGGG 
TGCGGCCGGC 
GCTCAGTTTG 
GAGTGAGTTA 
AGACCTTCCT 
CCAGAGGGAC 
AGAGATGGAT 
AGACTTCAAT 
TCAGGACATT 
TGGTTGCCTG 
CCGCCTGCTG 
TGATGTCCTC 
CCTCCGTGGG 
AGCAGAAGCC 
ACAAGACTGG 
CCTTGACTGT 
TACAACCACC TTGCTGAGTG GAAATCACTT GAATACTGTT
GAGAACCCCC CAGACCTAAA TAAAATCTGG AGTGAACCAT 
CCTTACATGA TCCGCAGCAA GCT6AAGCTG CTGCTCCAGG 
CTGACATTTA TTGACAAAGC TATGCACGGG GAGCTCCAGA 
TACAGTCAAG AGCTGAGTCT GCTTTACCTC CTGCAAGATG
TACATTCAAA ATGGCATTCA GAGTTTTATG CAGAATTATT 
CACCAAAGTA GACTCACCAA ATTGCAGTCT GTACAGGCTT 
ATCAGCTTTA TAAGCAAACA AGGCAATTTA TCATCTCAAG 
AACACCTGGA CAAACAGATA TCCAGATGCT AAAATGGACC 

ATCATCACAA ATCGATGTTT CTTTCTCAGC AAAATAGAGG
GAAGATAATA GTATGAATGT GGATCAAGAT GGAGACCCCA 
GAGCAGGAAG AAGATATCAG CTCCCTGATC AGGAGTTGCA 
ATGATAGACA GTGCCCGGAA GCAGAACAAT TTCTCACTTG 
CTGCATAAAG AGTCAAAAAC CAGAGACGAT TGGCTGGTGA 

CGCCTGAGCC ACTGCCGGAG CCGGTCCCAG GGCTGCTCTG
AAAACAGTCT CTTTGTTGGA TGAGAACAAC GTGTCAAGCT 
GCTTTCCGTG ACCAGAACAT TCTCTTGGGT ACAACTTACA 
AGCAGTGAGC CAGCCTGCCT TGCTGAAATC GAGGAGGACA 
CTTTCTGGAT CCAGTTCAGA GGATTCAGAG AAGGTGATCG 

TTCCAGCACC TCTCTGAGGC TGTGCAGGCG GCTGAGGAGG
AGCTGTGGGC CTGCAGCTGG GGTGATTGAT GCTTACATGA 
CAACAGCTGC GCAAGGAGGA AGAGAATGCA TCAGTTATTG 
TATCCAGCAC TTGTGGTGGA GAAAATGTTG AAAGCTTTAA 
AGATTGAAGT TTCCTAGATT ACTTCAGATT ATAGAACGGT 

CTCATGACAA AAGAGATCTC TTCCGTTCCC TGCTGGCAGT
ATGGTGGCCT TACTGGACAA AGACCAAGCC GTTGCTGTTC
ACTGATAACT ACCCGCAGGC TATTGTTTAT CCCTTCATCA 
TTCAAGGATA CTTCTACTGG TCATAAGAAT AAGGAGTTTG 

TTGGATCAAG GAGGAGTGAT TCAAGATTTT ATTAATGCCT

GAACTGCTCT TTAAGGATTG GAGCAATGAT GTAAGAGCTG
AATAAAAAAA ACATTGAAAA AATGTATGAA AGAATGTATG 
GCTCCAGGCC TGGGGGCCTT TAGAAGGAAG TTTATTCAGA 
AAACATTTTG GGAAAGGAGG TTCTAAACTA CTGAGAATGA 
ATTACCAACA TGCTACTTTT AAAAATGAAC AAAGACTCAA 

GAATGTTCAC CCTGGATGAG CGACTTCAAA GTGGAGTTCC
CCCGGTCAGT ATGACGGTAG GGGAAAGCCA TTGCCAGAGT 
TTTGATGAGC GGGTGACAGT CATGGCGTCT CTGCGAAGGC 
GGCCATGAOG AGAGGGAACA CCCTTTCCTG GTGAAGGGTG 

CAGCGCGTGG AGCAGCTCTT CCAGGTCATG AATGGGATCC
AGCCAGAGGG CCCTGCAGCT GAGGACCTAT AGCGTTGTGC

TTAATTGAGT GGCTTGAAAA TACTGTTACC TTGAAGGACC 
CAAGAGGAGA AGGCGGCTTA CCTGAGTGAT CCCAGGGCAC 
TGGCTGACAA AAATGTCAGG AAAACATGAT GTTGGAGCTT 
GCTAATCGTA CTGAAACAGT CACGTCTTTT AGAAAACGAG

45 CTCTTAAAGC GGGCCTTCGT GAGGATGAGT ACAAGCCCTG 
TCCCACTTCG CCAGCTCTCA CGCTCTGATA TGCATCAGCC 
GACAGACATC TGAACAACTT TATGGTGGCC ATGGAGACTG 
TTTGGGCATG CGTTTGGATC CGCTACACAG TTTCTGCCAG 
CGGCTAACTC GCCAGTTTAT CAATCTGATG TTACCAATGA
AGCATCATGG TACACGCACT CCGGGCCTTC CGCTCAGACC
ATGGATGTGT TTGTCAAGGA GCCCTCCTTT GATTGGAAAA 
AAAAAAGGAG GGTCATGGAT TCAAGAAATA AATGTTGCTG 
CAGAAAATAT GTTACGCTAA GAGAAAGTTA GCAGGTGCCA 
GATGAGCTAC TCCTGGGTCA TGAGAAGGCC CCTGCCTTCA

CGAGGAAGCA AAGATCACAA CATTCGTGCC CAAGAACCAG
ACTCAAGTGA AGTGCCTGAT GGACCAGGCA ACAGACCCCA 
GAAGGATGGG AGCCCTGGAT GTGAGGTCTG TGGGAGTCTG 
GTTTAAAGAA TCTACTATAC TTTGGTTGGC AGCATTCCAT 
CTAAAGAGAA ATGTCTTTTG TGCTACAGTT TCGTAGCATG 
TGAGTAAATG TGTATGGGTT AAATCAAAGA TAAGGTTATA
AGGTTTATAG AAAGATAGAT ATCCAGGCTT ACCAAAGTAT 
TGATCAGCTT TCAAAGCATT TACAAGTGCT GCAAGTTAGT 
TGGAGGAAAT GTGGGGAAGC CTTGGAATGC CCTTCTGGTT 

CTCAGAAGGC TTCATCACCA AGATTTTGGG AGAGTAAAGC
TTGTAGAAGC AGCATAGGAA CAATAAGAAC AATAGGTAAA

TTAGAAATGA CTGCATTTGA TATTTTAGGA TATTTTTCTA 
TTCTCTTCTA GTTTTGACAT TTTATGATAG ATTTGCTCTC 
TAGGAGGGCA AAAATTTTGG TCATAGCATT CACTTTTGCT 
GATACATAAA AGTGCTTTGC ATTGAATTTG GGATAACTTC 

TAGGGATAGT ACTAAGCATT TCAGTTCCAG GAGAATAAAA
ATTCCTCATT TGGAGGAAAA AAAGCATGCA TTCTAGCACA 
ACAAAAGTGG CTCCTTCCCA TGTGCAGTCC CTGTCCCCCC
AACTGTTTCT GATTGGCTTT TAGCTTTTTG TTGTTTTTTT 
TTTGGAGGCT CTTCTGTGAT TTTGAGAAGT ATACTCTTQA 

CCAAAAGTA




CTACAGCCAG 
TTTATCAGGA 
GAGAGGCTGA 
AGGCGATTCT 
ATGTTGACAG 
CTAGTATTGA 
TAACAGAAAT 
TTCCCCTTAA 
CAATGAACAT 
AGAAGCTTAC 
GTGACAGGAT 
AGTTTTCCAT 
CTATGAAACT 
GCTGGGTGCA 
AGCAGGTGCT 
ACTTAAGCAA 
GGATCATAGC 
AGGCTAGAAG 
CGGGTCTGTA 
AGGCCCAGCC 
CGCTGGCAGA 
ATTCTGCAGA 
AATTAAATTC 
ATCCAGAGGA 
TCATCAGCTG 
AGCACTCTGT 
TAAGCAGCGA 
TGGCAAGGAT 
TAGATCAGCT 
AACTAGCAAA 
CAGCCTTGGG 
CTTTTGGAAA 
AGCTCAGTGA 
AGCCCCCTGG 
TGAGAAATGA 
ACCACGTGOG 
CCAAGCGCAT 
GCGAGGACCT 
TGGCCCAAGA 
CCATGACCTC 
TTCTTTTGAA 
CGCCGTGTGA 
ACATGCTAAT 
AAAGTAAAGT 
AGGCTTTCCT 
ACTGGATCCT 
GCGGCGTGAT 
TCCCTGAGTT 
AAGAAACGGG 
CTGGCCTGCT 
ATTTTGAACA 
AAAAAAATTG 
ATCCAGCAGT 
GAGACTATGT 
AGAGTGGGCT 
ACATCCTTGG 
CAGATAGAAA 
GAGCTGATTT 
AGTTTAAATC 
GTAACATGAA 
TAAGTCAAGA 
GAAACAGCTG 
CTGGCACATT 
TAAGTATAGT 
GCTATAATTA 
GGTTTTTTCC 
TAGAAGGAAA 
ATTCCAATCT 
AAAAATCCCA 
GAAATTCCTA 
ACAAGATGAA 
CCGCCAGTCC 
TTTTCCTTCT 
GTGTTTAATA 



TATAGACAGT 
AACATATCTA 
CCAGTCCCTG 
AGAGCTTCAT 
AGCCAAATAT 
TGTCCTCTTA 
TCAGGAGTTC 
GAGACTTCTG 
CTGGGATGAC 
CCCTCTTCCA 
GGAAGTGCAA 
GAAAATGAAG 
ACTGAAGGAG 
GAGCTACTGC 
CACTGTGCTG 
AAATATTCTG 
GAATGCTCTC 
AATCTTAGAG 
CCAGAGAGCA 
TCCCTCCTGG 
TTTCTGTGAC 
ACTGCAGGCG 
CAATGAAGCC 
GACTTTGAGC 
GATCAGCCAC 
GGAAGAAATC 
AAGCTATTCC 
TAAAAGTAAG 
CTCTAATCCT 
AACCCCTGTA 
TGACCCAAAG 
AGAATTTGAT 
CTTCAACGAC 
GAATCTGAAA 
GCTGGAGATT 
AATCGCCGGG 
CATCATCCGT 
GCGGCAGGAC 

CAGGTTAGGA 
CACCATGTCC 
ATATAAAGAT 
GTATAAGGGC 
GCCTGCTGAT 
GGCGCTCCGC 
CGGGATTGGA 
CGGGATCGAC 
GATGCCTTTT 
CCTTATGTAC 
CACCAACACC 
GAAAATGCTG 
GTACCCCOGA 
CATTACTTGT 
GGCTGTGGCA 
TTCAGAAGAG 
CAGAACCTGG 
GCATTACATT 
TCCTGAAACA 
AAGATTATGA 
AGATTAGGTG 
ATATAATATG 
TCTCOGTAAA 
GGAAAGCACA 
TGATGTAACA 
TGGCTTATAT 
TTTCATTTTA 
CGTCTTTATT 
ACAACTGGAA 
TGGTTGTTGT 
TTTGAAATGA 
ATTATGGAAT 
TCCACACCCA 
AACACTTGTA 
AAGTTTTTTT 



9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380
10440 
10500 
10560 
10620 
10680 
10740 
10800
10860 
10920 
10980
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 



Seq ID NO: 99 Protein sequence: 
Protein Accession #: NP_008835.5



1 11 21 31 41 51

I I I I I I

MAGSGAGVRC SLLRLQETLS AADRCGAALA GHQLIRGLGQ ECVLSSSPAV LALQTSLVFS 60

RDFGLLVFVR KSLNSIEFRE CREEILKFLC IFLEKMGQKI APYSVEIKNT CTSVYTKDRA 120

AKCKIPALDL LIKLLQTFRS SRLMDEFKIG ELFSKFYGEL ALKKKIPDTV LEKVYELLGL 180

LGEVHPSEMI NNAENLFRAF LGELKTQMTS AVREPKLPVL AGCLKGLSSL LCNFTKSMEE 240

DPQTSREIFN FVLKAIRPQI DLKRYAVPSA GLRLFALHAS QFSTCLLDNY VSLFEVLLKW 300

CAHTNVELKK AALSALESFL KQVSNMVAKN AEMHKNKLQY FMEQFYGIIR NVDSNNKELS 360



IAIRGYGLFA GPCKVINAKD VDFMYVELIQ RCKQMFLTQT DTGDDRVYQM PSFLQSVASV 420

LLYLDTVPEV YTPVLEHLVV MQIDSFPQYS PKMQLVCCRA IVKVFLALAA KGPVLRNCIS 480

TVVHQGLIRI CSKPVVLPKG PESESEDHRA SGEVRTGKWK VPTYKDYVDL FRHLLSSDQM 540

MDSILADEAF FSVNSSSESL NHLLYDEFVK SVLKIVEKLD LTLEIQTVGE QENGDEAPGV 600 

WMIPTSDPAA NLHPAKPKDF SAFINLVEFC REILPEKQAE FFEPWVYSFS YELILQSTRL 660 

PLISGFYKLL SITVRNAKKI KYFEGVSPKS LKHSPEDPEK YSCFALFVKF GKEVAVKMKQ 720 

YKDELLASCL TFLLSLPHNI IELDVRAYVP ALQMAFKLGL SYTPLAEVGL NALEEWSIYI 780 

DRHVMQPYYK DILPCLDGYL KTSALSDETK NNWEVSALSR AAQKGFNKVV LKHLKKTKNL 840

SSNEAISLEE IRIRVVQMLG SLGGQINKNL LTVTSSDEMM KSYVAWDREK RLSFAVPFRE 900

MKPVIFLDVF LPRVTELALT ASDRQTKVAA CELLHSMVMF MLGKATQMPE GGQGAPPMYQ 960 

LYKRTFPVLL RLACDVDQVT RQLYEPLVMQ LIHWFTNNKK FESQDTVALL EAILDGIVDP 1020

VDSTLRDFCG RCIREFLKWS IKQITPQQQE KSPVNTKSLF KRLYSLALHP NAFKRLGASL 1080 

AFNNIYREFR EEESLVEQFV FEALVIYMES LALAHADEKS LGTIQECCDA IDHLCRIIEK 1140

KHVSLNKAKK RRLPRGFPPS ASLCLLDLVK WLLAHCGRPQ TECRHKSIEL FYKFVPLLPG 1200

NRSPNLWLKD VLKEEGVSFL INTFEGGGCG QPSGILAQPT LLYLRGPFSL QATLCWLDLL 1260

IiAAXiECYNTF IGERTVGALQ VLGTEAQSSL LKAVAFFLES IAMHDIIAAE KCFGTGAAGN 1320 

RTSPQEGERY NYSKCTWVR IMEFTTTLLN TSPEGWKLLK KDLCNTHLKR VLVQTLCEPA 1380 

SIGFNIGDVQ VMAHLPDVCV NLMKALKMSP YXDILETHLR EKITAQSIEE LrCAVNLYGPD 1440 

AQVDRSRLAA WSACKQLHR AGLLHNIIiPS QSTDLHHSVG TELLSLVYKG IAPGDERQCL 1500 

PSLDLSCKQL ASGLLELAFA FGGLCERLVS LLLNPAVLST ASLGSSQGSV IHFSHGEYFY 1560 

SLFSETINTE LLKNLDLAVL ELMQSSVDNT KMVSAVLNGM LDQSFRERAN QKHQGLKLAT 1620 

TILQHWKKCD SWWAKDSPLE TKMAVLALLA KILQIDSSVS FNTSHGSFPE VFTTYISLLA 1680 

DTKLDlaHLRG QAVTLLPFFT SLTGGSLEEL RRVLEQLIVA HFPMQSREFP PGTPRFNNYV 1740 

DCMKKFLDAL ELSQSPMLLE LMTEVLCREQ QHVMEELFQS SFRR1ARRGS CVTQVGLLES 1800 

VYEMFRKDDP RLSFTRQSFV DRSLLTLLWH CSLDALREFF STIWDAIDV LKSRFTKLNE 1860 

STFDTQITKK MGYYKILDVM YSRLPKDDVH AKESKINQVF HGSCITEGNE LTKTLIKLCY 1920 

DAFTENMAGE NQLLERRRLY HCAAYNCAIS VICCVFNELK FYQGFLFSEK PEKNLLIFEN 1980 

LIDLKRRYNP PVEVEVPMER KKKYIEIRKE AREAANGDSD GPSYMSSLSY LADSTLSEEM 2040 

SQFDFSTGVQ SYSYSSQDPR PATGRFRRRE QRDPTVHDDV LELEMDELNR HECMAPLTAL 2100 

VKHMHRSLGP PQGEEDSVPR DLPSWMKFLH GKLGNPIVPL NIRLFLAKLV INTEEVFRPY 2160 

AKHWLSPLLQ LAASENNGGE GIHYMWEIV ATILSWTGLA TPTGVPXDEV LANRLLNFLM 2220 

KHVFHPKRAV FRHNLBIIKT LVECWKDCLS IPYRLIFEKF SGKDPNSKDN SVGIQLLGIV 2280 

MANDLPPYDP QCGIQSSEYF QALVNNMSFV RYKEVYAAAA EVLGLILRYV MERKNILEES 2340 

LCELVAKQLK QHQNTMEDKF IVCLNKVTKS FPPLADRFMN AVFFLLPKFH GVLKTLCLEV 2400 

VLCRVEGMTE LYFQLKSKDF VQVMRHRDDE RQKVCLDIIY KMMPKLKPVE LRELLNPWE 2460 

FVSHPSTTCR EQMYNILMWI HDNYRDPESE TDNDSQEIFK LAKDVLIQGL IDENPGLQLI 2520 

IRNFWSHETR LPSNTLDRLL ALNSLYSPKI EVHFLSLATN FLLEMTSMSP DYPNPMFEHP 2580 

LSECEFQEYT IDSDWRFRST VLTPMFVETQ ASQGTLQTRT QEGSLSARWP VAGQIRATQQ 2640 

QHDFTLTQTA DGRSSFDWLT GSSTDPLVDH TSPSSDSLLF AHKRSERLQR APLKSVGPDF 2700 

GKKRLGLPGD EVDNKVKGAA GRTDLLRLRR RFMRDQEKLS LMYARKGVAE QKREKEIKSE 2760 

LKMKQDAQW LYRSYRHGDL PDIQIKHSSL ITPLQAVAQR DPIIAKQLFS SLFSGILKEM 2820 

DKFKTLSEKN NITQKLLQDF NRFLNTTFSF FPPFVSCIQD ISCQHAALLS LDPAAVSAGC 2880 

LASLQQPVGI RLLEEALLRL IiPAELPAKRV RGKARLPPDV LRWVELAKLY RSIGEYDVLR 2940 

GIFTSEIGTK QITQSALLAE ARSDYSEAAK QYDEALNKQD WVDGEPTEAE KDFWELASLD 3000 

CYNHLAEWKS LEYCSTASID SENPPDLNKI WSEPFYQETY LPYMIRSKLK LLLQGEADQS 3060

LLTFIDKAMH GELQKAILEL HYSQELSLLY LLQDDVDRAK YYIQNGIQSF MQNYSSIDVL 3120 

LHQSRLTKLQ SVQALTEIQE FISFISKQGN LSSQVPLKRL IWTWTNRYPD AKMDPMNIWD 3180 

DIITNRCFFL SKIEEKLTPL PEDNSMNVDQ DGDPSDRMEV QEQEEDISSL IRSCKFSMKM 3240 

KMIDSARKQN NFSLAMKLLK ELHKESKTRD DWLVSWVQSY CRLSHCRSRS QGCSEQVLTV 3300 

LKTVSLLDEN NVSSYLSKNI LAFRDQNILL GTTYRIIANA LSSEPACLAE IEEDKARRIL 3360

ELSGSSSEDS EKVIAGLYQR AFQHLSEAVQ AAEEEAQPPS WSCGPAAGVI DAYMTLADFC 3420

DQQLRKEEEN ASVIDSAELQ AYPALVVEKM LKALKLNSNE ARLKFPRLLQ IIERYPEETL 3480

SLMTKEISSV PCWQFISWIS HMVALLDKDQ AVAVQHSVEE ITDNYPQAIV YPFIISSESY 3540 

SFKDTSTGHK NKEFVARIKS KLDQGGVIQD FINALDQLSN PELLFKDWSN DVRAELAKTP 3600 

VNKKNIEKMY ERMYAALGDP KAPGLGAFRR KFIQTFGKEF DKHFGKGGSK LLRMKLSDFN 3660

DITNMLLLKM NKDSKPPGNL KECSPWMSDF KVEFLRNELE IPGQYDGRGK PLPEYHVRIA 3720

GFDERVTVMA SLRRPKRIII RGHDEREHPF LVKGGEDLRQ DQRVEQLFQV MNGILAQDSA 3780 

CSQRALQLRT YSVVPMTSRL GLIEWLENTV TLKDLLLNTM SQEEKAAYLS DPRAPPCEYK 3840

DWLTKMSGKH DVGAYMLMYK GANRTETVTS FRKRESKVPA DLLKRAFVRM STSPEAFLAI 3900

RSHFASSHAL ICISHWILGI GDRHLNNFMV AMETGGVIGI DFGHAFGSAT QFLPVPELMP 3960

FRLTRQFINL MLPMKETGLM YSIMVHALRA FRSDPGLLTN TMDVFVKEPS FDWKNFEQKM 4020 

LKKGGSWIQE INVAEKKWYP RQKICYAKRK LAGANPAVIT CDELLLGHEK APAFRDYVAV 4080 
ARGSKDHNIR AQEPESGLSE ETQVKCLMDQ ATDPNILGRT WEGWEPWM 

Seq ID NO: 100 DNA sequence 
Nucleic Acid Accession #: NM_000673 
Coding sequence: 101-1225 

1 11 21 31 41 51

I I I I I I

ATGTGAAGGC ACAAGCTGCT GTTATATACA ACAGAGTGAA CTGAGCATCA GTCAGAAAAA 60 

GTCTATGTTT GCAGAAATAC AGATCCAAGA CAAAGACAGG ATGGGCACTG CTGGAAAAGT 120 

TATTAAATGC AAAGCAGCTG TGCTTTGGGA GCAGAAGCAA CCCTTCTCCA TTGAGGAAAT 180 

AGAAGTTGCC CCACCAAAGA CTAAAGAAGT TCGCATTAAG ATTTTGGCCA CAGGAATCTG 240 

TCGCACAGAT GACCATGTGA TAAAAGGAAC AATGGTGTCC AAGTTTCCAG TGATTGTGGG 300 

ACATGAGGCA ACTGGGATTG TAGAGAGCAT TGGAGAAGGA GTGACTACAG TGAAACCAGG 360 

TGACAAAGTC ATCCCTCTCT TTCTGCCACA ATGTAGAGAA TGCAATGCTT GTCGCAACCC 420 

AGATGGCAAC CTTTGCATTA GGAGCGATAT TACTGGTCGT GGAGTACTGG CTGATGGCAC 480 

CACCAGATTT ACATGCAAGG GCAAACCAGT ACACCACTTC ATGAACACCA GTACATTTAC 540 

CGAGTACACA GTGGTGGATG AATCTTCTGT TGCTAAGATT GATGATGCAG CTCCTCCTGA 600 

GAAAGTCTGT TTAATTGGCT GTGGGTTTTC CACTGGATAT GGCGCTGCTG TTAAAACTGG 660 

CAAGGTCAAA CCTGGTTCCA CTTGOGTCGT CTTTGGCCTG GGAGGAGTTG GCCTGTCAGT 720 

CATCATGGGC TGTAAGTCAG CTGGTGCATC TAGGATCATT GGGATTGACC TCAACAAAGA 780 

CAAATTTGAG AAGGCCATGG CTGTAGGTGC CACTGAGTGT ATCAGTCCCA AGGACTCTAC 840 

CAAACCCATC AGTGAGGTGC TGTCAGAAAT GACAGGCAAC AACGTGGGAT ACACCTTTGA 900 

AGTTATTGGG CATCTTGAAA CCATGATTGA TGCCCTGGCA TCCTGCCACA TGAACTATGG 960 

GACCAGCGTG GTTGTAGGAG TTCCTCCATC AGCCAAGATG CTCACCTATG ACCCGATGTT 1020 



GCTCTTCACT GGACGCACAT GGAAGGGATG TGTCTTTGGA GGTTTGAAAA GCAGAGATGA

TGTCCCAAAA CTAGTGACTG AGTTCCTGGC AAAGAAATTT GACCTGGACC AGTTGATAAC

TCATGTTTTA CCATTTAAAA AAATCAGTGA AGGATTTGAG CTGCTCAATT CAGGACAAAG

CATTCGAACG GTCCTGACGT TTTGAGATCC AAAGTGGCAG GAGGTCTGTG TTGTCATGGT

GAACTGGAGT TTCTCTTGTG AGAGTTCCCT CATCTGAAAT CATGTATCTG TCTCACAAAT

ACAAGCATAA GTAGAAGATT TGTTGAAGAC ATAGAACCCT TATAAAGAAT TATTAACCTT

TATAAACATT TAAAGTCTTG TGAGCACCTG GGAATTAGTA TAATAACAAT GTTAATATTT

TTGATTTACA TTTTGTAAGG CTATAATTGT ATCTTTTAAG AAAACATACA CTTGGATTTC

TATGTTGAAA TGGAGATTTT TAAGAGTTTT AACCAGCTGC TGCAGATATA TAACTCAAAA

CAGATATAGC GTATAAAGAT ATAGTAAATG CATCTCCCAG AGTAATATTC ACTTAACACA

TTGAAACTAT TATTTTTTAG ATTTGAATAT AAATGTATTT TTTAAACACT TGTTATGAGT

TAACTTGGAT TACATTTTGA AATCAGTTCA TTCCATGATG CATATTACTG GATTAGATTA

AGAAAGACAG AAAAGATTAA GGGACGGGCA CATTTTTCAA CGATTAAGAA TCATCATTAC

ATAACTTGGT GAAACTGAAA AAGTATATCA TATGGGTACA CAAGGCTATT TGCCAGCATA

TATTAATATT TTAGAAAATA TTCCTTTTGT AATACTGAAT ATAAACATAG AGCTAGAGTC

ATATTATCAT ACTTATCATA ATGTTCAATT TGATACAGTA GAATTGCAAG TCCCTAAGTC

CCTATTCACT GTGCTTAGTA GTGACTCCAT TTAATAAAAA GTGTTTTTAG TTTTTAACAA

CTAAACCG

Seq ID NO: 101 Protein sequence:
Protein Accession #: NP_000664

1 11 21 31 41 51

I I I I I I

MGTAGKVIKC KAAVLWEQKQ PFSIEEIEVA PPKTKEVRIK ILATGICRTD DHVIKGTMVS

KFPVIVGHEA TGIVESIGEG VTTVKPGDKV IPLFLPQCRE CNACRNPDGN LCIRSDITGR

GVLADGTTRF TCKGKPVHHF MNTSTFTEYT VVDESSVAKI DDAAPPEKVC LIGCGFSTGY

GAAVKTGKVK PGSTCVVFGL GGVGLSVIMG CKSAGASRII GIDLNKDKFE KAMAVGATEC

ISPKDSTKPI SEVLSEMTGN NVGYTFEVIG HLETMIDALA SCHMNYGTSV VVGVPPSAKM

LTYDPMLLFT GRTWKGCVFG GLKSRDDVPK LVTEFLAKKF DLDQLITHVL PFKKISEGFE

LLNSGQSIRT VLTF



Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession #: NMJ>06783.1 

Coding sequence: 1..786 



1 
I 

ATGGATTGGG 
GGGAAGGTGT 
CAGGAAGTGT 
AAAAATGTGT 
CTGATCTTCG 
GAAACCACTC 
ATTAAAAAGC 
TTTTTCCGAA 
TACCACCTGC 
TTTATTTCTA 
ATTTGCATGC 
AGATCAAAGA 
CAGAATGAAA 
AGCTAA 



11 

I 

GGACGCTGCA 
GGATCACAGT 
GGGGTGACGA 
GCTATGACCA 
TCTCCACCCC 
GCAAGTTCAG 
ACAAGGTTCG 
TCATCTTTGA 
CCTGGGTGTT 
GGCCAACAGA 
TGCTTAACGT 
GAGCACAGAC 
TGAATGAGCT 



21 
I 

CACTTTCATC 
CATCTTTATT 
GCAAGAGGAC 
CTTTTTCCCG 
AGCGCTGCTG 
GCGAGGAGAG 
GATAGAGGGG 
AGCAGCCTTT 
GAAATGTGGG 
GAAGACCGTG 
GGCAGAGTTG 
GCAAAAAAAT 
GATTTCAGAT 



31 
I 

GGGGGTGTCA 
TTCCGAGTCA 
TTCGTCTGCA 
GTGTCCCACA 
GTGGCCATGC 
AAGAGGAATG 
TCGCTGTGGT 
ATGTATGTGT 
ATTGACCCCT 
TTTACCATTT 
TGCTACCTGC 
CACCCCAATC 
AGTGGTCAAA 



41 
I 

ACAAACACTC 
TGATCCTAGT 
ACACACTGCA 
TCCGGCTGTG 
ATGTGGCCTA 
ATTTCAAAGA 
GGACGTACAC 
TTTACTTCCT 
GCCCCAACCT 
TTATGATTTC 
TGCTGAAAGT 
ATGCCCTAAA 
ATGCAATCAC 



GGACCTGGGA 
AAGGCACGAG 
ACTGGGOGTC 
GGCGACTOGG 
ACAAACACAG 
GAACCAGCAG 
ACAGCTGAGA 
ACCGGGGATA 
ACACTGTAAA 
AAACAACGCA 
CAGCTTTCAC 



11 
I 

AGGAGCATAG 
GGAACCTCAC 
TTCCCATCGG 
GTCCCTGAGG 
AACCACACAG 
CTGAAAGTCG 
TCCCAGTGCG 
AATCTGGATT 
ATGCCAGAAG 
AGCTGGTTTT 
CAAAAAAAAA 



21 

I 

GACAGGGCAA 
TGCGCATGCT 
CCCCTTCGCC 
TCTGGATTCT 
CCAGTCCCAG 
GGATCCTACA 
CGACATGGAA 
TGGGTTCCGG 
CAGGTGAAGA 
ATATTAGATA 
AAAAAA 



31 

) 

GGCGGGATAA 
CCTTTGGTGC 
AGTGTGGGGA 
TTCTCCGCTA 
GAGCCCAGTA 
CCTGGGCAGC 
GGTGATCTGC 
CGTCAAGGTG 
GCAACCAGAA 
TTTGACTTAA 



41 

I 

GGAGGGGCAC 
CCACCTCAGT 
ACGCGGCGGA 
CTGAGACAOG 
ATGGAGAGCC 
AGACAGAAGA 
AAGAGCTGCA 
AAGATAATAC 
GTTTAAATGA 
ACTATCTCAA 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



51 
I 

CACCAGCATC 
GGTGGCTGCC 
ACCGGGATGC 
GGCCCTCCAG 
CTACAGGCAC 
CATAGAGGAC 
CAGCAGCATC 
TTACAATGGG 
TGTTGACTGC 
TGCGTCTGTG 
GTGTTTTAGG 
GGAGAGTAAG 
AGGTTTCCCA 



Seq ID NO t 103 Protein sequence : 
Protein Accession #: NP_006774.1. 

1 11 21 31 41 51 

I I I I 1 ) 

MDWGTLHTPI GGVNKHSTSI GKVWITVIP1 FRVMILWAA QBVWGDEQED PVCNTI*QPGC 
KNVCYDHFFP VSHIRLWALQ LIPVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 
IKKHKVRIEG SLWWTYTSSI FPRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 
FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 
QNEMNELISD SGQNAITGFP S 

Seq ID NO: 104 DNA sequence 
Nucleic Acid Accession #: NM_020411 
Coding sequence: 86-526 



51 
I 

CACAGCCCTT 
GCGCATGTTC 
GCTGTGAGCC 
GCGGACACAC 
CCAAAAAGAA 
AGATCAGGAT 
TCAGTCAAAC 
CTAAAGAGGA 
AGACAAGCTG 
TAAAGTTTTG 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 105 Protein sequence:
Protein Accession #: NP_065144.1

1 11 21 31 41 51

I I I I I I

MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPBVW ILSPLLBHGO HTQTQNHTAS 60
PRSPVMESPK KKNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 120
SSVKVKIIPK EEHCKMPEAG EEQPQV

Seq ID NO: 106 DNA sequence
Nucleic Acid Accession #: J04129
Coding sequence: 99-587

1 11 21 31 41 51

I I I I I I

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 60
TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 120
AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 180
ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 240
CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300
AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 360
TGGCGAACGA GGCCAOGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 420
AGGACACCAC CACCCCCATC CAGAQCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 480
AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 540
GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCOGTGCCG TTTCTAGCTC ACCTCOGCCT 600
CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 660
TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGAOGTGGT CATCTGTGTC GCCATCCCCT 720
TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G

Seq ID NO: 107 Protein sequence:
Protein Accession #: AAA60147

1 11 21 31 41 51

I I I I I I

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 60
WBNNSCVEKK VLGEKTGNPK KPKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 120
YLARVLVEDD EIMQGPIRAF RPLPRHLWYL LDLKQMKKPC RF

Seq ID NO: 108 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 48-794

1 11 21 31 41 51

I I I I I I

TCCCAGGCAG CAGTTAGCCC GCCGCCCGCC TGTGTGTCCC CAGAGCCATG GAGAGAGCCA 60
GTCTGATCCA GAAGGCCAAG CTGGCAGAGC AGGCCGAACG CTATGAGGAC ATGGCAGCCT 120
TCATGAAAGG CGCCGTGGAG AAGGGCGAGG AGCTCTCCTG CGAAGAGCGA AACCTGCTCT 180
CAGTAGCCTA TAAGAACGTG GTGGGCGGCC AGAGGGCTGC CTGGAGGGTG CTGTCCAGTA 240
TTGAGCAGAA AAGCAACGAG GAGGGCTCGG AGGAGAAGGG GCCCGAGGTG CGTGAGTACC 300
GGGAGAAGGT GGAGACTGAG CTCCAGGGCG TGTGCGACAC CGTGCTGGGC CTGCTGGACA 360
GCCACCTCAT CAAGGAGGCC GGGGAOGCCG AGAGCCGGGT CTTCTACCTG AAGATGAAGG 420
GTGACTACTA CCGCTACCTG GCCGAGGTGG CCACCGGTGA CGACAAGAAG CGCATCATTG 480
ACTCAGCCCG GTCAGCCTAC CAGGAGGCCA TGGACATCAG CAAGAAGGAG ATGCCGCCCA 540
CCAACCCCAT CCGCCTGGGC CTGGCCCTGA ACTTTTCCGT CTTCCACTAC GAGATCGCCA 600
ACAGCCCCGA GGAGGCCATC TCTCTGGCCA AGACCACTTT CGACGAGGCC ATGGCTGATC 660
TGCACACCCT CAGCGAGGAC TCCTACAAAG ACAGCACCCT CATCATGCAG CTGCTGOGAG 720
ACAACCTGAC ACTGTGGACG GCCGACAAOG CCGGGGAAGA GGGGGGCGAG GCTCCCCAGG 780
AGCCCCAGAG CTGAGTGTTG CCCGCCACCG CCCCGCCCTG CCCCCTCCAG TCCCCCACCC 840
TGCCGAGAGG ACTAGTATGG GGTGGGAGGC CCCACCCTTC TCCCCTAGGC GCTGTTCTTG 900
CrCCAAAGGG CTCCGTGGAG AGGGACTGGC AGAGCTGAGG CCACCTGGGG CTGGGGATCC 960
CACTCTTCTT GCAGCTGTTG AGCGCACCTA ACCACTGGTC ATGCCCCCAC CCCTGCTCTC 1020
CGCACCCGCT TCCTCCCGAC CCCAGGACCA GGCTACTTCT CCCCTCCTCT TGCCTCCCTC 1080
CTGCCCCTGC TGCCTCTGAT CGTAGGAATT GAGGAGTGTC CCGCCTTGTG GCTGAGAACT 1140
GGACAGTGGC AGGGGCTGGA GATGGGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG 1200
CGCGCGCGCC AGTGCAAGAC GGAGATTGAG GGAAAGCATG TCTGCTGGGT GTGACCATGT 1260
TTCCTCTCAA TAAAGTTCCC CTGTGACACT C

Seq ID NO: 109 Protein sequence:
Protein Accession #: NP_006133.1

1 11 21 31 41 51

I I I I I I

MERASLIQKA KLAEQAERYE DMAAFMKGAV EKGEELSCEE RNLLSVAYKN WGGQRAAWR 60
VLSSIEQKSN EEGSBEKGPE VREYREKVET ELQGVCDTVL GLLDSHLIKE AGDAESRVPY 120
LKMKGDYYRY LAEVATGDDK KRIIDSARSA YQEAMDISKK EMPPTNPIRL GLAUJFSVFH 180
YEIANSPEEA ISLAKTTPDE AMADLHTLSE DSYKDSTLIM QLLRDNLTLW TADNAGEEGG 240
EAPOEPQS

Seq ID NO: 110 DNA sequence
Nucleic Acid Accession #: NM_000695
Coding sequence: 407-1564

1 11 21 31 41 51

I I I I I I

CACGAGTTGG TTTGGGAGCT GCCAGTCTCC TGGGAGGATC GCAGTCAGCA GAGCAGGGCT 60
GAGGCCTGGG GGTAGGAGCA GAGCCTGCGC ATCTGGAGGC AGCRTGTCCA AGAAAGGGAG 120
TGGAGGTGCA GCGAAGGACC CAGGGGCAGA GCCCACGCTG GGGATGGACC CCTTCGAGGA 180
CACACTGCGG CGGCTGCGTG AGGCCTTCAA CTGAGGGCGC ACGCGGCCGG COGAGTTCCG 240
GGCTGCGCAG CTCCAGGGCC TGGGCCACTT CCTTCAAGAA AACAAGCAGC TTCTGCGCGA 300




CGTGCTGGCC CAGGACCTGC ATAAGCCAGC TTTCGAGGCA 
TTGCCAGAAC GAGGTTGACT ACGCTCTCAA GAACCTTCAG 
ACGGTCCACG AACCTGTTCA TGAAGCTGGA CTCGGTCTTC 
CCTGGTCCTC ATCATCGCAC CCTGGAACTA CCCATTGAAC 
GGGCACCCTC CCCGCAGGGA ATTGOGTGGT GCTGAAGCCG 
AGAGAAGGTC CTGGCTGAGG TGCTGCCCCA GTACCTGGAC 
GCTGGGCGGA CCCCAGGAGA CAGGGCAGCT GCTAGAGCAC 
CACAGGGAGC CCTCGTGTGG GCAAGATTGT CATGACTGCT 
TGTCACCCTG GAGCTGGGGG GCAAGAACCC CTGCTACGTG 
GACOGTGGCC AACCGCGTGG CCTGGTTCTG CTACTTCAAT 
CCCTGACTAC GTCCTGTGCA GCCCCGAGAT GCAGGAGAGG 
CACCATCACC CGTTTCTATG GCGACGACCC CCAGAGCTCC 
CAACCAGAAA CAGTTCCAGC GGCTGCGGGC ATTGCTGGGC 
GGGCCAGAGC AACQAGAGCG ATCGCTACAT CGCCCCCACG 
GACGGAGCCT GTGATGCAGG AGGAGATCTT CGGGCCCATC 
GAGCGTGGAC GAGGCCATCA AGTTCATCAA CCGGCAGGAG 
CTTCTCCAAC AGCAGACAGG TTGTGAACCA GATGCTGGAG 
TGGAGGCAAT GAGGGCTTCA CCTACATATC TCTGCTGTCC 
CCACAOTGGG ATGGGCCGGT ACCACGGCAA GTTCACCTTC 
CACCTGCCTG CTCGCCCCCT CCGGCCTGGA GAAATTAAAG 
TACCGACTGG AACCAGCAGC TGTTACGCTG GGGCATGGGC 
GTGAGCGTCC CACCCGCCTC CAACGGGTCA CACAGAGAAA 
GCTTATGCTC CCAACTCACA TTGTTCCTCC AGACCGCAGG 
TGGAGCTGTC ACATGACTGC ATCCTGCCTG CCAGGGCTGC 
TCTGGGGGAC GCTGCTCGAG AGAGGCCGAG AGGCCGCAGA 
CACCCCACCC TCCCCAATTC CAGCCCTTTG CCCTCTOGGT 
CACAGGGGCA GTGTCACCCT GGAAAATACA GTGCCCTGCC 
GAACGGTTGA GAGCGTGGAG CCCTCCAGGC CTTTGCTCTC 
TTCCACCTCT GCCCCATCCC AACTGCACCA GCACTGCCTC 
CCCACACTGG TCTCTGCACC ACCCCTCTGG TTCACACCGC 
AGCTCCATCC ACTGGGAAAA CTGGGGTTTG CATCACTCCA 
CTGGGGGCAA GTCCCTTGAC TTCTCTGAGC CTCAGTTTCC 
CCAAAATGGA GTCACTTATG CCAAACTCTA ATAAAATGGA 
CCCTCACACA CACATGCCCG TAACAGGATT TATCACCAAG 
AGACACAGGG CGTATGGAAA AGCACGTCCT CAAAGACTGT 
GATGCTTACC TACCACGGCC GTCTCCACCA GAAAACCATC 
TGTGACTTAC AAACCTTGTT TAAAAGCTGC TTACATGGAC 
CCCTTGGCTG TGGCCCTCTG TGTATGCCTG GGATCCTTCC 
GGAATCCTCT GCTCCTCCCA AATAAATTCA TCTGTTC 



Seq ID NO: 111 Protein sequence:
Protein Accession #: NP_000686






GACATATCTG 
GCCTGGATGA 
ATCTGGAAGG 
CTGACCCTGG 
TCAGAAATCA 
CAGAGCTGCT 
AAGTTGGACT 
GCCACCAAGC 
GACGACAACT 
GCCGGCCAGA 
CTGCTGCCCG 
CCAAACCTGG 
TGCGGCCGCG 
GTGCTGGTGG 
CTGCCCATCG 
AAGCCCCTGG 
CGGACCAGCA 
GTGCCATTCG 
GACACCTTCT 
GAGATCCGCT 
TCCCAGAGCT 
CCTGAGTCTA 
CTCCCCCAGC 
AAAGCAAGGT 
ACATGCCAGG 
CAGGGTTGGC 
TTCTTAGGGG 
CCCTCTAGGC 
CCCCAGGGAT 
ACCCTGCACT 
CTGCACAGTG 
TTATGTGAAA 
GTOGGGGGGG 
ACACGCCTGC 
AGTATTCCAG 
GCCAACTCCT 
TTCTGTCCTT 
AAGCACTCAT 



AGCTCATCCT 
AGGATGAACC 
AACCCTTTGG 
TGCTCCTGGT 
GCCAGGGCAC 
TTGCCGTGGT 
ACATCTTCTT 
ACCTGACGCC 
GCGACCCCCA 
CCTGCGTGGC 
CCCTGCAGAG 
GCCGCATCAT 
TGGCCATTGG 
ACGTGCAGGA 
TGAACGTGCA 
CCCTGTACGC 
GCGGCAGCTT 
GGGGAGTCGG 
CCCACCACCG 
ACCCACCCTA 
GCACCCTCCT 
GCCATGAGGG 
CTCAGGTTGC 
CTTGCTTCTA 
TGTCCTCACT 
CAGGCCCAGT 
CATCAGCCCT 
ACACGCGCAC 
CCTCTCACAT 
CACCCACAGC 
TTAGTGGGAC 
GTTGCTGGAA 
CACATAGAAG 
ATGTAAGACC 
ATGAGCTGCA 
GCGATCAGCT 
TAAAACGTTC 
AGCCCAGATA 



MKDEPRSTNL 
ISQGTEKVLA
KHLTPVTLEL 
PALQSTITRP 
VDVQETEPVM 



RYPPYTDWNQ 



11 
I 

FMKLDSVFIW 
EVLPQYLDQS 
GGKNPCYVDD 
YGDDPQSSPN 
QEEIFGPILP 
FTYISLLSVP 
QLLRWGMGSQ 



21 

I 

KEPFGLVLII 
CPAWLGGPQ 
NCDPQTVANR 
LGRIINQKQF 
IVNVQSVDEA 
PGGVGHSGMG 
SCTLL 



31 
I 

APWNYPLNLT 
ETGQLLEHKL
VAWFCYFNAG 
QRLRALLGCG
IKPINRQEKP 
RYHGKFTPDT 



41 

I 

LVLLVGTLPA 
DYIFFTGSPR 
QTCVAPDYVL 
RVAIGGQSNE 
LALYAPSNSR 
FSHHRTCLLA



51 



GNCWLKPSE 
VGKIVMTAAT 



Seq ID NO: 112 DNA sequence 
Nucleic Acid Accession #: NM_004456
Coding sequence: 58-2298 



1 
I 

GAATTCCGGG 
GGCCAGACTG 
GAGTACATGC 
TTTAGTTCCA 
CAGCGAAGGA 
GAGTGTTCGG 
AATGCAGTTG 
GTGGAAGATG 
GATGGTACTT 
GAATGTGGGT 
AATGATGATG 
GATCTGGAGG 
AAAATTTTGG 
GAAAAATA7A 
CCCAACATAG 
CATACGCTTT 
ACACCCAACA 
CCACAGTGTT 
CGGATAAAGA 
AGTAGCAGGC
AGGGAAGCAG 
GATGAAACTT 
CCAAATATTG 
GTCCTCATTG 
ACATGTAGAC 
GCTGAGGATG 
CACTGCAGAA 



11 

I. 

CGACGCGCGG 
GGAAGAAATC 
GACTGAGACA 
ATCGTCAGAA 
TACAGCCTGT 
TGACCAGTGA 
CTTCAGTACC 
AAACTGTTTT 
TCATTGAAGA 
TTATAAATGA 
ACGATGATGA 
ATCACCGAGA 
AGGCCATTTC 
AAGAACTCAC 
ATGGACCAAA 
TCTGTAGGCG 
CTTATAAGCG 
ACCAGCATTT 
CCCCACCAAA 
CCAGCACCCC 
GGACTGAAAC 
CGAGCTCCTC 
AACCTCCTGA 
GCACTTACTA 
AGGTGTATGA 
TGGATACTCC 
AGATACAGCT 



21 
I 

GAACAACGOG 
TGAGAAGGGA 
GCTCAAGAGG 
AATTTTGGAA 
GCACATCCTG 
CTTGGATTTT 
CATAATGTAT 
ACATAACATT 
ACTAATAAAA 
TGAAATTTTT 
TGATGGAGAC 
TGATAAAGAA 
CTCAATGTTT 
CGAACAGCAG 
TGCTAAATCT 
ATGTTTTAAA 
GAAGAACACA 
GGAGGGAGCA 
AOGTCCAGGA 
CACCATTAAT 
GGGGGGAGAG 
TGAAGCAAAT 
GAATGTGGAG 
TGACAATTTC 
GTTTAGAGTC 
TCCAAGGAAA 
GAAAAAGGAC 




360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 



SDRYIAPTVL 
QWNQMLERT 
PSGLEKLKEI 



51 
I 

AATAATCATG 
TGTAAAATCA 
AAAGAGTATG 
AGAATGGAAA 
CGGGACTAGG 
AAAGACTCTG 
GAATTTTATG 
TTTAGATCAG 
CGGGGATAGA 
TGGTCAATAT 
AAAGCAGAAA 
TCCTTCTGAT 
AGAACTAAAG 
TGAATGTACC 
ACACTCCTTT 
TTTTCATGCA 
ACCTTGTGGA 
CACCGCTGAG 
TCCCAATAAC 
AGACAGTGAT 
AGAGAAGAAA 
AAAGATGAAG 
AATGTTTAGA 
TGGGACCAAA 
TCCAGCTCCC 
GTGGGCTGCA 
CAACTATCAA 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 




CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT 
TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGTCAAA ACCGCTTTCC 
TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG 
CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTOGG ACAGTAAAAA 
AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC 
GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT 
TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA 
ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGQ TGGATGCAAC 
AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA 
GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC 
CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG 
GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA 
CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT 
AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA 
GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA 
TACATTTTTC AACTTTGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA 



Seq ID NO: 113 Protein sequence: 
Protein Accession #: NP_004447






MGQTGKKSEK 
KQRRIQPVHI 
MVEDETVLHN 
YNDDDDDDDG 
KEKYKELTEQ 
ATPNTYKRKN 
NSSRPSTPTI 
KPNIEPPENV 
PAEDVDTPPR 
NFCEKPCQCS 
CKNCSIQRGS 
YMCSPLFNLN 
ELFVDYRYSQ 



11 

I 

GPVCWRKRVK 
LTSVSSLRGT 
IPYMGDEVLD 
DDPEEREEKQ 
QLPGALPPEC 
TETALDNKPC 
NVLESKDTDS 
EWSGAEASMP 
KKKRKHRLWA 
SECQNRFPGC 
KKHLLLAPSD 
NDPWDATRK 
ADALKYVGIB



21 
I 

SEYMRLRQLK
RECSVTSDLD 
QDGTPIEELI 



TPNIDGPNAK 
GPQCYQHLEG 
DREAGTETGG 
RVLIGTYYDN 
AHCRKIQLKR 
RCKAQCNTKQ 
VAGWGIFIKD 
GNKIRFANHS 
REMEIP 



31 

I 

RPRRADEVKS 
FPTQVIPLKT 
KNYDGKVHGD 
ESRPPRKFPS 
SVQREQSLHS 
AKEFAAALTA 
ENNDKEEEEK 
FCAIARLIGT
DGSSNHVYNY 
CPCYLAVREC 
PVQKNEFISE 
VNPNCYAKVM 



41 



MFSSNRQKIL 
LNAVASVPIM 



Seq ID NO: 114 DNA sequence 
Nucleic Acid Accession #: NM_001827
Coding sequence: 96-335 



i . 

AGTCTCCGGC 

CGGACAAGTA 
CCAAACAAGT 
AACAGAGTCT 
TTAGACGACC 
TTTTCAAATT 
ACAAATCTTT 
AAATGCAACT 
TTTCTCTTAA 
TATGTTGCAT 



11 

1 

GAGTTGTTGC 
TCATTTTCTG 
CTTCGACGAA 
ACCTAAAACT 
AGGCTGGGTT 
TCTTCCAAAA 
TAATGTATAT 
CATCCATACC 
GCAAGTAGGT 
GTGCCTGTTT 
TTAAAAAAAA 



21 
I 

CTGGGCTGGA 
CAGCGCGCCA 
CACTACGAGT 
CATCTGATGT 
CATTACATGA 
GATCAACAAA 
GTGTATATAA 
TGTGCATGAG 
TACTGTAAGA 
GAGTTTACTG 



31 
I 

CGTGGTTTTG 
CGAGGATGGC 
ACCGGCATGT 
CTGAAGAGGA 
TTCATGAGCC 
AATGAAGTTT 
GGTAGTATTC 
CTGTATTCTT 
TGTTTAAGAT 
AAACAGTTTA 



DKILEAISSM 
FHTLPCRRCF 
ERIKTPPKRP 
KDETSSSSEA 
KTCRQVYEFR 
QPCDHPRQPC 
DPDLCLTCGA 
YCGEIISQDE 
MVNGDHRIGI 



41 
I 

TCTGCTGCGC 
CCACAAGCAG 
TATGTTACCC 
GTGGAGGAGA 
AGAACCACAT 
ATCTGGGGAT 
AGTGAATACT 
CACAGCAACA 
AAAAGTTCTT 
CTTTTGTTCA 



AGCACAAAAT 
GGGATGCCGC 
AGAGTGTGAC 
TGTGTCCTGC 
ATCTGACGTG 
CTCAGAATAC 
TGATAAATAC 
CCGCAAGGGT 
AGTTATGATG 
TGGCGAAGAG 
CATCGAAAGA 
GCTGCCTTAG 
TGAAATTCTG 
ACTTTTTATT 
TGCAGTATGG 
AAAAAA 



51 
I 

ERTEILNQEW 
YSWSPLQQNF 
FVELVNALGQ 
FPDKGTAEEL 
KYDCPLHPFH 
GGRRRGRLPN 
NSRCQTPIKM 
VKESSIIAPA 
DSSCPCVIAQ 
ADHWDSKNVS 
ADRRGKVYDK 
FAKRAIQTGE 



51 
1 

CCGCTCTTCG 
ATCTACTACT 
AGAGAACTTT 
CTTGGTGTCC 
ATTCTTCTCT 
OGTCAAATCT 
TGAGAAATGT 
GAGCTCAGTT 
CCAGTCAGTT 
ATAAAGTTTG 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ID NO: 115 Protein sequence: 
Protein Accession #: NP_001818



1 11 21 31 41 51

I I I I I I

MAHKQIYYSD KYFDEHYEYR HVMLPRELSK QVPKTHLMSE EEWRRLGVQQ SLGWVHYMIH 
EPEPHILLFR RPLPKDQQK

Seq ID NO: 116 DNA sequence

Nucleic Acid Accession #: CAT cluster 



TCAGACCTCA 
GCATCTGGAC 
AGAGGTGTGT 
GCAGCCAACA 
AAGGACTGAT 
CCTTGGAAGG 
TCAAGAATTC 
AGTGTGGGAG 
CCCATCTCTA 
CCTGTAGTTC 
AGGCTGCAAT 
CCTGTCTCAA 
TTTGAGGTGC 
CCTGAAGGAG 
GACAGACCTT 
CTCTTCCCCC 
CCGCCTCCCA 
CAGCCGGATA 
TCCTGCTCCG 



11 
I 

TGAGTCACTT 
CCTTGGTGCT 
TCCAGGGAAA 
GAGTTCAAAA 
CCACATTCCC 
ACCTGGCTCA 
TTTGCTGAGC 
GATCTCTTGA 
AAATAATAAT 
CAGCTACCCA 
GAACTGTGAT 
ATAATAATAA 
CATTTGGGTA 
CAGAGGGATG 
GTCCTTCTTC 
TCCCTGTCCC 
TGTCTGCTGT 
CAGAGTGAAT 
TGTAAAGAGG 



21 
1 

GGACTCTTGA 
ATCGACGAAG 
GCCCCTATCT 
TGCAGGCTTG 
ACCAGGAAGT 
GGCTGGACCA 
ATGGTGCCTC 
GCCCAGGAGT 
AATAATAAAA 
GGAGGCTGAG 
TACCCCACTG 
TAATAATAAT 
GAAAGAAAAG 
CATCGCTGGA 
CTTGTGGAAA 
AGGGAACCAA 
GCCTTTGTAC 
AGTTAACCAC 
CCAGTGTTTG 



31 
I 

GCCACCTCTG 
CTTGGGTGGG 
CTCTGCAGAG 
GAAAGTACAG 
TTAGCAGAAC 
CCTCTTGAGA 
ATGCCTATAA 
TCAAGACTAG 
TAAAAAATTA 
GCAAGAGGAT 
CACTCCAGCC 
CTTATTTTGG 
ACGTTTACAC 
GGTGACCTAC 
GTGTTTCCTC 
AGGGCTTTCT 
TCAGCAATTC 
ACTTAGGTCA 
TGTGTTGCAA 



41 
I 

GGGGTGGAGT 
GCTCTTAGCT 
GTCAAGTGAA 
GGGGCTCTGT 
CCCCGCGTGC 
GGGAGGAGCT 
TACCAACACT 
CCTGGGCAAC 
GCAGGGCATG 
GGCTGGAGCC 
TGGGCAAAAG 
AGAATAAAGA 
CGAGAAATAG 
AGTTGAAGAA 
TGCTGCTACT 
ACCACACCCT 
TTGTTTGCTC 
AATAGGATCT 
GCAGCCTTGG 



51 
I 

CTCTCTCCTG 
GCTATGTGCA 
AGCGACGGCC 
GGAGGATGGG 
CAACTGGACC 
CTGGATTTGA 
TTGGGAGGCC 
ACAGAGAGAA 
GTGGCATGTG 
TGGGATGTTG 
AGCGAGAGAA 
GACCTCTGGA 
TCTGTGTTGC 
GACTCATTAT 
GCTCATGAGA 
TTCTTGCCCC 
CATTATCTTC 
AAATTTTTGT 
AATAGTAACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 




CTTCTCATTT GTTTGGGATC TGGCCACCAA GTTCCAGAAT 
AGTTCATCAG GCTCTCGGAC CTTAGGGCTG TTGGAGAAGG 
GTGAAGGCTC GTGTTCTCCA TCCTCAACTT TCTTTGCTTC 
TTGGAAGGGC AAAAAATGAA CACTGTCGTT CATTGCAGCC 
ACAGTCTGCT GTGAAGACCT TCTCTCAAGT GGCATTTGGG 
GCTTCATGAG AGACTGACAG CTATCAGGGG TTGTGGCACT 
CAGTGTGTGC TGATGACACA TACACACCTG ACAATAGCTT 
ACTCTGTAGC CAACATACAC ATGATTTAAA ACCCTTTCTA 
CTTGTCCAAA TGCAGAGTCA GAGCTATTTG TACTTCATTA 
TGGCTTTCTT TTTGCAAAAA TAATTAAAGT TTTTGTATGT 
AAACAAAAAA 

Seq ID NO: 117 DNA sequence

Nucleic Acid Accession #: BC012178.1

Coding sequence: 204-2285



GATACACGGA 
CTTCAGCAGC 
GATCATACAC 
GTGTTTTGTG 
AGTCCATGCC 
TAGTGAGGAC 
GAGTCTTCTC 
AATATCTATC 
TTATTTCCAA 
TGCAAAAAAA 



TCAGTGCAGA 
AGAACTGATG 
AAGAATACAT 
ACACAGAT6C 
AGATCATGGT 
TCTCCTCCCC 
TGTTCCTTTT 
ATGGTTCATC 
GGCGAATAGT 
AAAAAAAAAA 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 






i 
I 

CTTCTCTCCC 
TGGCGGCTGG 
GCCCGCGGCG 
CCGCGGCTCC 
CTGGAGGAGA 
CTGGTGCTCA 
AAATTTTCCC 
TCATCTCTGG 
TATTCACTAT 
TATTTGGAGG 
TGGATAATAC 
ATGGAGATAG 
TAGTAGCAGG 
TTGGCCTTAC 
GCAGTGGAAC 
GAGTAGGCAC 
CAGCTTTGCT 
GCTTTATGAG 
AGGTCAAAGT 
CAGATGAAGA 
CTGAAGAGAA 
GAGAAATGAA 
TAATTGAAAG 
ATGACACAGA 
ATTTTCATAA 
TTTCCAGGCA 
CTTATATTTG 
CTGCAAGTGT 
AGGATCAGGA 
CAATTAAAAC 
CCAGTAAAGA 
TGTGTCACAA 
CAGATGTTAC 
TTGAGGCCCA 
TGATTTTGAC 
GATCTGTGGT 
CTGGCAATGA 
CTGGTATTTC 
AGTAATAAAC 



11 

I 

GCGGCGCTGG 
CTCCTCTCCG 
CCGACCCTTC 
GGCCCTGGCC 
CCTTAAGGAT 
GTACGGGAAA 
CTTGGAAACA 
AGGACCTAAT 
TGGCAAGCCT 
TACTGTGCAC 
ATGTTCATTA 
TGTAGACAAA 
CATAGCAAAT 
AGAAAATGGA 
CTTCACCGTG 
GTCAAAAGTT 
AAATCGTGCT 
AAAACGAGAA 
GATAAATGCT 
TAGAACCCCA 
AAGAAAAATC 
CTTGAAACCA 
TGCATCCCTT 
GCTCATCAGA 
AGATGAAGTG 
TCCATTTCCA 
TAAGGACTTT 
TAAAAAGCCA 
GAAGCTGATG 
TGTAGGTGTG 
TGAACCTGAC 
CGTTAACAGA 
TCCCACTTTC 
TAACATTCTC 
ACCATTACAT 
TATTCGAACC 
GATCCCTGTA 
TCGAATTATG 
TTCTTGTTCT 



21 
1 

GGCCCGCGCT 
CTGCCGGCTG 
CGGCACCCTC 
CCGATGGCTC 
GGCCACCACC 
GTCATAGACC 
CCAGCATTTG 
TCTGTGTATG 
GTTCTTGGAA 
AAAAAAAGTG 
TTCAGGGGCC 
GTAGCTGATG 
GAATCTAAAA 
AAAGTAATAC 
CAGAACAGAG 
TTGGTTTTAC 
TTGAACCAAG 
AGCCAGTCTG 
GCTCATTCTT 
CGGAAAAGAA 
ATTGGGGATA 
GAGGAGGTTT 
GTTGCAAGTG 
AAGTTGAGAG 
AGAATTTTGG 
GGTCCTGGCC 
CCTGAAACCA 
CATACCCTAT 
CAAATTACCA 
CAGGGTGACT 
TGGGAATCAC 
GTTGTTTATA 
TTGACAACAG 
AGQGAGTCTG 
TTTGATCGGG 
TTTATTACTA 
GAGGTGGTAT 
TATGACTTAA 
ATTAAAA 



31 
I 

CCGCTGCTGT 
CTCCTCGACC 
CCGCCCCGTC 
TGTGCAACGG 
ACTATGAAGG 
GAAGAGTGAG 
CTATAAAGGA 
CTGAAGATGC 
TTTGCTATGG 
TCAGAGAAGA 
TTCAGAAGGA 
GATTCAAGGT 
AGTTATATGG 
TGAAGAATTT 
AACTTGAGTG 
TCAGTGGTGG 
AACAAGTCAT 
TTGAAGAGGC 
TCTACAATGG 
TTAGCAAAAC 
CTTTTGTTAA 
TCCTTGCCCA 
GCAAAGCTGA 
AGGAGGGAAA 
GCAGAGAACT 
TGGCAATCAG 
ACAATATTTT 
TACAGAGAGT 
GTCTGCATTC 
GTCGTTCCTA 
TTATTTTTCT 
TATTTGGCCC 
GGGTGCTCAG 
GGTATGCTGG 
ACCCACTTCA 
GTGACTTCAT 
TAAAGATGGT 
CATCAAAGCC 



41 
I 

TGCTCCATTC 
AGGCCTCCTT 
TCGTACTGTC 
AGACTCCAAG 
AGCTGTTGTC 
GGAACTGTTC 
ACAAGGATTC 
TCCCTGGTTT 
TATGCAGATG 
TGGAGTTTTC 
AGAAGTTGTT 
TGTGGCACGT 
AGCACAGTTC 
CCTTTATGAT 
TATTCGAGAG 
AGTAGACTCA 
TGCTGTGCAC 
CCTCAAAAAG 
AACAACAACC 
GTTAAATATG 
GATTGCCAAT 
AGGTACTTTA 
ACTCATCAAA 
AGTAATAGAA 
TGGACTTCCA 
AGTAATATGT 
GAAAATAGTA 
CAAAGCCTGC 
ACTGAATGCC 
CAGTTACGTG 
GGCTAGGCTT 
ACCAGTTAAA 
TACTTTACGC 
GAAAATCAGC 
AAAGCAGCCT 
GACTGGTATA 
CACTGAGATT 
CCCAGGAACT 



51 
I 

GGCGCTTTTC 
CTCAACCTCA 
GCCGTCACCG 
CTGGAGAATG 
ATTCTGGATG 
GTGCAGTCTG 
CGTGCTATTA 
GATCCAGCAA 
ATGAATAAGG 
AACATTAGTG 
TTGCTTACAC 
TCTGGAAACA 
CACCCTGAAG 
ATAGCTGGAT 
ATCAAAGAGA 
ACAGTTTGTA 
ATTGATAATG 
CTTGGAATTC 
CTACCAATAT 
ACCACAAGTC 
GAAGTAATTG 
CGGCCTGATC 
ACCCATCACA 
CCTCTGAAAG 
GAAGAGTTAG 
GCTGAAGAAC 
GCTGATTTTT 
ACAACAGAAG 
TTCTTGCTGC 
TGTGGAATCT 
ATACCTCGCA 
GAACCTCCTA 
CAAGCTGATT 
CAGATGCCGG 
TCATGCCAGA 
CCTGCAACAC 
AAGAAGATTC 
ACTGAGTGGG 



Seq ID NO: 118 Protein sequence: 
Protein Accession #: AAH12178.1 



1 
I 

MALCNGDSKL 
APAIKEQGFR 
KSVREDGVFN 
SKKLYGAQPH 
VLLSGGVDST 
HSFYNGTTTL 
EVFLAQGTLR 
ILGRELGLPE 
TLLQRVKACT 
ESLIFLARLI 
ESGYAGKISQ 
WLKMVTEIK 



11 
I 

ENAGGDLKDG 
AIIISGGPNS 
ISVDNTCSLP 
PEVGLTENGK 
VCTALLNRAL 
PISDEDRTPR 
PDLIESASLV 
ELVSRHPFPG 



PRMCHNVNRV 
MPVILTPLHF 
KIPGISRIMY 



21 
I 

HHHYEGAWI 
VYAEDAPWFD 
RGLQKEEWL 
VILKNFLYDI 
NQBQVIAVHI 
KRISKTLNMT 
ASGKAELIKT 
PGLAIRVICA 
ITSLHSUNAP 
VYIFGPPVKE 
DRDPLQKQPS 
DLTSKPPGTT 



31 

I 

LDAGAQYGKV 
PAIFTIGKPV 
LTKGDSVDKV 
AGCSGTFTVQ 
DNGFMRKRES 
TSPEKKRKII 
HHNDTEL1RK 
EEPYICKDFP 
LliPIKTVGVQ 
PPTDVTPTFL 
CQRSWIRTF 
EWE 



41 
I 

IDRRVRKLPV 
LGICYGMQMM 
ADGFKWARS 
NRELECIREI 
QSVEEAIiKKL 
GDTFVKIANE 
LREEGXVIEP 
ETNNILKIVA 
GDCRSYSYVC 
TTGVLSTLRQ 
ITSDFMTGIP 



51 
I 

QSEIFPLETP 
NKVPGGTVHK 
GNIVAGIANE 
KERVGTSKVL 
GIQVKVINAA 
VIGEMNLKPB 
LKDPHKDEVR 
DPSASVKKPH 
GISSKDEPDW 
ADFEAHNIIiR 
ATPGNEIPVE 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: NM_006500.1

Coding sequence: 27..1967

1 11 21 31 41 51 

I I I I I I 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 
TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 
CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 
AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 



60 
120 
180 
240 




TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC
TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA 
GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA 
TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA 
TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA 
CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT 
GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG 
GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG 
GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT 
AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT 
TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA 
CCCCTGAGAG AGAGGAAGGC AGCAGCCTCA CCCTGACCTG 
ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT 
TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA 
CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA 
GTGAAGOGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA 
AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG 
TGCCGGAGCC GGAGAGCCGG GGOGTGGTCA TCGTGGCTGT 
TGGOGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC OGTCTCGTAA 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TOGATCTGAG 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC 
GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC 
GTCCACCACC ATCTCCTCCA OGTTGAGTGA AGCTCATCCC 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT 
TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA 
TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT 
CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC 
GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC 
AGAATGGTAC TTAGGGATGG AAAAOGGGGC CTGGCTAGAG 
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT 
TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA 
AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA 
CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA 



Seq ID NO: 120 Protein sequence: 
Protein Accession #: NP_006491.1



TGGGGAGTAC 
AGTCACCCCC 
GTACCGCATC 
CCTGGGCATC 
OGGGTACCCC 
GAACOGGGTG 
GAGTATTCTG 
CAACTACCGG 
TTTCTACCOG 
GGACCGOGTG 
CAAGCAGAAC 
GGTGCTGGAG
GGACACCATG 
CGTCOGAGTG 
TGAGGCAGAG 
GCTGGAAAGG 
TCGCTGCGTG 
GGCCATTTTT 
TATGGTGTTG 
CGTCAACGGC 
TGTCCTOGTG 
GGGCAAAAAC 
CTCCAACACA 
CACCTCCACA 
GATTGTGTGC 
GGGCAAGCTG 
GACCGAACTT 
GGGCAGCAGC 
GCATTAGCCC 
CACTCTTCTC 
CTGCACACCC 
AAGCCGCTTT 
AAGCAAGGAG 
TTTACACACA 
TCTGAGCTGG 
AGGACACACC 
AGAGCACCCC 
CCTCTTCAAA 
CCTTAAAAGA 
GGGAGGCCGA 
AACCCTGTCT 
AGTCCCAGCT 
GCAGTGAGCC 
CGAGGAAAAA 
TTCAGGTGAA 
ACGTAAAACT 
GGATTAAAGC 
CCTAGAAGGG 
CTTCGGGTGT 
CAGGTGTGTA 
TATATATATA 
TTCTACATGG 
AACCGTTTCC 
AGCTCTACCA 
GCACGAAGGG 
CTT 



GAGCAGCGGC 
CAAGACGAGC 
CAGCTCCGCG 
CCTGTGAACA 
ATTCCTCAAG 
CACATTCAGT 
AAGGCACAGC 
CTGCCCAGTG 
ACAGAAAAAG 
GAAATCAGGT 
CCCAGCACCA 
CCTGCCCGGA 
ATATCGCTGC 
AGTCCCGCAG 
AGTAGCCAGG 
GGGCCTGTGC 
GCGTCTGTGC 
GGCCCCCCTT 
AATCTGTCTT 
ACGGCAAGTG 
ACCCCGGAGC 
ACCAGCATCC 
ACCACTGGCC 
GAGAGAAAGC 
ATCCTGGTCC 
CCGTGCAGGC 
GTAGTTGAAG 
GGTGACAAGA 
CGAATCACTT 
TCAGCCAAAG 
CCTTTCAGAG 
TCAGGGACCA 
CCCCAGTCTC 
TTATGGCTGT 
TTTCCTGCCC 
GGAGCCAGGC 
AGCGGCATCC 
GTCTCCTGTG 
TACGTGCCGG 
GGCGGGCGGA 
CTACTAAAAA 
ACTCGGAAGG 
GAGACGGTGC 
AAAAGAAAAG 
TTAGCCTCAA 
GAAAGGCAGC 
TATGGTTATA 
CCCAAATGAG 
GTGTGTCTGT 
AATTTGCAAA 
TATGAAAAAT 
GTACCACAGG 
AGTTGGCAGC 
GAGCAGACAG 
CCTGGCAGGC 



MGLPRLVCAP 
DWFSVHKEKR 
PRSQEYRIQL 
LKEEKNRVHI 
VTVPVPYPTE 
DNGVLVLBPA 
LTLTCEASSS 
QLVKLAIFGP 
LSTLNVLVTP 
TRANSTSTER 
PPSRKTELW 



11 
I 

LLAACCCCPR 
TLIPRVRQGQ 
RVYKAPBBPN 
QSSQTVBSSG 
KVWLEVBPVG 



QDLEPQWLRE 
PWMAFKERKV 
ELLBTGVECT 
KLPBPESRGV 
EVKSDKLPEE 



21 
I 

VAGVPGEAEQ 
GQSEPGEYEQ 
IQVNPLGIPV
LYTLQSILKA 
MLKEGDRVEI 
QAWNLDTMIS 
ETDQVLERGP 
WVKENMVLNL 
ASNDLGKNTS 
VIVAVIVCIL 
MGLLQGSSGD 



31 
I 

PAPELVEVEV 
RLSLQDRGAT 
NSKEPEEVAT 
QLVKEDKDAQ 
RCLADGNPPP 
LLSEPQELLV 
VLQLHDLKRB 
SCEASGHPRP 
ILFLELVNLT 
VLAVLGAVLY 
KRAPGDQGEK 



41 

I 

GSTALLKCGL 
LALTQVTPQD 
CVGRNGYPIP 
FYCELNYRLP 
HPSISKQNPS 
NYVSDVRVSP 
AGGGYRCVAS 
TISWNVHGTA 
TLTPDSNTTT 
FLYKKGKLPC 
YIDLRH 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580
2640 
2700 
2760 
2820
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480
3540 






51 
I 

SQSQGNLSHV 
ERIFLCQGKR 
QVIWYKNGRP 



TREAEEETTN 
AAPERQEGSS 
VPSIPGLNRT 
SEQDQDPQRV 
GLSTSTASPH 
RRSGKQEITL 



60 
120 
180 
240 
300 
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420 
480 
540 
Seq ID NO: 121 DNA sequence
Nucleic Acid Accession #: NM_018306
Coding sequence: 60-671

1 11 21 31 41 51 

I I I I I I

ATAGTCTACA CAGAGCTCCC CTTGCTGCCC AGACAAGCTG AAGGACCACA GGAAAAGCCA 60
TGGAGACTTC AGCATCCTCC TCCCAGCCTC AGGACAACAG TCAAGTCCAC AGAGAAACAG 120
AAGATGTAGA CTATGGAGAG ACAGATTTCC ACAAGCAAGA CGGGAAGGCT GGACTCTTTT 180
CCCAAGAACA ATATGAGAGA AACAAGTCTT CTTCCTCCTC CTTCTCTTCC TCCTCATCCT 240
CCTCATCTTC TTCATCCTCC TCCTCCTCAG GTCCTGGGCA TGGGGAGCCT GACGTTTTGA 300
AGGATGAGCT TCAACTCTAT GGAGATGCTC CTGGAGAGGT GGTACCCTCT GGGGAATCAG 360
GACTCCGAAG GAGAGGCTCT GACCCAGCAA GTGGAGAAGT GGAGGCCTCT CAGTTAAGAA 420
GACTGAATAT AAAGAAAGAT GATGAGTTTT TCCATTTCGT CCTCCTGTGC TTTGCCATOG 480
GGGCCTTGCT GGTGTGTTAT CACTATTACG CAGACTGGTT CATGTCTCTT GGGGTOGGCC 540
TGCTCACCTT CGCCTCCCTG GAAACCGTTG GCATCTACTT CGGACTAGTG TACCGTATCC 600
ACAGCGTCCT CCAAGGCTTC ATCCCCCTCT TCCAGAAGTT TAGGCTGACA GGGTTCAGGA 660
AGACTGACTG AGGCCACTTC CAGGTGGGCA GCAGAGGCAG GCCCCAGTGT GACCACCACT 720
GCGACCCCTG AGCCCACAAG GGCAGAGCAG CATTCTGAGA GACGCACAGG AGACCAAGCC 780
AGACCAATAA ACAGAACACT TTTCCTTCCA TGTGGTCTGA ATGTTGGCAC CAGCCCGGGC 840
AGGGGCATCT CATTTGGGCA GTACTGCTGT GCAACCCAGC TGCAAGGATG GAAGGCAGAG 900
GGTGGGTGTG GGGCCTGAGG CTTCACAGTA CCTGGACCAG CAGGAAGATT CTGGGAGGTC 960
ACTGCTCTCA GAGGACAGCA AGGGACCCTG AGCTCTGCAA GCTGTGATCT GTCTGGGTTC 1020
ATGGTTTTTC TCAAATCCCA GGCTATCTGC ATGCGCTCTC AGGTGCTACC GAGCCATCCT 1080
GGGAGAGATG GATGGTCCAC TGCTTTGAGG CAGGGAGCCA TCGGGCTGGG GCCCCTTGGT 1140
GAACCTGATG CAGGTAAGAT GCTGAGGACT AAAACCATTT TTTTTGCACC CAAAAAAAAA 1200
GGCAGGAAAA TGATCATCAG AAACTAAATG GCAGCCAGGC ATGGGGGCTC ACGACTGTAA 1260
TCCTCGCACT TTGGGAGGCT CAGGCTAAGG GTCGCTTGAA GCTGAGAGTT CAAGACCAAC 1320
CTGGGCAACA TAGTGAGACC CCCATCTCTA CAATTTTTTT TTAATGACCA AATGTGGCGG 1380
TACATACCTG TACATACCTG CGGTTCCAGC TACTCAAGAG GCTGAGGCAG GAGGACTGCT 1440
TGAGCCCAGG AGTTCAGGGC TGCAGTGAGG TACGATCAAG CCACTGCACT CCAGCCTGGG 1500
CGACAGAGCA AGATCGTTTC TCTAAAATT



Seq ID NO: 122 Protein sequence: 
Protein Accession #: NP_060776

1 11 21 31 41 51

I I I I I I

METSASSSQP QDNSQVHRET EDVDYGETDF HKQDGKAGLF SQEQYERNKS SSSSFSSSSS
SSSSSSSSSS GPGHGEPDVL KDELQLYGDA PGEVVPSGES GLRRRGSDPA SGEVEASQLR
RLNIKKDDEF FHFVLLCFAI GALLVCYHYY ADWFMSLGVG LLTFASLETV GIYFGLVYRI 
HSVLQGFIPL FQKFRLTGFR KTD 

Seq ID NO: 123 DNA sequence 
Nucleic Acid Accession #: BC022542
Coding sequence: 243..896

1 11 21 31 41 51

I I I I I I

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 
CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 
TGGCGGCTCG GGTGGCGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 
GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 
CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 
TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 
TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 
AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 
ATTTGTCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 
GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 
AAGCCTCGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 
TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 
TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 
CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 
TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 
TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG 
TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 
TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 
CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 
TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 
GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTGAGAAAA 
CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 
TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 
ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 
CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 
GAGCCAAGAT CGCACCACTG CACTACAGCC TGGGCGACAG AACGAGACCC TGTCTCCAAA 
GGAAAAACAA AAAAGAAGAA TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 
ATGTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 
CTAAGAAATT AATATTAATA TAAAAATTAT TGATAATCTT AAATTATTGA TTATTCCTTA 
ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 
GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGTCAGGAAG ACAACTCAGA 
TTGCCATTTT AAATAAAGTT GTACATGAAC AAAAAAAAAA AAAAAA 



Seq ID NO: 124 Protein sequence:
Protein Accession #: AAH22542

1 11 21 31 41 51 

I I I I I I 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 60 
RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 120 
ASIVVNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CALENEDICQ 180
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK

Seq ID NO: 125 DNA sequence
Nucleic Acid Accession #: NM_004994.1
Coding sequence: 20..2143

1 11 21 31 41 51

I I I I I I

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60
GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120
CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180
CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGCCCTGCGC TGCTGCTTCT 240
CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300
GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360
CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420
GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGGGG TGACGCCGCT 480
CACCTTCACT CGCGTGTACA GCCGGGAGGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540
GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600
TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660
GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720
CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780
CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840
GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900
CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960
CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020
CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080
GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140
TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200
TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260
GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320
CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380
AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440
TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500
AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560
TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620
CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCGCCTT 1680
CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740
GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGGGTC 1800
GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860
CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920
GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980
CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040
CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100
GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160
GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220
CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT

Seq ID NO: 126 Protein sequence:
Protein Accession #: NP_004985.1

1 11 21 31 41 51

I I I I I I

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60
RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120
ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 180
FDGKDGLLAH AFPPGPGIQG DAHFDDDELW SLGKGVVVPT RFGNADGAAC HFPFIFEGRS 240
YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 300
ACTTDGRSDG YRWCATTANY DRDKLFGFCP TRADSTVMGG NSAGELCVFP FTFLGKEYST 360
CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 420
PMYRFTEGPP LHKDDVNGIR HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480
PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540
RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600
LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED

Seq ID NO: 127 DNA sequence
Nucleic Acid Accession #: NM_004181
Coding sequence: 32-670

1 11 21 31 41 51

I I I I I I

GCAGAAATAG CCTAGGGAGA TCAACCCCGA GATGCTGAAC AAAGTGCTGT CCCGGCTGGG 60
GGTCGCCGGC CAGTGGCGCT TCGTGGACGT GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 120
GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT GCTGTTTCCC CTCACGGCCC AGCATGAGAA 180
CTTCAGGAAA AAGCAGATTG AAGAGCTGAA GGGACAAGAA GTTAGTCCTA AAGTGTACTT 240
CATGAAGCAG ACCATTGGGA ATTCCTGTGG CACAATCGGA CTTATTCACG CAGTGGCCAA 300
TAATCAAGAC AAACTGGGAT TTGAGGATGG ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 360
AGAGAAAATG TCCCCTGAAG ACAGAGCAAA ATGCTTTGAA AAGAATGAGG CCATACAGGC 420
AGCCCATGAT GCCGTGGCAC AGGAAGGCCA ATGTCGGGTA GATGACAAGG TGAATTTCCA 480
TTTTATTCTG TTTAACAACG TGGATGGCCA CCTCTATGAA CTTGATGGAC GAATGCCTTT 540
TCCGGTGAAC CATGGCGCCA GTTCAGAGGA CACCCTGCTG AAGGACGCTG CCAAGGTGTG 600
CAGAGAATTC ACCGAGCGTG AGCAAGGAGA AGTCCGCTTC TCTGCCGTGG CTCTCTGCAA 660
GGCAGCCTAA TGCTCTGTGG GAGGGACTTT GCTGATTTCC CCTCTTCCCT TCAACATGAA 720
AATATATACC CCCCATGCAG TCTAAAATGC TTCAGTACTT GTGAAACACA GCTGTTCTTC 780
TGTTCTGCAG ACACGCCTTC CCCTCAGCCA CACCCAGGCA CTTAAGCACA AGCAGAGTGC 840
ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA GCTTCAGATG GTGAAGCATT CTCCCCAGTG 900
TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT [illegible] TCTGTAAGTT 960
AAGACCTTGG ATGTGGTTAT GTTGTCCTAA AGAATAAATT TTGCTGATAG TAGC



Seq ID NO: 128 Protein sequence:
Protein Accession #: NP_004172

1 11 21 31 41 51

I I I I I I

MLNKVLSRLG VAGQWRFVDV LGLEEESLGS VPAPACALLL LFPLTAQHEN FRKKQIEELK 60
GQEVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVLKQFLSET EKMSPEDRAK 120
CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGRMPF PVNHGASSED 180
TLLKDAAKVC REFTEREQGE VRFSAVALCK AA

Seq ID NO: 129 DNA sequence
Nucleic Acid Accession #: NM_000213
Coding sequence: 127-5385



CGCCCGCGCG 
CCCCGAGGTA 
AAGAGGATGG 
AGCGTCAGCC 
ACGGAGTGTG 
CGGCGCTGCA 
GTCATGGAGA 
AGCCAGATGT 
GAGCTGGAGG 
TCCAACTCCA 
GTCCTGAGCC 
AGOGTCCCGC 



AAACTGCAGG 
ATCCTGCAGA 
CTGGTCTTCT 
GGCATCATGA 
TACAGGACAC 
ATCATCCCCA 
TATTTCCCTG 
CTGGAGGAGG 
CGAGGCCTTC 
CACATCCGGC 
GATGGGACGC 
TCCTTCTCCG 
CTGCAAAAAG 
TGTGTGTGCA 
GACATTCAGC 
CAGTGCGGGC 
GACAACTTCC 
ATGGGCCAGT 
AATGCCACCT 
GGCOGCTGCC 
TCGGCGATCC 
GGCACCGGCG 
GACGAGCTTA 
GACTGCACCT 
CTGGTGCACA 
CTCCTCCTCC 
TGCAAGGCCT 
GAAGACCACT 
CTGGGCAGCG 
CAGCGGCCTG 
GGGCTGTCCT 
GAGTGCGCCC 
TCCGGTGTAC 
CAAGACCACA 
CTGAAGCTTA 
GGCTACTACA 
GTGGAGCTGG 
CAGCTGCTGG 
GTAAACATCA 
TTCTCGGTCA 
GGCGGGAAGT 
TACATCCCCG 
GTGAAGCTCC 
TTCCACGTCC 
ACCATCATCA 
TCACAGCCAC 
GGGTCCAGGA 
GTAAAGTACT 
CCCTCAGTGG 
TAOGGGGCTC 
GTGCCCAGCG 
AGCTGGGCTG 
CTGGTCAAOG 
AAGAACCGGA 
AAGGOGOGCA 
CAGCCCAAGA 
AGCGGGGAGG 
GGCAGCCAGA 
TTTGCCTTCC 
TATGGCACCC 



AGAAGAAGGG 
AGAGAGCCGA 
ACAGCTACAC 
AGAAGAAGGA 
TGCCGCTCCT 
GCCTGGCACT 
ACATGCTGCG 
GGAACCTCAA 
GCTTTGCCAC 
TGCGCCTGGC 
AGCTGCGCCA 
ACAAGCTCCA 
CCATTGTGGA 
CAGAGAAGCA 
CCCTCACTGC 
TGGACGTACG 
TGGAGGCCAT 
CCATCATCAA 
GCCGCGGGGA 
CCCAGGTCTC 
TGGAGGGTGA 
TGGAGCTGCA 
AGCTCAGCAA 
TCAGGGACCC 
CCCCTCACGG 
AGATCCATTT 
GGATTCAGGG 
AGCTCACCAA 
AGGGCGAGGG 
AGCCAGGGCG 
AGCOGGCTGA 
ATGACAACCG 
TGCTGCTTAT 
ACGGGGCCGG 
GGCCCATGTC 
ACTACGACAG 
GGCCCAGCGT 
GGGGCAGCAC 
ACCTGAGCCC 



21 
I 

ATCTCCTAGC 
GGGCGCACAG 
CCCCAGCCCA 
CTTGGCAAAC 
TAAGGACTGC 
GGAGCTGCTG 
AATCACAGAG 
CCTGCGGGTC 
ACTGGAGAGC 
TCTGGACAAC 
CGACTACACT 
GAGGCCTGAG 
CGTCATCAGC 
CTCAGGCAAC 
CACGAGGGAC 
AGCCTTCCAC 
TGAACGGTGC 
GTCGGTGCCC 
CACCAACTAC 
GGGGGTGCTG 
GATCCGCTCC 
CACCTCCAAG 
GGGTATATAC 
GCTGCCGGAG 
GATGGACGCG 
AGCTCGCTGC 
GAGTGGCCAG 
GGAGGGCGAG 
CTACGGCGAA 
CACTTCCGGG 
GCCTGGTTGG 
CAATGGGGGC 
GCAGTOGCTC 
CTGCGAGGAC 
GCGCACGTGT 
GGAGGTGGTG 
CATGGAAGGT 
CTGCCCTCCG 
GGCCCTGCTA 
TCTCCCGTGC 
GGAGAACCTG 
GGGCCGTGAC 
TCATGCCGCC 
COGCCTTTGC 
GGAGGTGGAG 
GCAGACCAAG 
CACAGTGCTG 
GGTGGAACAG 
AGACCAGGAC 
GGTGCCCCTC 
CGACGTGCCC 
GGAGCAAGCC 
CCAGGTGGCC 
CTACCGCACA 
GCTGCTGTTC 
AGAAGTTGAC 
CCCTAAGTTT 
AGATGAACTG 
CGACCTGGGC 
CAACTGGCTG 
TGACTCCGAA 
CCTGTACCCG 
ACCCTACAGC 
TCTGGCCTTC 
GACCAACGGT 
ACCTATTGGG 
TGAGAACCTT 
CTGGGGGCCT 
CATCCCCATC 
CTTCCTTATG 
CTCCGATGAC 
CAACTCCCTG 
ACAOGTGCCC 



31 
I 

GGCAGOCCAG 
CAGCAGCCGA 
TGGGCCAGGC 
CGCTGCAAGA 
GCCTACTGCA 
GCOGCGGGCT 
GAGACCCAGA 
CGTCTGCGGC 
CCOGTGGACC 
CTCAAGAAGA 
ATTGGATTTG 
AAGCTGAAGG 
CTGACAGAAG 
CTGGATGCTC 
ATTGGCTGGC 
TATGAGGCTG 
CACCTGGACA 
ACCCTGGTGC 
TCCTATAGCT 
CAGGAGGACT 
AACCTGGACA 
ATGTTCCAGA 
CAGGTGCAGC 
GACCAGAAGG 
GGCATCATCT 
AGCTTCAACG 
ACCTGCAACT 
GACAAGCCGT 
GGCCGCTACG 
TTCCTCTGCA 
ACAGGCCCAA 
ATCTGTAATG 
TACACGGACA 
CTACGCTCCT 
GAGGAATGCA 
GTGCGCTGCT 
GACGGCGCCC 

CTGCTGCTAT 
TGCAACCGAG 
ATGGOCTCTG 
GTGGTCCGCT 
AGCATCAACC 
ACCGAGAACC 
GAGAACCTGA 
TTCCGGCAGC 
ATGGOGCCCC 
AGGGCCTTCC 
GCCCGGGGCA 
TTTATCCGGC 
GCAGGCACTG 
AGAGACGTGG 
CGCATCCCTG 
CAGGATGGCA 
CAGCCTGGGG 
TCCCTCCTGC 
GGGGCCCACC 
GACCGGAGCT 
GCCCCGCAGA 
CCCCCTTCTG 
TCCGAAGCCC 
TATTGCGACT 
TCCCTGGTGT 
AATGTCGTCT 
GAGATCACAG 
CCCATGAAGA 
OGGGAGTCCC 
GAGCGGGAGG 
ATCCCTGACA 
TACAGCGATG 
ACTGAGCACC 
CACAGGATGA 
CACCGCGTGC 



41 
I 

GCGCGGAGGG 
GGCTGGCCGG 
TGCTCCTGGC 
AGGCCCCAGT 
CAGACGAGAT 
GCCAGOGGGA 
TTGACACCAC 
CCGGTGAGGA 
TGTACATCCT 
TGGGGCAGAA 
GCAAGTTTGT 
AGCCCTGGCC 
ATGTGGATGA 
CTGAGGGCGG 
GCCCGGACAG 
ATGGCGCCAA 
CCACGGGCAC 
GCCTGCTOGC 
ACTACGAGAA 
CGTCCAACAT 
TCOGGGCCCT 
AGACCAGGAC 
TGCGGGCCCT 
GCAACATCCA 
GTGATGTGTG 
GAGACTTCGT 
GCTCCACCGG 
GCTCCGGCCG 
AGGGTCAGTT 
ATGACCGAGG 
GCTGTGACTG 
GACGTGGCCA 
CCATCTGCGA 
GCGTGCAGTG 
ACTTCAAGGT 
CCTTCCGGGA 
CTGGGCCCAA 
GGTGGCTCAT 
GCTGGAAGTA 
GTCACATGGT 
ACCACTTGGA 
OGAAGGTCAC 
CCACAGAGCT 
TGCTGAAGCC 
ACGAGGTCTA 
AGCCCAATGC 
GCTCGGCCAA 
AOGACCTCAA 
TGGTGGAGTT 
CTGAGGATGA 
CCACCCTCGG 
TGTCCTTTGA 
TCATCCGGCG 
CCGCGCAGGG 
AGGCCTGGAA 
GGGGCCGCCA 
TGGGCCAGCC 
TCACGAGTCA 
ACCCCAATGC 
GCAAGCCAAT 
ACCTGCTCGA 
ATGAGATGAA 
CCTGCCGCAC 
CCTCCACGGT 
CCTACGAGGT 
AAGTGCTGGT 
AGCCCTACCG 
CCATCATCAA 
TCCCTATCGT 
ACGTTCTACG 
TGGTGAATGG 
CCACGACCAG 
TAAGCACATC 



51 
I 

AGCGAGTCCG 
GAGAGGGAGG 
AGCCTTGATC 
GAAGAGCTGC 
GTTCAGGGAC 
GAGCATCGTG 
CCTGCGGCGC 
GCGGCATTTT 
CATGGACTTC 
CCTGGCTCGG 
GGACAAAGTC 
CAACAGTGAC 
GTTCCGGAAT 
CTTCGATGCC 
CACCCACCTG 
CGTGCTGGCT 
CTACACCCAG 
CAAGCACAAC 
GCTTCACACC 
CGTGGAGCTG 
AGACAGCCCC 
TGGGTCCTTT 
TGAGCACGTG 
TCTGAAACCT 
CACCTGCGAG 
GTGCGGACAG 
CTCTCTGAGT 
TGGGGAGTGC 
CTGCGAGTAT 
AOGCTGCTCC 
TCCCCTCAGC 
CTGTGAGTGT 
GATCAACTAC 
CCAGGCGTGG 
CAAGATGGTG 
CGAGGATGAC 
CAGCACTGTC 
CCCCCTGCTC 
CTGTGCCTGC 
GGGCTTTAAG 
CACGCCCATG 
CAACAACATG 
QGTGCCCTAC 
TGACACTCGG 
CAGGCAGATC 
CGGGAAAAAG 
GCCGGCCCTG 
GGTGGCCCCC 
CCAGGAGGGC 
OGACGAGAAG 
CCGCCGCCTG 
GCAGCCTGAG 
TGTCCTGGAC 
CAACCGGGAC 
AGAGCTGCAG 
GGTCCGCCGT 
CCACTCCACC 
GATGTTGTCA 
TAAGGCCGCT 
GGGGTACAGG 
CAGCAAGGTG 
GGTGTGCGCC 
CCACCAGGAA 
GACCCAGCTG 
CTGCTATGGC 
TGACAACCCT 
CTACACGGTG 
CCTGGCCACC 
GGACGCCCAG 
CTCTCCATCG 
CCGGATGGAC 
TGCTGCTGCC 
CTCCACCCTC 




ACACGGGACT ACAACTCACT GACCCGCTCA 
GACTACTCCA CCCTCACCTC CGTCTCCTCC 
GACACGCCCA CCCGCCTGGT GTTCTCTGCC 
CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG 
GGCGGTGAGC TGCATCGGCT CAACATCCCC 
GACCTCCTGC CCAACCACTC CTACGTGTTC 
GGCCGAGAGC GTGAGGGTGT CATCACCATT 
TGTCCCCTGC CAGGCTCCGC CTTCACTTTG 
TTCACTGCCC TGAGCCCAGA CTCGCTGCAG 
GGGGATATCG TCGGCTACCT GGTGACCTGT 
GCATTCCGGG TGGATGGAGA CAGCCCCGAG 
AACGTGCCCT ACAAGTTCAA GGTGCAGGCC 
GAGGGCATCA TCACCATAGA GTCCCAGGAT 
GCCGGGCTCT TCCAGCACCC GCTGCAAAGC 
AGCGCCACCG AGCCCTTCCT AGTGGATGGG 
GGCGGCTCCC TCACCCGGCA TGTGACCCAG 
GGAACCCTTA GCACCCACAT GGACCAACAG 
CCCCCGCCAT GTCCCACTAG GCGTCCTCCC 
CCATCCTTGC ACCCCTGGGG GCCCAGCCCA 
TCCTGGGAGG CATGAAGGGG GCAAGGTCCG 
AAAGAGCTGG GAGCAGCACA AGGACCCAGC 
ACTG 



Seq ID NO: 130 Protein sequence:
Protein Accession #: NP_000204



GAACACTCAC 
CACGACTCTC 
CTGGGGCCCA 
GGCTACAGTG 
AACCCTGCCC 
CGCGTGCGGG 
GAATCCCAGG 
AGCACTCCCA 
CTGAGCTGGG 
GAGATGGCCC 
AGCCGGCTGA 
AGGACCACTG 
GGAGGACCCT 
GAGTACAGCA 
CCGACCCTGG 
GAGTTTGTGA 
TTCTTCCAAA 
GACTCCTCTC 
CCCGCATGCA 
TCCTCTGTGG 
CTTTGTTCTG 



ACTCGACCAC 
GCCTGACTGC 
CATCTCTCAG 
TGGAGTACCA 
AGACCTCGGT 
CCCAGAGCCA 
TGCACCCGCA 
GTGCCCCAGG 
AGCGGCCACG 
AAGGAGGAGG 
CCGTGCCGGG 
AGGGCTTCGG 
TCCCGCAGCT 
GCATCACCAC 
GGGCCCAGCA 
GCCGGACACT 
CTTGACCGCA 
CCGGAGCCTC 
CAGAGCAGGG 
GCCCAAACCT 
CACTTAATAA 



MAGPRPSPWA 
CNTQAELLAA 
EVFEPLESPV 
PQTDMRPEKL 
QTAVCTEDIG 
TQDYPSVPTL 
EAFNRIRSNL 
THVCQLPEDQ 
CSEGWSGQTC 
PQCPRTSGPL 
CHCHQQSLYT 
LKRAEEVWR 
LLPLLALLLL 
SGNLXGRDW 
AQLRQEVEEN 
LTEKQVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQLSNPKFGA 
RKXHFNWliPP 
AQGEGPYSSL 
NDDNRPIGPM 
KRPMSIPIIP 
FPGSTNSLHR 
STLTSVSSHD 
ELHRLNIPNP 
: LPGSAFTLST 
RVDGDSPESR 
LFQHPLQSEY 
LSTHMDQQPF 



11 
I 

RLLLAALISV 
GCQRESIWM 
DLYILMDFSN 
KEPWPNSDPP 
WRPDSTHLLV 
VRLLAKHNII 
DIRALDSPRG 
KGNIHLKPSF 
NCSTGSLSDI 
CNDRGRCSMG 
DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTNNMQR 
LNEVYRQISG 
PHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPMGYRVK 
VSCRTHQEVP 
KKVLVDNPKN 
DIPIVDAQSG 
MTTTSAAAYG 
SRLTAGVPDT 
AQTSVWEDI* 
PSAPGPLVFT 
LTVPGLSENV 
SSITTTHTSA 
QT 



21 
I 

SLSGTLANRC 
ESSFQITEET 
SMSDDLDNLK 
FSFKNVISLT 
FSTESAFHYE 
PIFAVTNYSY 
LRTEVTSKKF 
SDGIiKMDAGI 
QPCLREGEDK 
QCVCEPGWTG 
IHPGLCEDLR 
TYSYXMEGDG 
ACLALLPCCN 
PGFATHAASI 
VHKLQQTKPR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELLPQP 
IIRDPDELDR 
YWIQGDSESE 
SEPGRLAFNV 
RMLLIENLRB 
EDYDSFLMYS 
THLSPHVPHR 
PTRLVFSALG 
LPNHSYVFRV 
AI*SPDSLQLS 
PYICPKVQART 
TBPFLVDGPT 



31 
I 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVL 
EDVDEFRNKL 
ADGANVLAGI 
SYYEKIaHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGRGECQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHMVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEPQEGVE 
WSFEQPEFS 
GEAWKELQVK 
SFTSQMLSSQ 
AHLLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
VLSTSSTLTR 
PTSLRVSWQE 
RAQSQEGWGR 
WERPRRPNGD 
TEGFGPEREG 
LGAQHLEAGG 



41 

I • 

CVRVDKDCAY 
MSPQGLRVRL 
SQLTSDYTIG 
QGERISGNLD 
MSRNDERCRL 
PVSSLGVLQE 
RRGEVGIYQV 
KEVRSARCSF 
GHCVCYGEGR 
TCIDSNGGIC 



ACTGCCGAGG 

TGGTGTGCCC 

AGTGAGCTGG 

GCTGCTGAAC 

GGTGGTGGAA 

GGAAGGCTGG . 

GAGCCCACTG 

CCCGCTGGTG 

GAGGCCCAAT 

GCCAGCCACC 

CCTCAGCGAG 

GCCAGAGCGC 

GGGCAGCCGT 

CACCCACACC 

CCTGGAGGCA 

GACCACCAGC 

CCCTGCCCCA 

CTCAGCTACT 

GCTAGGTGTC 

ATTTGTAACC 

ATGGTTTTGC 



51 
I 

CTDBMFRDRR 



FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QLRALEHVDG 
NGDFVCGQCV 
YEGQFCEYDN 



HYMLRENLMA 
SLRLARLCTE 
HTIVDTVLMA 
LVDVRVPLFI 
VSRGDQVARI 
LLELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
AEPAETNGEI 
RNGAGWGPER 
QRPSVSDDTB 
DYNSLTRSEH 
PRCERPLQGY 
EREGVITIES 
IVGYLVTCEM 
IITIESQDGG 
SLTRHVTQEP 



CNPKVKMVDE 
FVWLIPLLLL 
SDHLDTPMLR 
NLLKPDTREC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYEVCYGLV 
EAIINLATQP 
HLVNGRMDFA 
SHSTTLPRDY 
SVEYQLLNGG 
QVHPQSPLCP 
AQGGGPATAF 
PFPQLGSRAG 
VSRTLTTSGT 



Seq ID NO: 131 DNA sequence
Nucleic Acid Accession #: BC004372
Coding sequence: 132..2231

1 11 21 31 41 51

I I I I I I

CCTCGTGCCG CGGACCCCAG CCTCTGCCAG GTTCGGTCCG CCATCCTCGT CCCGTCCTCC 60
GCCGGCCCCT GCCCCGCGCC CAGGGATCCT CCAGCTCCTT TCGCCCGCGC CCTCCGTTCG 120
CTCCGGACAC CATGGACAAG TTTTGGTGGC ACGCAGCCTG GGGACTCTGC CTCGTGCCGC 180
TGAGCCTGGC GCAGATCGAT TTGAATATAA CCTGCCGCTT TGCAGGTGTA TTCCACGTGG 240
AGAAAAATGG TCGCTACAGC ATCTCTCGGA CGGAGGCCGC TGACCTCTGC AAGGCTTTCA 300
ATAGCACCTT GCCCACAATG GCCCAGATGG AGAAAGCTCT GAGCATCGGA TTTGAGACCT 360
GCAGGTATGG GTTCATAGAA GGGCATGTGG TGATTCCCCG GATCCACCCC AACTCCATCT 420
GTGCAGCAAA CAACACAGGG GTGTACATCC TCACATCCAA CACCTCCCAG TATGACACAT 480
ATTGCTTCAA TGCTTCAGCT CCACCTGAAG AAGATTGTAC ATCAGTCACA GACCTGCCCA 540
ATGCCTTTGA TGGACCAATT ACCATAACTA TTGTTAACCG TGATGGCACC CGCTATGTCC 600
AGAAAGGAGA ATACAGAACG AATCCTGAAG ACATCTACCC CAGCAACCCT ACTGATGATG 660
ACGTGAGCAG CGGCTCCTCC AGTGAAAGGA GCAGCACTTC AGGAGGTTAC ATCTTTTACA 720
CCTTTTCTAC TGTACACCCC ATCCCAGACG AAGACAGTCC CTGGATCACC GACAGCACAG 780
ACAGAATCCC TGCTACCAGT ACGTCTTCAA ATACCATCTC AGCAGGCTGG GAGCCAAATG 840
AAGAAAATGA AGATGAAAGA GACAGACACC TCAGTTTTTC TGGATCAGGC ATTGATGATG 900
ATGAAGATTT TATCTCCAGC ACCATTTCAA CCACACCACG GGCTTTTGAC CACACAAAAC 960
AGAACCAGGA CTGGACCCAG TGGAACCCAA GCCATTCAAA TCCGGAAGTG CTACTTCAGA 1020
CAACCACAAG GATGACTGAT GTAGACAGAA ATGGCACCAC TGCTTATGAA GGAAACTGGA 1080
ACCCAGAAGC ACACCCTCCC CTCATTCACC ATGAGCATCA TGAGGAAGAA GAGACCCCAC 1140
ATTCTACAAG CACAATCCAG GCAACTCCTA GTAGTACAAC GGAAGAAACA GCTACCCAGA 1200
AGGAACAGTG GTTTGGCAAC AGATGGCATG AGGGATATCG CCAAACACCC AGAGAAGACT 1260
CCCATTCGAC AACAGGGACA GCTGCAGCCT CAGCTCATAC CAGCCATCCA ATGCAAGGAA 1320
GGACAACACC AAGCCCAGAG GACAGTTCCT GGACTGATTT CTTCAACCCA ATCTCACACC 1380
CCATGGGACG AGGTCATCAA GCAGGAAGAA GGATGGATAT GGACTCCAGT CATAGTACAA 1440
CGCTTCAGCC TACTGCAAAT CCAAACACAG GTTTGGTGGA AGATTTGGAC AGGACAGGAC 1500
CTCTTTCAAT GACAACGCAG CAGAGTAATT CTCAGAGCTT CTCTACATCA CATGAAGGCT 1560
TGGAAGAAGA TAAAGACCAT CCAACAACTT CTACTCTGAC ATCAAGCAAT AGGAATGATG 1620
TCACAGGTGG AAGAAGAGAC CCAAATCATT CTGAAGGCTC AACTACTTTA CTGGAAGGTT 1680
ATACCTCTCA TTACCCACAC ACGAAGGAAA GCAGGACCTT CATCCCAGTG ACCTCAGCTA 1740
AGACTGGGTC CTTTGGAGTT ACTGCAGTTA CTGTTGGAGA TTCCAACTCT AATGTCAATC 1800
GTTCCTTATC AGGAGACCAA GACACATTCC ACCCCAGTGG GGGGTCCCAT ACCACTCATG 1860
GATCTGAATC AGATGGACAC TCACATGGGA GTCAAGAAGG TGGAGCAAAC ACAACCTCTG 1920
GTCCTATAAG GACACCCCAA ATTCCAGAAT GGCTGATCAT CTTGGCATCC CTCTTGGCCT 1980
TGGCTTTGAT TCTTGCAGTT TGCATTGCAG TCAACAGTCG AAGAAGGTGT GGGCAGAAGA 2040
AAAAGCTAGT GATCAACAGT GGCAATGGAG CTGTGGAGGA CAGAAAGCCA AGTGGACTCA 2100
ACGGAGAGGC CAGCAAGTCT CAGGAAATGG TGCATTTGGT GAACAAGGAG TCGTCAGAAA 2160
CTCCAGACCA GTTTATGACA GCTGATGAGA CAAGGAACCT GCAGAATGTG GACATGAAGA 2220
TTGGGGTGTA ACACCTACAC CATTATCTTG GAAAGAAACA ACCGTTGGAA ACATAACCAT 2280
TACAGGGAGC TGGGACACTT AACAGATGCA ATGTGCTACT GATTGTTTCA TTGCGAATCT 2340
TTTTTAGCAT AAAATTTTCT ACTCTTAAAA AAAAAAAAAA AAAAAAA

Seq ID NO: 132 Protein sequence:
Protein Accession #: AAH04372

1 11 21 31 41 51

I I I I I I

MDKFWWHAAW GLCLVPLSLA QIDLNITCRF AGVFHVEKNG RYSISRTEAA DLCKAFNSTL 60
PTMAQMEKAL SIGFETCRYG FIEGHVVIPR IHPNSICAAN NTGVYILTSN TSQYDTYCFN 120
ASAPPEEDCT SVTDLPNAFD GPITITIVNR DGTRYVQKGE YRTNPEDIYP SNPTDDDVSS 180
GSSSERSSTS GGYIFYTFST VHPIPDEDSP WITDSTDRIP ATSTSSNTIS AGWEPNEENE 240
DERERHLSFS GSGIDDDEDF ISSTISTTPR AFDHTKQNQD WTQWNPSHSN PEVLLQTTTR 300
MTDVDRNGTT AYEGNWNPEA HPPLIHHEHH EEEETPHSTS TIQATPSSTT EETATQKEQW 360
FGNRWHEGYR QTPREDSHST TGTAAASAHT SHPMQGRTTP SPEDSSWTDF FNPISHPMGR 420
GHQAGRRMDM DSSHSTTLQP TANPNTGLVE DLDRTGPLSM TTQQSNSQSF STSHEGLEED 480
KDHPTTSTLT SSNRNDVTGG RRDPNHSEGS TTLLEGYTSH YPHTKESRTF IPVTSAKTGS 540
FGVTAVTVGD SNSNVNRSLS GDQDTFHPSG GSHTTHGSES DGHSHGSQEG GANTTSGPIR 600
TPQIPEWLII LASLLALALI LAVCIAVNSR RRCGQKKKLV INSGNGAVED RKPSGLNGEA 660
SKSQEMVHLV NKESSETPDQ FMTADETRNL QNVDMKIGV









Seq ID NO: 133 DNA sequence
Nucleic Acid Accession #: NM_002882
Coding sequence: 150-755

1 11 21 31 41 51

I I I I I I

CGAGGTTCGG GTCGTGGGGC GGAGGGAAGA GCGGGCGGGC GGGAGGCGCC GGCGCCAGAC 60
GCGGAGGGAA GGAGCTACGA GTAGCCGCCG AGAGGCCGCG GAGCCAGCGA CGACCGACCC 120
AGCCGAGCCG CCGCCGCCGC CGCGCCCCCA TGGCGGCCGC CAAGGACACT CATGAGGACC 180
ATGATACTTC CACTGAGAAT ACAGACGAGT CCAACCATGA CCCTCAGTTT GAGCCAATAG 240
[illegible] TGAGCAAGAA ATTAAAACAC TGGAAGAAGA TGAAGAGGAA CTTTTTAAAA 300
TGCGGGCAAA ACTGTTCCGA TTTGCCTCTG AGAACGATCT CCCAGAATGG AAGGAGCGAG 360
GCACTGGTGA CGTCAAGCTC CTGAAGCACA AGGAGAAAGG GGCCATCCGC CTCCTCATGC 420
GGAGGGACAA GACCCTGAAG ATCTGTGCCA ACCACTACAT CACGCCGATG ATGGAGCTGA 480
AGCCCAACGC AGGTAGCGAC CGTGCCTGGG TCTGGAACAC CCACGCTGAC TTCGCCGACG 540
AGTGCCCCAA GCCAGAGCTG CTGGCCATCC GCTTCCTGAA TGCTGAGAAT GCACAGAAAT 600
TCAAAACAAA GTTTGAAGAA TGCAGGAAAG AGATCGAAGA GAGAGAAAAG AAAGCAGGAT 660
CAGGCAAAAA TGATCATGCC GAAAAAGTGG CGGAAAAGCT AGAAGCTCTC TCGGTGAAGG 720
AGGAGACCAA GGAGGATGCT GAGGAGAAGC AATAAATCGT CTTATTTTAT TTTCTTTTCC 780
TCTCTTTCCT [illegible] TAAAAAATTT TACCCTGCCC CTCTTTTTCG GTTTGTTTTT 840
ATTCTTTCAT TTTTACAAGG GACGTTATAT AAAGAACTGA ACTC

Seq ID NO: 134 Protein sequence:
Protein Accession #: NP_002873

1 11 21 31 41 51

I I I I I I

MAAAKDTHED HDTSTENTDE SNHDPQFEPI VSLPEQEIKT LEEDEEELFK MRAKLFRFAS 60
ENDLPEWKER GTGDVKLLKH KEKGAIRLLM RRDKTLKICA NHYITPMMEL KPNAGSDRAW 120
VWNTHADFAD ECPKPELLAI RFLNAENAQK FKTKFEECRK EIEEREKKAG SGKNDHAEKV 180
AEKLEALSVK EETKEDAEEK Q

Seq ID NO: 135 DNA sequence
Nucleic Acid Accession #: NM_000077.2
Coding sequence: 277-742

1 11 21 31 41 51

I I I I I I

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60
TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120
GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180
GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240
GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300
GCCTTCGGCT GACTGGCTGG CCACGGCCGC GGCCCGGGGT CGGGTAGAGG AGGTGCGGGC 360
GCTGCTGGAG GCGGGGGCGC TGCCCAACGC ACCGAATAGT TACGGTCGGA GGCCGATCCA 420
GGTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG GCGCGGAGCC 480
CAACTGCGCC GACCCCGCCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 540
CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 600
GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 660
GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 720
TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 780
GATCATCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC 840
TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG TCCTGCCTTT TAACGTAGAT 900
ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960
AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020
AGCACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 1080
GAAGCTGTCG ACTTCATGAC AAGCATTTTG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140
CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200
ATTTTCATTC ATTCACTC

Seq ID NO: 136 Protein sequence:
Protein Accession #: NP_000068.1

1 11 21 31 41 51

I I I I I I

MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60
ELLLLHGAEP NCADPATLTR PVHDAAREGF LDTLVVLHRA GARLDVRDAW GRLPVDLAEE 120
LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD

Seq ID NO: 137 DNA sequence
Nucleic Acid Accession #: NM_058196.1
Coding sequence: 104-421

1 11 21 31 41 51

I I I I I I

TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 60
GCCCCCACCC TGGCTCTGAC CATTCTGTTC TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120
CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180
TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240
GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300
GGCTGAGGAG CTGGGCCATC GCGATGTCGC ACGGTACCTG CGCGCGGCTG CGGGGGGCAC 360
CAGAGGCAGT AACCATGCCC GCATAGATGC CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420
AAAGAACCAG AGAGGCTCTG AGAAACCTCG GGAAACTTAG ATCATCAGTC ACCGAAGGTC 480
CTACAGGGCC ACAACTGCCC CGGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540
AATAGAGCTT TTAAAAATGT CCTGCCTTTT AACGTAGATA TAAGCCTTCC CCCACTACCG 600
TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 660
CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720
CATTCATGTG GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA 780
AGCATTTTGT GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 840
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC

Seq ID NO: 138 Protein sequence:
Protein Accession #: NP_478103.1

1 11 21 31 41 51

I I I I I I

MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTLVVLHRAG ARLDVRDAWG 60
RLPVDLAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD

Seq ID NO: 139 DNA sequence
Nucleic Acid Accession #: NM_058197.1
Coding sequence: 272-684

1 11 21 31 41 51

I I I I I I

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60
TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120
GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180
GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240
GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300
GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360
GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 420
TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480
CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 540
CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600
TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 660
ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 720
TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780
ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840
TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 900
GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 960
GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020
CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080
CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 1140
CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200
TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1260
TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1320
ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380
GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440
CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1500
TTCATTCATT CACTC

Seq ID NO: 140 Protein sequence: 
Protein Accession #: NP_478104.1

1 11 21 31 41 51

I I I I I I

MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALL EAGALPNAPN SYGRRPIQVG 60
RRSAAGAGDG GRLWRTKFAG ELESGSASIL RKKGRLPGEF SEGVCNHRPP PGDALGAWET 120
KEEE

Seg ID NO: 141 DNA sequence 

Nucleic Acid Accession #t NM_058195.1 

Coding sequence: 163-684 

1 11 21 31 41 51 

1 I I 1 I I 

CCTCCCTACG GGOGCCTCCG GCAGCCCTTC CCGCGTGCGC AGGGCTCAGA GCCGTTCCGA 60 

GATCTTGGAG GTCCGGGTGG GAGTGGGGGT GGGGTGGGGG TGGGGGTGAA GGTGGGGGGC 120 

GGGCGCGCTC AGGGAAGGCG GGTGCGCGCC TGCGGGGCGG AGATGGGCAG GGGGCGGTGC 180 

GTGGGTCCCA GTCTGCAGTT AAGGGGGCAG GAGTGGCGCT GCTCACCTCT GGTGCCAAAG 240 

GGCGGCGCAG CGGCTGCCGA GCTCGGCCCT GGAGGCGGCG AGAACATGGT GCGCAGGTTC 300 

TTGGTGACCC TCCGGATTCG GCGCGCGTGC GGCCCGCCGC GAGTGAGGGT TTTCGTGGTT 360 

CACATCCCGC GGCTCACGGG GGAGTGGGCA GCGCCAGGGG CGCCCGCCGC TGTGGCCCTC 420 

GTGCTGATGC TACTGAGGAG CCAGCGTCTA GGGCAGCAGC CGCTTCCTAG AAGACCAGGT 480 

CATGATGATG GGCAGCGCCC GAGTGGCGGA GCTGCTGCTG CTCCAOGGCG CGGAGCCCAA 540 

CTGCGCCGAC CCCGCCACTC TCACCCGACC CGTGCACGAC GCTGCCCGGG AGGGCTTCCT 600 

GGACAOGCTG GTGGTGCTGC ACCGGGCCGG GGCGCGGCTG GACGTGCGCG ATGCCTGGGG 660 

CCGTCTGCCC GTGGACCTGG CTGAGGAGCT GGGCCATCGC GATGTCGCAC GGTACCTGCG 720 

CGCGGCTGCG GGGGGCACCA GAGGCAGTAA CCATGCCCGC ATAGATGCCG CGGAAGGTCC 780 

CTCAGACATC CCCGATTGAA AGAACCAGAG AGGCTCTGAG AAACCTCGGG AAACTTAGAT 840 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 900 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 960 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1020 

TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1080 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1140 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1200 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1260 
TTCATTCATT CACTC 

Seq ID NO: 142 Protein sequence: 
Protein Accession #: NP_478102.1 



1 11 21 31 41 51 

I I I I I I 

MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 60 
VRVFVVHIPR LTGEWAAPGA PAAVALVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 120 
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG 

Seq ID NO: 143 DNA sequence 
Nucleic Acid Accession #: NM_018131 
Coding sequence: 412..1107 



1 11 21 31 41 51 

I I I I I I 

GAAATTGCAC ACTTAAAGAC ATCAGTGGAT GAAATCACAA GTGGGAAAGG AAAGCTGACT 60 

GATAAAGAGA GACAGAGACT TTTGGAGAAA ATTCGAGTCC TTGAGGCTGA GAAGGAGAAG 120 

AATGCTTATC AACTCACAGA GAAGGACAAA GAAATACAGC GACTGAGAGA CCAACTGAAG 180 

GCCAGATATA GTACTACCGC ATTGCTTGAA CAGCTGGAAG AGACAACGAG AGAAGGAGAA 240 

AGGAGGGAGC AGGTGTTGAA AGCCTTATCT GAAGAGAAAG ACGTATTGAA ACAACAGTTG 300 

TCTGCTGCAA CCTCACGAAT TGCTGAACTT GAAAGCAAAA CCAATACACT CCGTTTATCA 360 

CAGACTGTGG CTCCAAACTG CTTCAACTCA TCAATAAATA ATATTCATGA AATGGAAATA 420 

CAGCTGAAAG ATGCTCTGGA GAAAAATCAG CAGTGGCTCG TGTATGATCA GCAGCGGGAA 480 

GTCTATGTAA AAGGACTTTT AGCAAAGATC TTTGAGTTGG AAAAGAAAAC GGAAACAGCT 540 

GCTCATTCAC TCCCACAGCA GACAAAAAAG CCTGAATCAG AAGGTTATCT TCAAGAAGAG 600 

AAGCAGAAAT GTTACAACGA TCTCTTGGCA AGTGCAAAAA AAGATCTTGA GGTTGAACGA 660 

CAAACCATAA CTCAGCTGAG TTTTGAACTG AGTGAATTTC GAAGAAAATA TGAAGAAACC 720 

CAAAAAGAAG TTCACAATTT AAATCAGCTG TTGTATTCAC AAAGAAGGGC AGATGTGCAA 780 

CATCTGGAAG ATGATAGGCA TAAAACAGAG AAGATACAAA AACTCAGGGA AGAGAATGAT 840 

ATTGCTAGGG GAAAACTTGA AGAAGAGAAG AAGAGATCCG AAGAGCTCTT ATCTCAGGTC 900 

CAGTCTCTTT ACACATCTCT GCTAAAGCAG CAAGAAGAAC AAACAAGGGT AGCTCTGTTG 960 

GAACAACAGA TGCAGGCATG TACTTTAGAC TTTGAAAATG AAAAACTCGA CCGTCAACAT 1020 

GTGCAGCATC AATTGCATGT AATTCTTAAG GAGCTCCGAA AAGCAAGAAA AAATAACACA 1080 

GTTGGAATCC TTGAAACAGC TTCATGAGTT TGCCATCACA GAGCCATTAG TCACTTTCCA 1140 

AGGAGAGACT GAAAACAGAG AAAAAGTTGC CGCCTCACCA AAAAGTCCCA CTGCTGCACT 1200 

CAATGGAAGC CTGGTGGAAT GTCCCAAGTG CAATATACAO TATCCAGCCA CTGAGCATCG 1260 

CGATCTGCTT GTCCATGTGG AATACTGTTC AAAGTAGCAA AATAAGTATT TGTTTTGATA 1320 

TTAAAAGATT CAATACTGTA TTTTCTGTTA GCTTGTGGGC ATTTTGAATT ATATATTTCA 1380 

CATTTTGCAT AAAACTGCCT ATCTACCTTT GACACTCCAG CATGCTAGTG AATCATGTAT 1440 

CTTTTAGGCT GCTGTGCATT TCTCTTGGCA GTGATACCTC CCTGACATGG TTCATCATCA 1500 

GGCTGCAATG ACAGAATGTG GTGAGCAGCG TCTACTGAGA TACTAACATT TTGCACTGTC 1560 

AAAATACTTG GTGAGGAAAA GATAGCTCAG GTTATTGCTA ATGGGTTAAT GCACCAGCAA 1620 

GCAAAATATT TTATGTTTCG GGGGTTTTGA AAAATCAAAG ATAATTAACC AAGGATCTTA 1680 

ACTGTGTTCG CATTTTTTAT CCAAGCACTT AGAAAACCTA CAATCCTAAT TTTGATGTCC 1740 

ATTGTTAAGA GGTGGTGATA GATACTATTT TTTTTTCATA TTGTATAGCG GTTATTAGAA 1800 

AAGTTGGGGA TTTTCTTGAT CTTTATTGCT GCTTACCATT GAAACTTAAC CCAOCTGTGT 1860 

TCCCCAACTC TGTTCTGCGC ACGAAACAGT ATCTGTTTGA GGCATAATCT TAAGTGGCCA 1920 

CACACAATGT TTTCTCTTAT GTTATCTGGC AGTAACTGTA ACTTGAATTA CATTAGCACA 1980 

TTCTGCTTAG CTAAAATTGT TAAAATAAAC TTTAATAAAC CCATGTAGCC CTCTCATTTG 2040 

ATTGACAGTA TTTTAGTTAT TTTTGGCATT CTTAAAGCTG GGCAATGTAA TGATCAGATC 2100 

TTTGTTTGTC TGAACAGGTA TTTTTATACA TGCTTTTTGT AAACCAAAAA CTTTTAAATT 2160 

TCTTCAGGTT TTCTAACATG CTTACCACTG GGCTACTGTA AATGAGAAAA GAATAAAATT 2220 
ATTTAATGTT TT 



Seq ID NO: 144 Protein sequence: 
Protein Accession #: NP_060601 



1 11 21 31 41 51 

I I I I I I 

MEIQLKDALE KNQQWLVYDQ QREVYVKGLL AKIFELEKKT ETAAHSLPQQ TKKPESEGYL 60 

QEEKQKCYND LLASAKKDLE VERQTITQLS FELSEFRRKY EETQKEVHNL NQLLYSQRRA 120 

DVQHLEDDRH KTEKIQKLRE ENDIARGKLE EEKKRSEELL SQVQSLYTSL LKQQEEQTRV 180 
ALLEQQMQAC TLDFENEKLD RQHVQHQLHV ILKELRKARK NNTVGILETA S 

Seq ID NO: 145 DNA sequence 
Nucleic Acid Accession #: NM_001168 
Coding sequence: 50..478 

1 11 21 31 41 51 

I I I I I I 

CCGCCAGATT TGAATCGCGG GACCCGTTGG CAGAGGTGGC GGCGGCGGCA TGGGTGCCCC 60 

GACGTTGCCC CCTGCCTGGC AGCCCTTTCT CAAGGACCAC CGCATCTCTA CATTCAAGAA 120 

CTGGCCCTTC TTGGAGGGCT GCGCCTGCAC CCCGGAGCGG ATGGCCGAGG CTGGCTTCAT 180 

CCACTGCCCC ACTGAGAACG AGCCAGACTT GGCCCAGTGT TTCTTCTGCT TCAAGGAGCT 240 

GGAAGGCTGG GAGCCAGATG ACGACCCCAT AGAGGAACAT AAAAAGCATT CGTCCGGTTG 300 

CGCTTTCCTT TCTGTCAAGA AGCAGTTTGA AGAATTAACC CTTGGTGAAT TTTTGAAACT 360 

GGACAGAGAA AGAGCCAAGA ACAAAATTGC AAAGGAAACC AACAATAAGA AGAAAGAATT 420 

TGAGGAAACT GCGAAGAAAG TGCGCCGTGC CATCGAGCAG CTGGCTGCCA TGGATTGAGG 480 

CCTCTGGCCG GAGCTGCCTG GTCCCAGAGT GGCTGCACCA CTTCCAGGGT TTATTCCCTG 540 

GTGCCACCAG CCTTCCTGTG GGCCCCTTAG CAATGTCTTA GGAAAGGAGA TCAACATTTT 600 

CAAATTAGAT GTTTCAACTG TGCTCCTGTT TTGTCTTGAA AGTGGCACCA GAGGTGCTTC 660 

TGCCTGTGCA GCGGGTGCTG CTGGTAACAG TGGCTGCTTC TCTCTCTCTC TCTCTTTTTT 720 

GGGGGCTCAT TTTTGCTGTT TTGATTCCCG GGCTTACCAG GTGAGAAGTG AGGGAGGAAG 780 

AAGGCAGTGT CCCTTTTGCT AGAGCTGACA GCTTTGTTCG CGTGGGCAGA GCCTTCCACA 840 

GTGAATGTGT CTGGACCTCA TGTTGTTGAG GCTGTCACAG TCCTGAGTGT GGACTTGGCA 900 

GGTGCCTGTT GAATCTGAGC TGCAGGTTCC TTATCTGTCA CACCTGTGCC TCCTCAGAGG 960 

ACAGTTTTTT TGTTGTTGTG TTTTTTTGTT TTTTTTTTTT GGTAGATGCA TGACTTGTGT 1020 

GTGATGAGAG AATGGAGACA GAGTCCCTGG CTCCTCTACT GTTTAACAAC ATGGCTTTCT 1080 

TATTTTGTTT GAATTGTTAA TTCACAGAAT AGCACAAACT ACAATTAAAA CTAAGCACAA 1140 

AGCCATTCTA AGTCATTGGG GAAACGGGGT GAACTTCAGG TGGATGAGGA GACAGAATAG 1200 

AGTGATAGGA AGCGTCTGGC AGATACTCCT TTTGCCACTG CTGTGTGATT AGACAGGCCC 1260 

AGTGAGCCGC GGGGCACATG CTGGCCGCTC CTCCCTCAGA AAAAGGCAGT GGCCTAAATC 1320 

CTTTTTAAAT GACTTGGCTC GATGCTGTGG GGGACTGGCT GGGCTGCTGC AGGCCGTGTG 1380 

TCTGTCAGCC CAACCTTCAC ATCTGTCACG TTCTCCACAC GGGGGAGAGA CGCAGTCCGC 1440 

CCAGGTCCCC GCTTTCTTTG GAGGCAGCAG CTCCCGCAGG GCTGAAGTCT GGCGTAAGAT 1500 

GATGGATTTG ATTCGCCCTC CTCCCTGTCA TAGAGCTGCA GGGTGGATTG TTACAGCTTC 1560 
GCTGGAAACC TCTGGAGGTC ATCTCGGCTG TTCCTGAGAA ATAAAAAGCC TGTCATTTC 

Seq ID NO: 146 Protein sequence: 
Protein Accession #: NP_001159 



1 11 21 31 41 51 

I I I I I I 

MGAPTLPPAW QPFLKDHRIS TFKNWPFLEG CACTPERMAE AGFIHCPTEN EPDLAQCFFC 60 

FKELEGWEPD DDPIEEHKKH SSGCAFLSVK KQFEELTLGE FLKLDRERAK NKIAKETNNK 120 
KKEFEETAKK VRRAIEQLAA MD 
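
The "Coding sequence" coordinates in each DNA entry are 1-based and inclusive, so the protein entry that follows can be checked by slicing that range and translating it with the standard genetic code. The following is only a minimal sketch of such a check, not part of the listing: the function name `translate_cds` and the `snippet` variable are illustrative assumptions, and the snippet holds just the first two rows (positions 1-120) of Seq ID NO: 145.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(listing: str, start: int, end: int) -> str:
    """Translate positions start..end (1-based, inclusive) of a listing.

    Spaces, line breaks and the position counters at the end of each row
    are dropped. Codons containing an unexpected character give 'X'.
    Translation stops at the first stop codon or when the text runs out.
    """
    dna = "".join(ch for ch in listing.upper() if ch.isalpha())
    cds = dna[start - 1:end]
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Hypothetical usage: first two rows of Seq ID NO: 145, coding sequence 50..478.
snippet = """
CCGCCAGATT TGAATCGCGG GACCCGTTGG CAGAGGTGGC GGCGGCGGCA TGGGTGCCCC 60
GACGTTGCCC CCTGCCTGGC AGCCCTTTCT CAAGGACCAC CGCATCTCTA CATTCAAGAA 120
"""
print(translate_cds(snippet, 50, 478))  # prints MGAPTLPPAWQPFLKDHRISTFK
```

The printed residues agree with the start of Seq ID NO: 146. Because an unrecognised character yields `X` rather than being skipped, a scanning error in the nucleotide text shows up as a visible mismatch instead of silently shifting the reading frame.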

Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession #: NM_014176.1 

Coding sequence: 127-720 

1 11 21 31 41 51 

I I I I I I 

GCGCGCAGCG CTGGTACCCC GTTGGTCCGC GCGTTGCTGC GTTGTGAGGG GTGTCAGCTC 60 

AGTGCATCCC AGGCAGCTCT TAGTGTGGAG CAGTGAACTG TGTGTGGTTC CTTCTACTTG 120 

GGGATCATGC AGAGAGCTTC ACGTCTGAAG AGAGAGCTGC ACATGTTAGC CACAGAGCCA 180 

CCCCCAGGCA TCACATGTTG GCAAGATAAA GACCAAATGG ATGACCTGCG AGCTCAAATA 240 

TTAGGTGGAG CCAACACACC TTATGAGAAA GGTGTTTTTA AGCTAGAAGT TATCATTCCT 300 

GAGAGGTACC CATTTGAACC TCCTCAGATC CGATTTCTCA CTCCAATTTA TCATCCAAAC 360 

ATTGATTCTG CTGGAAGGAT TTGTCTGGAT GTTCTCAAAT TGCCACCAAA AGGTGCTTGG 420 

AGACCATCCC TCAACATCGC AACTGTGTTG ACCTCTATTC AGCTGCTCAT GTCAGAACCC 480 

AACCCTGATG ACCCGCTCAT GGCTGACATA TCCTCAGAAT TTAAATATAA TAAGCCAGCC 540 

TTCCTCAAGA ATGCCAGACA GTGGACAGAG AAGCATGCAA GACAGAAACA AAAGGCTGAT 600 

GAGGAAGAGA TGCTTGATAA TCTACCAGAG GCTGGTGACT CCAGAGTACA CAACTCAACA 660 

CAGAAAAGGA AGGCCAGTCA GCTAGTAGGC ATAGAAAAGA AATTTCATCC TGATGTTTAG 720 

GGGACTTGTC CTGGTTCATC TTAGTTAATG TGTTCTTTGC CAAGGTGATC TAAGTTGCCT 780 

ACCTTGAATT TTTTTTTAAA TATATTTGAT GACATAATTT TTGTGTAGTT TATTTATCTT 840 

GTACATATGT ATTTTGAAAT CTTTTAAACC TGAAAAATAA ATAGTCATTT AATGTTGAAA 900 
AAAAAAAAAA AAAAAAAAAA AAAAAAAA 

Seq ID NO: 148 Protein sequence: 
Protein Accession #: NP_054895.1 

1 11 21 31 41 51 

I I I I I I 

MQRASRLKRE LHMLATEPPP GITCWQDKDQ MDDLRAQILG GANTPYEKGV FKLEVIIPER 
YPFEPPQIRF LTPIYHPNID SAGRICLDVL KLPPKGAWRP SLNIATVLTS IQLLMSEPNP 
DDPLMADISS EFKYNKPAFL KNARQWTEKH ARQKQKADEE EMLDNLPEAG DSRVHNSTQK 
RKASQLVGIE KKFHPDV 

Seq ID NO: 149 DNA sequence 
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224-2722 



1 11 21 31 41 51 

I I I I I I 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 
GCCCAGCCCC GAGCCCCGCG CCCOGTGCCC CGAGCCCGGA GCCCCCTGCC CGCGGCGGCA 120 
CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCOGGC 180 
GCTCTCGCCG GCCACACGGA GOGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 
CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300 
ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 
GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420 
GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 
GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 
AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTGGGA 600 
AAGCCCTTAT CAOGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660 
CCATCTGGCC CAGGCAAGCT TCCAGATTOA AGCCTTCGGC TCCAAATTCA TTCTTGAOCT 720 
CATACTQAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 
ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 840 
A6ACTCCAA6 GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900 
CTTOGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTOGACC 960 
ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 
GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 1080 
AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 
TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 
AAAGTCOGTO GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 1260 
CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 
GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 
GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 
TOTCTGTTCT OGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 
ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 
AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 1620 
GTCCCATTCT OGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 
AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 1740 
AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 
ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 
TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920 
GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 1980 
GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 
CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 
CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 
GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 2220 
TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280 
AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 
CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 
ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 
GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520 
AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 
GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATOGCT GGTGCCATCC TGGTAGCAGC 2640 
TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 
TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2760 
ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 2820 
CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 
CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940 
GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGG 3000 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA 



Seq ID NO: 150 Protein sequence: 
Protein Accession #: NP_003803 



MKPPGSSSRQ 
RPRAWGAAAP 



HYENGKPQYS 
KSTGRPHXXQ 
LELMIVNDHK 
ITTNPVQMLH 
LPMAVAQVLS 
RDFLQRGGGA 
SDGPCCNNTS 



11 

I 

PPLAGCSLAG 
SAPKWNETAE 
VLDTKARHQQ 
KGGEHCYYHG 
KTLAGQYSKQ 
TYKKHRSSHA 
EPSKYRQRIK 
QSLAQNLGIQ 
CLFNRPTKLF 
CLFQPRGYEC 
CQYIWGTKAA 



21 
1 

ASCGPQRGPA 
KNLGVLADED 
KHNKAVHLAQ 
SIRGVKDSKV 
MKNLTMERGD 
HTNNFAKSW 
QHADAVHLIS 
WEPSSRKPKC 
EFTECGNGYV 
RD AVNE CD IT 



31 
I 

GSVPASAPAR 
NTLQQNSSSN 
ASFQZEAFGS 
ALSTCNGLHG 
QWPPLSELQW 
NXVDSIYKEQ 
RVTFHYKRSS 
DCTESWGGCI 
EAGEECDCGP 
EYCTGDSGQC 
NTEGTEKGNC 



41 

I 

TPPCRLLLVL 
ISYSNAMQKE 
KFILDLILNN 
MFEDDTPVYM 
LKRRKRAVNP 
LNTRWLVAV 
LSYFGGVCSR 
MEETGVSHSR 
HVECYGLCCK 
PPNLHKQDGY 
GKDGDRWIQC 



SI 
I 

LLLPPLAASS 
ITLPSRLIYY 
GLLSSDYVEI 
IEPLELVHDE 
9RGIFEEMKY 
ETWTEKDQID 
TRGVGVNEYG 
KPSKCSILEY 
KCSLSNGAHC 
ACNQNOGRCY 
SKHDVPCGPL 



LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHVVLD DDTDVGYVED GTPCGPSMMC 720 

LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 780 
KDEGPKGPSA TNLIIGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQQG PI 

Seq ID NO: 151 DNA sequence 
Nucleic Acid Accession #: NM_023915 
Coding sequence: 250-1326 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 

TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 

CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020 

GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080 

AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140 

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 

TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 

ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320 

GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380 
TTCATTATCC TTAAAAAAAA AA 



Seq ID NO: 152 Protein sequence: 
Protein Accession #: NP_076404 

1 11 21 31 41 51 

I I I I I I 

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60 

GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 

FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 

ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 

Seq ID NO: 153 DNA sequence 
Nucleic Acid Accession #: D80008.1 
Coding sequence: 149-739 

1 11 21 31 41 51 

I I I I I I 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 

AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240 

AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300 

GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 360 

AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 420 

ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480 

GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 540 

TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGTCCG 600 

GTGTCTAAAA GACTATGGAG AATTTGAAGT TGATGATGGC ACTTCAGTCC TATTAAAAAA 660 

AAATAGCCAG CACTTTTTAC CTCGATGGAA ATGTGAGCAG CTGATCAGAC AAGGAGTCCT 720 

GGAGCACATC CTGTCATGAC CATGCGCCGA GGCACTTCCA GGCTTCACTC AACTCATGGA 780 

CTCCTCTGTA CTCACTCTCT CCACCACTCC CTTCACCTCC CTCTTTGATT TTAGAAGCTA 840 

TAGACATTGT TTAAGATAAC TAAGAATACT TGGCTAAGAA GTATAATTTG CTAACTATTA 900 

AGGACTTTCT TTTTTTAATG TTGTACACTA TTCTTCCTAC TCTTTTTTGG TTTTGGTTTT 960 

GTTTTGTAGA GACTGTCTCA CTATGTTGCC CAAGCTGGTC TCAAACTCCT GGCCTCAAGC 1020 

AGTCCTCCCA CCTTAGCTTC TCAAAGTGTT GAGATCACAG GCGTGAGCCA CTGCACCCGG 1080 

CCCCTACTCC TTTTTCTAAT AAGCTGTATC TGTAATCACA GCATTCCTAC AGTTGTTACA 1140 

GTGTGTTTTT TAAATGAAAG TAAACATGGT TACATTTGAA TCTCTTAAAT AAGCAGTCAC 1200 

TTGGCTGGAC AGGAAGAAGG TAGATCCTGT GTGTCTTGTT TTCTGGTCAT GTGTATTGTA 1260 

CAAGCTAGAG AGCTGAATTT CTGAGATACA CATTTTCAAA TCACATGCAA GTGAAGATGA 1320 

TGGTCTGTAG AAATTTTCAG TATATATAAT GTTTAATGAC ATACTAATTT ATCATCTGGC 1380 

TATTTGGGAA GGAAGGACAC ACATGGATTT TGCACATTTC CACCATGGTG GCTGGTGTGG 1440 

CTTGTGGCTA TGGGGTGATC ACCAGTATCA CCACTTTGGA AGGGGACAGT GAAATTGGGG 1500 

CTAGAGAAGG AACTTTGTAC AGTTTTCCCT GAGATTCAGA TTGACTGAAA AGTCACATGA 1560 

AGAGTTGATT GTCTTTTAAT GGTATGTTTT AAACAGCTGA CATTTTAAAT TTTGATGAAA 1620 

TCCAGTTTAT TCGTTTGTTC TTTTATGCTT TGGGTGTTGC ATCCGAGAAA TCTTTTCCCA 1680 

TCCCAAGATC ACAATTTTTT TTCCTTTTTA CTTCTAGAAG TGTTATAATT TTAAGCTTTA 1740 

TACTTTGGTC TATGACCCGT TTTTTTTTTT GTTTTGTTTT GTTTTTTCGT TTGTTTCTTT 1800 

GTTTTGAGAT GGAGTCTTGT TCTGTCACCC AGGCTGGGGT GCAGTGGCGT GATCTTGGCT 1860 

CACTGCAATC TCTATCCCCT GGGTTCAAGT GATTCTCTTG TCTCAGCCTC CCAAGTAGCT 1920 

GGGATTACAG GCACAGGCCG CCACGCCTGG CTAATTTTTG TATTTTTAGT AGAGACAGAG 1980 
TTTTACCATG TTGGCCAGGC TGGTTTCAAA CTCCTGACCT CAAGTGACCC ACCTTGGCCT 
CCCAAAGTTT TGGGATTACA AGTGTGGGCC ACCGCGGCCA GCCTATGATC CATTTTGAAT 
GAATTTTTTA TATGGTGCAA GGTGTCAATC CACCTTCACT TTTTCTTGGG AATATAGATA 
TCCAGCTGTT TCACTACCAT TTTTTGAAAG GACTGCCCTT TGCTCTATCA CCTTTGCATT 
TTTGTTAAAA AGTAGTTGTC AATGTATATG TGGGTTTATT TCAGGACTCT GTTTTGTTCC 
ATTGACCTGT TTTTCTCTCC TGAATGCCAA TACCATATTT GTATGTAGTG TATGTAATTT 
TCTAATAATT CTTGAAACAG ATAGTATTAA TGTGTCATAT TTTTGCTGTT GTTTGTATTT 
TTTGTAGAGA TGGGGTTTCA CCGTGTTGGC CAGGCTGTGT TGAACTCCTG AGCTAAAGCA 
ATACACTTGC CTCGTCCTCC CCATGTGCTG GGATTACAGG CGTGAGCCTT GGTGCTGGCC 
CAGTGTACCA CATTTCTTTT TGAGATTTGT TTTGGCTATG TTAAGTCCTT TGCTTTTGAT 
GTGAAATTTG GGAACAGGCA GGGTGTGGTG GCTTATGCCT GTAATCCTAG AACTTTGGGA 
GGCCTAGATG GGTGGATCAC TTGAGCTCAG GAGTTCCAGA CCAGCCCGGG CCTATGGCAA 
AACTCCGTCT CTACAAAAAA TAGAAAAAAT TAGCCAGGTG TGGTGGTGCA TGCCTGTAGT 
CACAGTTACA CGGCAGGCTG AGGTGGGAGG ATCACTTGAA CCCCAGAGGT CAAGACTGCA 
GTGAGCTGAG ATCACACCAC TGTACTCCAG CCTGGGTGAC AAAGTGAGAC TCTATCTCAA 
AAAGAAATTA GGATCAATTT GTCAATTTCT ACAACAACAA CAACAAAAAC CCCTGTTGGG 
CACCTTGATT GAGATTGCAT TGAATTTATA TAAAACTGTT GGGAGAATTG ACATCTTAAT 
AATATTGAGT CTTCTGGCCT ATAAACAAGG TCTGTCTTCC TAGGTATTAA TGTTTTGTCT 
TCTATTTCTC TTAATAATCT TTTGTAGTTT TCAGTGTACA GGTCTACCAT GTCAGCATTT 
CATAGTTTTG ATGCTAAATG GTATTTTAAA ATTTCAAATT CTAACCACTT GTTGCTAGTA 
AATAGAAATA CAATTGATGT TGAACTTGTA TCCTTCAGCC TTGCTAAACT GTGAGTTCTC 
ATGGTGTTTT TGTAAATTAC ATCAACAGTC ATGTGTTCTA TGAATAAAGA GTTTTACTCC 
TTC 

Seq ID NO: 154 Protein sequence: 
Protein Accession #: BAA11503.1 

1 11 21 31 41 51 

I I I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 
LATYMRSLGG DEGLDITQDM KPPKSLYIEV RCLKDYGEFE VDDGTSVLLK KNSQHFLPRW 
KCEQLIRQGV LEHILS 

Seq ID NO: 155 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 149-709 



1 11 21 31 41 51 

I I I I I I 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 
CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 
AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 
CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240 
AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300 
GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 360 
AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 420 
ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480 
GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 540 
TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGCTGG 600 
ATGCAGTGGC GCGATCTCGG CTCAACCTGC AACCTCCACC TCCCAGGTTC ACCTCAACTG 660 
CAACCTCCAC CTCCCAGGTC CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG 720 
GCACTTCAGT CCTATTAAAA AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC 780 
AGCTGATCAG ACAAGGAGTC CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC 840 
CAGGCTTCAC TCAACTCATG GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT 900 
CCCTCTTTGA TTTTAGAAGC TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG 960 
AAGTATAATT TGCTAACTAT TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT 1020 
ACTCTTTTTT GGTTTTGGTT TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG 1080 
TCTCAAACTC CTGGCCTCAA GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC 1140 
AGGCGTGAGC CACTGCACCC GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA 1200 
CAGCATTCCT ACAGTTGTTA CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG 1260 
AATCTCTTAA ATAAGCAGTC ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG 1320 
TTTTCTGGTC ATGTGTATTG TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA 1380 
AATCACATGC AAGTGAAGAT GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG 1440 
ACATACTAAT TTATCATCTG GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT 1500 
TCCACCATGG TGGCTGGTGT GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG 1560 
GAAGGGGACA GTGAAATTGG GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA 1620 
GATTGACTGA AAAGTCACAT GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT 1680 
GACATTTTAA ATTTTGATGA AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT 1740 
GCATCCGAGA AATCTTTTCC CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA 1800 
AGTGTTATAA TTTTAAGCTT TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT 1860 
TTGTTTTTTC GTTTGTTTCT TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG 1920 
GTGCAGTGGC GTGATCTTGG CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT 1980 
TGTCTCAGCC TCCCAAGTAG CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT 2040 
TGTATTTTTA GTAGAGACAG AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC 2100 
CTCAAGTGAC CCACCTTGGC CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC 2160 
CAGCCTATGA TCCATTTTGA ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA 2220 
CTTTTTCTTG GGAATATAGA TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC 2280 
TTTGCTCTAT CACCTTTGCA TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA 2340 
TTTCAGGACT CTGTTTTGTT CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT 2400 
TTGTATGTAG TGTATGTAAT TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT 2460 
ATTTTTGCTG TTGTTTGTAT TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT 2520 
GTTGAACTCC TGAGCTAAAG CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA 2580 
GGCGTGAGCC TTGGTGCTGG CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA 2640 
TGTTAAGTCC TTTGCTTTTG ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC 2700 
CTGTAATCCT AGAACTTTGG GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA 2760 
GACCAGCCCG GGCCTATGGC AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG 2820 
TGTGGTGGTG CATGCCTGTA GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG 2880 
AACCCCAGAG GTCAAGACTG CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG 2940 
ACAAAGTGAG ACTCTATCTC AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC 3000 
AACAACAAAA ACCCCTGTTG GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG 3060 
TTGGGAGAAT TGACATCTTA ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT 3120 
CCTAGGTATT AATGTTTTGT CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA 3180 
CAGGTCTACC ATGTCAGCAT TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA 3240 
TTCTAACCAC TTGTTGCTAG TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG 3300 
CCTTGCTAAA CTGTGAGTTC TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC 3360 
TATGAATAAA GAGTTTTACT CCTTC 

Seq ID NO: 156 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 
LATYMRSLGG DEGLDITQDM KPPKSLYIEA GCSGAISAQP ATSTSQVHLN CNLHLPGPVS 
KRLWRI 

Seq ID NO: 157 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-621 



1 11 21 31 41 51 

I I I I I I 

TTCGGCGCCA AAGCGCGGAG CGGAGGCGGA GGCGAGAGCC TGGCGCTGTA GGACTAGAAC 60 
GAAAGGAGTG AGGCGCCGAG AGCCCAGATA CCATTTTGGC GTGAGAGCTG GTGGTTGGCA 120 
AGGCCGCGGG AGTGGGAAGC GTCCGCCATG TTCTGCGAAA AAGCCATGGA ACTGATCCGC 180 
GAGCTGCATC GCGCGCCCGA AGGGCAACTG CCTGCCTTCA ACGAGGATGG ACTCAGACAA 240 
GTTCTGGAGG AGATGAAAGC TTTGTATGAA CAAAACCAGT CTGATGTGAA TGAAGCAAAG 300 
TCAGGTGGAC GAAGTGATTT GATACCAACT ATCAAATTTC GACACTGTTC TCTGTTAAGA 360 
AATCGACGCT GCACTGTAGC ATACCTGTAT GACCGCTTGC TTCGGATCAG AGCACTCAGA 420 
TGGGAATATG GTAGCGTCTT GCCAAATGCA TTACGATTTC ACATGGCTGC TGAAGAAGTC 480 
CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG GCACTTCAGT CCTATTAAAA 540 
AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC AGCTGATCAG ACAAGGAGTC 600 
CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC CAGGCTTCAC TCAACTCATG 660 
GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT CCCTCTTTGA TTTTAGAAGC 720 
TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG AAGTATAATT TGCTAACTAT 780 
TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT ACTCTTTTTT GGTTTTGGTT 840 
TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG TCTCAAACTC CTGGCCTCAA 900 
GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC AGGCGTGAGC CACTGCACCC 960 
GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA CAGCATTCCT ACAGTTGTTA 1020 
CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG AATCTCTTAA ATAAGCAGTC 1080 
ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG TTTTCTGGTC ATGTGTATTG 1140 
TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA AATCACATGC AAGTGAAGAT 1200 
GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG ACATACTAAT TTATCATCTG 1260 
GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT TCCACCATGG TGGCTGGTGT 1320 
GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG GAAGGGGACA GTGAAATTGG 1380 
GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA GATTGACTGA AAAGTCACAT 1440 
GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT GACATTTTAA ATTTTGATGA 1500 
AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT GCATCCGAGA AATCTTTTCC 1560 
CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA AGTGTTATAA TTTTAAGCTT 1620 
TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT TTGTTTTTTC GTTTGTTTCT 1680 
TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG GTGCAGTGGC GTGATCTTGG 1740 
CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT TGTCTCAGCC TCCCAAGTAG 1800 
CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT TGTATTTTTA GTAGAGACAG 1860 
AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC CTCAAGTGAC CCACCTTGGC 1920 
CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC CAGCCTATGA TCCATTTTGA 1980 
ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA CTTTTTCTTG GGAATATAGA 2040 
TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC TTTGCTCTAT CACCTTTGCA 2100 
TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA TTTCAGGACT CTGTTTTGTT 2160 
CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT TTGTATGTAG TGTATGTAAT 2220 
TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT ATTTTTGCTG TTGTTTGTAT 2280 
TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT GTTGAACTCC TGAGCTAAAG 2340 
CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA GGCGTGAGCC TTGGTGCTGG 2400 
CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA TGTTAAGTCC TTTGCTTTTG 2460 
ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC CTGTAATCCT AGAACTTTGG 2520 
GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA GACCAGCCCG GGCCTATGGC 2580 
AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG TGTGGTGGTG CATGCCTGTA 2640 
GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG AACCCCAGAG GTCAAGACTG 2700 
CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG ACAAAGTGAG ACTCTATCTC 2760 
AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC AACAACAAAA ACCCCTGTTG 2820 
GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG TTGGGAGAAT TGACATCTTA 2880 
ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT CCTAGGTATT AATGTTTTGT 2940 
CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA CAGGTCTACC ATGTCAGCAT 3000 
TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA TTCTAACCAC TTGTTGCTAG 3060 
TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG CCTTGCTAAA CTGTGAGTTC 3120 
TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC TATGAATAAA GAGTTTTACT 3180 
CCTTC 



Seq ID NO: 158 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 
TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE VRCLKDYGEF 
EVDDGTSVLL KKNSQHFLPR WKCEQLIRQG VLEHILS 

Seq ID NO: 159 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 149-229 



1 11 21 31 41 51 

I I I I I I 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 
CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 120 
AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 
CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACAATTAGC TGGGTGTGGT 240 
GGCACACACC TGTAGTCCCA GCAACTTAGG AGGCTGAAGT GAGAGGATTG CATGGCTCCA 300 
GGAAGTTGAA ACTGCAGTGA ACTGTGGTCA CGCTATTACA CTCCAGCCTG GGTGACAGAC 360 
TGAATCCCTG TCTCAAAAAG GAAAAGGAGG ATGGACTCAG ACAAGTTCTG GAGGAGATGA 420 
AAGCTTTGTA TGAACAAAAC CAGTCTGATG TGTTCTCTGT TAAGAAATCG ACGCTGCACT 480 
GTAGCATACC TGTATGACCG CTTGCTTCGG ATCAGAGCAC TCAGATGG 



Seq ID NO: 160 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

ATGTTCTGCG AAAAAGCCAT GGAACTGATC CGCGAGCTGC ATCGCGCGCC CGAAGGGCAA 
CTGCCTGCCT TCAACAATTA G 

Seq ID NO: 161 DNA sequence 
Nucleic Acid Accession #: U10694 
Coding sequence: 1333-2280 



GGATCCGGCC 
ACAGGGAGGC 
AGGAACCTAA 
TTTGCCCTGC 
CCTTTATCCT 
CACGTCAGCA 
CCCACTCACC 
CCGCTGGGTG 
AGGTCAACAG 
CTCACCCCAG 
AACCCTGGGC 
ATGTGAGCTC 
CAGGAGAAAG 
AGAACTCAAG 
CAGTCTGCAG 
CTTGGTCTGA 
TGAAGGTGAA 
GGACCCCATA 
AGGCTAGCTG 
ACCAGGAGGA 
GCCTTTGTTA 
TCCCCAGGCC 
ACCAGAGTCA 
GAAGCCCAAG 
GAGACTACCT 
CCTCCCCAGA 
AGCCAATTCG 
CCAGCTCAGC 
CATTTCCTGC 
AGCGTCATCA 
ATGCAGGTGA 
CTTGTCACTG 
AAGGCCGCCC 
GAAGAGGTTA 
TTCTACGGGG 
TACCGGCAGG 
CAOGCTGAAA 
CCCATCTGCT 
GCACCAGCCG 
GCCCCATGTG 
CAGTGGCAGT 
GTGATTTGGA 
AGTTTAATGA 
ATGTTATTTA 
TGAATTGGGA 
CAGCAAAATA 
ACCTCTTTCT 
TCTTTGAGCA 
TTTTTTCTTC 



11 
I 

GGATCTCAGG 
CCTGTGTTCG 
GGGAGGATCG 
CCCTACTGTC 
GGGATCACTG 
GAGGGAGGGT 
AAACACAGAG 
GATGGACTCC 
AGGGAGGGTT 
GACACATGGA 
AGGTGTGGGC 
CTGATCTGAG 
GTCAGGGCOC 
AGTGTCCAGC 
CCTAAGGGCC 
GACAGTGTCC 
GTGTTCACCC 
GCACCTGGCC 
CACGCTGAGT 
CAGGAGCCCC 
GAACCTCCAA 
TGTGGGTCTC 
TCATGTCTCT 
GAGAGGACTT 
CCTCCTCTGA 
GTCCTCAGGG 
ATGAGGGCTC 
TGGAGTTCAT 
TCCACAAATA 
AAAATTACAA 
TCTTTGGCAC 
CTCTTGGCCT 
TCCTGATCAT 
TCTGGGAAGC 
AGCCCAGGAA 
TGCCCGGCAG 
CCAGCTATGA 
ACCCATCCCT 
CAGCCGGGGC 
ACATGAGGCC 
GGGTGGAAGT 
GATTTATCCT 
ACTTCACCAT 
GGAGTAAGAT 
CAAGATAACA 
GAGCTCATAA 
CTCTCCTGTA 
TGTAAGAGAA 
AGACACGCAC 
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GAGGTGAGGA 
ACAGACACAG 
AGGGTACCTC 
ACCCCAGAGA 
GTGTCGGGGA 
CCCAGGCCCT 
GACCTAGCCC 
CCTCACTTCC 
CCAGACCCTG 
CCCCATTGAA 
AGATGTTGGT 
AGACTCTCAG 
TGAGTGAGCG 
CCGCCCTCTT 
CCTCGATTCC 
TCAGGTCGCA 
TGAATGTGCA 
CCATTCCCCC 
AGCCCTCTCA 
AAGAGGCCCC 
GGTTCGGTTC 
CATCGCCCAG 
CGAGCAGAGG 
GGGCCTGATG 
CAGCAAGGAG 
AGGCGCTTCC 
CAGCAGTCAA 
GTTCCAAGAA 
TCGAGTCAAG 
GCGCTACTTT 
TGATGTGAAG 
CTCGTGCGAT 
TGTCCTGGGT 
GTTGAGTGTG 
GCTGCTCACC 
TGATCCTGCG 
GAAGGTCATA 
TTATGAAGAG 
CAAAGTTTGT 
CATTCTTCGC 
GAGCACACTG 
TGCTCCCTTT 
CGAAGTTAAT 
TCTTGCTTTT 
TAGCAGAGGA 
AGAAATAGTG 
AAATTAAAAC 
ATAAAAATTG 
TGAACATCTG 
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CTTTGTTCTC 
TGGTCCCAGG 
CAGGCCAGAG 
GCCCGGGCAG 
GGGCTGGCCT 
GCCAGGAGTC 
CACCCTGCCC 
TCTTCAGGTG 
CAGGCATCAA 
TTTAGACATC 
TGGGGCATGT 
GCAAGTAGAG 
CAGAGGGGAC 
GACAGCACTG 
TCTTCCAGGA 
GAGCAGAGGA 
CCAAGGGCCC 
TACTGTCACT 
CTTCCTCCCT 
AGAGCAGCAC 
TCAGCTGAAG 
CTCCTGCCCA 
AGTCCGCACT 
GGTGCACAGG 
GAGGAGGTGT 
TCCTCCATTT 
GAAGAGGAAG 
GCACTGAAAT 
GAGCCGGTCA 
CCTGTGATCT 
GAGGTGGACC 
AGCATGCTGG 
GTGATCCTAA 
ATGGGGGTGT 
CAAGATTGGG 
CACTACGAGT 
AATTATTTGG 
GTTTTGGGAG 
GGGGTCAGGG 
TCTGTGTTTG 
TATGTCATCT 
TGGAATTGTT 
GAATGACAGT 
GAGTCACATG 
ATTAATAATT 
AAATGAAAAT 
ATATACATGT 
AAAGAATAAT 
TTATTCGGAA 
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AGAGGGTGTG 
ATTGGAGAGC 
AAACTCTCAG 
GGCTGTCTGC 
TGGTCTGAGG 
CAGGTGCAGA 
CTTGTGTCAG 
TCTCCTGGAG 
GATGAGGAGC 
TCTTACTGTA 
CCTTCTGTTC 
GAGTAGAGTC 
CATCCACCCC 
AGGGACCGGG 
GCTCCAGGAA 
GACCCAGGCA 
CACCTGCCCC 
CATAGAGCCT 
CAGGTTCTCG 
TGACGAAGAC 
TCTCTCACAC 
CGCTCCTGAC 
GCAAGCCTGA 
AACCCACAGG 
CTGCTGCTGG 
CCGTCTACTA 
AGCCAAGCTC 
TGAAGGTGGC 
CAAAGGCAGA 
TCGGCAAAGC 
CCGCCGGCCA 
GTGATGGTCA 
CCAAAGACAA 
ATGTTGGGAA 
TGCAGGAAAA 
TCCTGTGGGG 
TCATGCTCAA 
AGGAGCAAGA 
CCCCATCCAG 
AAGAGAGCAA 
CTGGGTTCCT 
CAAATGTTCT 
AGTCACACAT 
GGGAAATCCC 
TTTTTGAAAC 
GTAGTTAATT 
ATACCTGGAT 
TTTTCCTGTT 
CACCCTGGGT 
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TGTGGACAAA 
AGTCCAGGTG 
ATCAAGAGAG 
TGAGGTCCCT 
GGGCTGCACT 
CTGAGGGGAC 
CTGAGGGAAG 
ATAGGGCCTC 
AGGCAGTATC 
CTTCCGAGGA 
CATATCAGGG 
CAGTCCCTGC 
AAAAGTGTGT 
GCTCTGCCTG 
GCAGGCAGGC 
GTGTCAGCAG 
AGCACACATG 
TGATCTCTGC 
GGACAGGCTA 
GTGTAAGTCA 
ACTCCCTCTC 
TGCTGCCCTG 
TGAAGACCTT 
CGAGGAGGAG 
GTCATCAAGT 
CACTTTATGG 
CTCGGTCGAC 
TGAGTTGGTT 
AATGCTGGAG 
CTCOGAGTTC 
CTCCTACATC 
TAGCATGCCC 
CTGCGCCCCT 
GGAGCACATG 
CTACCTGGAG 
TTCCAAGGCC 
TGCAAGAGAG 
GGGAGTCTGA 
CAGCTGCCCT 
TCAGTGTTCT 
TGTCTATTGG 
TTTAATGGTC 
ATTGCTGTTT 
TGTTATTTTG 
TTGAACTTAG 
CTTGCCTTAT 
TTGCTTGGCT 
CACTGGCTCA 
T 
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Seq ID NO: 162 Protein sequence: 
Protein Accession ft: AAA68877.1 
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1 11 21 31 41 SI 

I I 1 I I I 

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60 
PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 120 
HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180 
LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMQVY VGKEHMFYGE 240 
PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300 
PSLYEEVLGE EQEGV 

Seq ID NO: 163 DNA sequence
Nucleic Acid Accession #: AF292100
Coding sequence: 30-809

1 11 21 31 41 51

| | | | | |

GGGGGGGGAG AGGCCTGGAG GACACCAACA TGAACAAGTT GAAATCATCG CAGAAGGATA 60 

AAGTTCGTCA GTTTATGATC TTCACACAAT CTAGTGAAAA AACAGCAGTA AGTTGTCTTT 120 

CTCAAAATGA CTGGAAGTTA GATGTTGCAA CAGATAATTT TTTCCAAAAT CCTGAACTTT 180 

ATATACGAGA GAGTGTAAAA GGATCATTGG ACAGGAAGAA GTTAGAACAO CTGTACAATA 240 

GATACAAAGA CCCTCAAGAT GAGAATAAAA TTGGAATAGA TGGCATACAG CAGTTCTGTG 300 

ATGACCTGGC ACTCGATCCA GCCAGCATTA GTGTGTTGAT TATTGCGTGG AAGTTCAGAG 360 

CAGCAACACA GTGCGAGTTC TCCAAACAGG AGTTCATGGA TGGCATGACA GAATTAGGAT 420 

GTGACAGCAT AGAACAACTA AAGGCCCAGA TACCGAAGAT GGAACAAGAA TTGAAAGAAC 480 

CAGGACGATT TAAGGATTTT TACCAGTTTA CTTTTAATTT TGCAAAGAAT CCAGGACAAA 540 

AAGGATTAGA TCTAGAAATG GCCATTGCCT ACTGGAACTT AGTGCTTAAT GGAAGATTTA 600 

AATTCTTAGA CTTATGGAAT AAATTTTTGT TGGAACATCA TAAACGATCA ATACCAAAAG 660 

ACACTTGGAA TCTTCTTTTA GACTTCAGTA CGATGATTGC AGATGACATG TCTAATTATG 720 

ATGAAGAAGG AGCATGGCCT GTTCTTATTG ATGACTTTGT GGAATTTGCA CGCCCTCAAA 780 

TTGCTGGGAC AAAAAGTACA ACAGTGTAGC ACTAAAGGAA CCTTTTAGAA TGTACATAGT 840 

CTGTACAATA AATACAACAG AAAATTGCAC AGTCAATTTC TGCTGGCTGG ACTGAACTGA 900 

AGATCAATCC TCACAATTCA GACTGAGGGT TGAGACAAAA CTTTAAGGAT ACATCTTGGA 960 

CCATATCGTA TTTCATTCTT CTAATGGTGG TTTGGGCTTG TCTTCTAGTC TGGGCCGCTC 1020 

TAAACATTTA TAATTCCAAC ATTGTGGATT TCATCTTATA TCTGTGGACC ATCCTAGTTT 1080 

ATTCTCCCAT AAGTCTTAGA AGCTTTATGG TGATTATTTT GAGGTTTTCA TTCTCGCATA 1140 

AAGCACAATG CTGTCTTCAT CAGAAAACAG TTGGCATAAG AATTAAACAT ATGAACATCA 1200 

CAAAACAATT TATAAAAACT TCTTAAATAT ACGCTTTGGG CTAGTTGCAA AGACTATGCT 1260 

AATAGCACTT CCAGTGAGAG TGATATATTT AAGTGTACTG GATCTGGAAT GGTGTTTTGG 1320 

TTTGGGGGGA ATTTTTTTTT TTTCCTGGCA AATCACATAT GTTGTTGATG TGAGTATCTG 1380 

ATGAAAAAAC AATGTCAGAA TAACCGACAT GAAAATTTTT TAGGATAACT TGGTGCCTAC 1440 

CTGAAAAATG TATTGTGTTT TAGACTCTTG ATTTCAAAAG GTTCCACAGA ACTAGTCTGC 1500 

GCTTACCTTA CCCATGTTTA TATATAGCTG TCCTACAGGG AGCTTTTATT TAGAAAATGT 1560 

CTGCATAATG TTAGATTCTT CTCCTGTCTA CATTATGCAC TACATAATTG GACTTCATTA 1620 

TGCTTTTGAA ATGCTTATCT GCCTGTCACA TAAGTTAAAC TATTTAATTT GTTTTGAATG 1680 

TTTTGGATTG CTACACAATA CAATATTCTA AATTTAGGCA TGAGGGTTTT TTTGTTTTAT 1740 

TTTTACTTTT TTTTTGTCAT TGCACTATGG AACACAAATG AAATTCTCTT AATTTATAAG 1800 

AAGATAGTAG GAGTTAAATT TTGAAAATGG TTGTGATGAG CCACGAAATT CAATCTTTAT 1860 

AATATAGGTA CTGCTCTTTC AGACAAACAG TCCATTTTTA ATGACTTCTT ATTTTGTTGA 1920 

AATTACTTTA ACTGCTAATC ACTGTGGTTG CCAAATATTT ACTTCAGAAG CAAAGATTTT 1980 

CAAACAAGCA TACACGATGC AAAATACCAG TCTGGCTTCT AGTCTATTTA CTGTTTTGTT 2040 

TCACTCAGAT TAGCTCAGTT TTCTCATCAA AGCAGAATGC TATCTTGCGT GTGTGTGTGT 2100 

GTGTGTGTGT GTGTGTGTGT GTATGTGTGT ATATATATAT ATATATATAT ATATATATTT 2160 

TTTTTTTTTT TTTTTTTTAA ATTACAAAAG CCATGAGCTG CTTTTATGCT GAAAATGGTC 2220 

ATTTCCCTGT TCACTTACTG ACATGTGAAG AAGGGTTTCT TGCTTTCTTA AACATTTCCG 2280 

TAAGGCAGGC TAGAAATGTA ATACTTCAAA TGTTTGATGA TTATGGTCTT TTGATAGGAA 2340 

TAGATTCTGC TTGGGATATA TATCCAGGCA CTCTCTAAGG TCTAGGGTTG ATATTAACAA 2400 

AGGAATGTAC TTAGAATAGC AGTACATTTT ATGCAAATAT GGAAATTATT TTAAGAAACA 2460 

ATGACATATC AAAACTGCTT TTTACATGAT TTTGAAATAG ACTAGAAAGC TTTCCCTATA 2520 

GACATATTAA TATTCCAATC ATAACTTTAA TTCAAGAATG CAGTTTTACC AAAAGAAAAA 2580 

TTTGAAAATT TCTATTCAGG CTACTGGAAT TGGTTATTAA AAGAAAAAGG AAAAAGAAGA 2640 

ATCTTGCTGC TTTCAGTATT TCCTGATTTT TTTGTAAATA TAAAGAGGAA CTTCAATTAT 2700 

GAAAAATTTT TAAAAGATAT ATATATCTAT ATATCTATAT ATATGTACTG TTTTGTTTCC 2760 

TGTCTTGAAG ATTTTGAGTT ATGGTTATTG GTTTCAGATT GATTAATTCA CATATGCTGT 2820 

GTTTTCTTTA AAAGTCATAT GGGTTCGTGG CCTAATGCCT TGGATTTTAC ATATTTTTCT 2880 

TTTTAAATGC AAAACCTTTT CAACAAAATA GTGTTTGTCA TCAGGTTGGT ACTAAACATT 2940 

TATAATTACT GTGTAATTAT AAACAAAAAT ACATAAAGCT TTGAATATAA TTATGTAGCA 3000 

TAAAAGTTAA GGTTGTTCAC TATGATGGCA TCTTAGAATT AAACAAAACT TTTACTAGGG 3060 

CTGAAAAGAG AAGACTGATT TAATGTGGTG TGATTATTCT GAAGATAAAT GTCTGGCTAC 3120 

AGGGAATATT TTGTACTAAA AAATGATTAC ACATATGGCT GTGTGTGTTT GAGTCTGTGT 3180 

CTGTGAGAGA GCCAGAGAGA GTGAGAGAGA TTGACAGAGA AAGGGAGAGA CACACACACG 3240 

CCCCTTGAAT TGCTTTAACT CCTAAGTGTT TCAGTCCTCA TTCCGGTAAA CTCCCCATGC 3300 

TGATTCTTTG TTTTAAACTG AACCATAGGT ACAGTTTCCT TTTTGCCAAA TGTCAAAACA 3360 

GGTACAAATT TTAAAATGTA ATGCTTTTTA AATAGAAAAA TGTATAAAAT TAGAAGTGCC 3420 

CACATATAAA AAATACTTGA GATGAAGATT ATCTTTAGTG AATATCATCT GCATATCTCT 3480 

GTAAGTTCAA TTGTGTTTCT TACAGTCCCT GTCATATTAC CAACAGAGGC AATAAAAGCT 3540 
GCAGTGAAAT TG 

Seq ID NO: 164 Protein sequence:
Protein Accession #: AAG00606

1 11 21 31 41 51

| | | | | |

MNKLKSSQKD KVRQFMIFTQ SSEKTAVSCL SQNDWKLDVA TDNFFQNPEL YIRESVKGSL 60

DRKKLEQLYN RYKDPQDENK IGIDGIQQFC DDLALDPASI SVLIIAWKFR AATQCEFSKQ 120

EFMDGMTELG CDSIEQLKAQ IPKMEQELKE PGRFKDFYQF TFNFAKNPGQ KGLDLEMAIA 180

YWNLVLNGRF KFLDLWNKFL LEHHKRSIPK DTWNLLLDFS TMIADDMSNY DEEGAWPVLI 240
DDFVEFARPQ IAGTKSTTV

Seq ID NO: 165 DNA sequence
Nucleic Acid Accession #: AF256215
Coding sequence: 220-2028

1 11 21 31 41 51

| | | | | |

CTCCAGTCCG CATGCTCAGT AGCTGCTGCC GGCCGGGCTG CGGGGCGGCG TCCGCTGCGC 60
GCCTACGGGC TGCGGTGGCG GCCGCCGCGG CACCCGGCAG GGCCCGCCAG TCCCCGCTTC 120
CCTGCTCCAG AGCCGCCGCC TGGGCCGGGG CAGGGCGGGC CCGGGGCTCC TCCATGCTGC 180
CAGCCGCCGG GCTGCGGAGC CGACCAAGTG GCTCCTGCGA TGGCGGCGGA AGAGGAGGCT 240
GCGGCGOGAG GTAAAGTGTT GAGAGAGGAG AACCAGTGCA TTGCTCCTGT GGTTTCCAGC 300
CGCGTGAGTC CAGGGACAAG ACCAACAGCT ATGGGGTCTT TCAGCTCACA CATGACAGAG 360
TTTCCACGAA AACGCAAAGG AAGTGATTCA GACCCATCCC AAGTGGAAGA TGGTGAACAC 420
CAAGTTAAAA TGAAGGCCTT CAGAGAAGCT CATAGCCAAA CTGAAAAGCG GAGGAGAGAT 480
AAAATGAATA ACCTGATTGA AGAACTGTCT GCAATGATCC CTCAGTGCAA CCCCATGGCG 540
CGTAAACTGG ACAAACTTAC AGTTTTAAGA ATGGCTGTTC AACACTTGAG ATCTTTAAAA 600
GGCTTGACAA ATTCTTATGT GGGAAGTAAT TATAGACCAT CATTTCTTCA GGATAATGAG 660
CTCAGACATT TAATCCTTAA GACTGCAGAA GGCTTCTTAT TTGTGGTTGG ATGTGAAAGA 720
GGAAAAATTC TCTTCGTTTC TAAGTCAGTC TCCAAAATAC TTAATTATGA TCAGGCTAGT 780
TTGACTGGAC AAAGCTTATT TGACTTCTTA CATCCAAAAG ATGTTGCCAA AGTAAAGGAA 840
CAACTTTCTT CTTTTGATAT TTCACCAAGA GAAAAGCTAA TAGATGCCAA AACTGGTTTG 900
CAAGTTCACA GTAATCTCCA CGCTGGAAGG ACACGTGTGT ATTCTGGCTC AAGACGATCT 960
TTTTTCTGTC GGATAAAGAG TTGTAAAATC TCTGTCAAAG AAGAGCATGG ATGCTTACCC 1020
AACTCAAAGA AGAAAGAGCA CAGAAAATTC TATACTATCC ATTGCACTGG TTACTTGAGA 1080
AGCTGGCCTC CAAATATTGT TGGAATGGAA GAAGAAAGGA ACAGTAAGAA AGACAACAGT 1140
AATTTTACCT GCCTTGTGGC CATTGGAAGA TTACAGCCAT ATATTGTTCC ACAGAACAGT 1200
GGAGAGATTA ATGTGAAACC AACTGAATTT ATAACCCGGT TTGCAGTGAA TGGAAAATTT 1260
GTCTATGTAG ATCAAAGGGC AACAGCGATT TTAGGATATC TGCCTCAGGA ACTTTTGGGA 1320
ACTTCTTGTT ATGAATATTT TCATCAAGAT GACCACAATA ATTTGACTGA CAAGCACAAA 1380
GCAGTTCTAC AGAGTAAGGA GAAAATACTT ACAGATTCCT ACAAATTCAG AGCAAAAGAT 1440
GGCTCTTTTG TAACTTTAAA AAGCCAATGG TTTAGTTTCA CAAATCCTTG GACAAAAGAA 1500
CTGGAATATA TTGTATCTGT CAACACTTTA GTTTTGGGAC ATAGTGAGCC TGGAGAAGCA 1560
TCATTTTTAC CTTGTAGCTC TCAATCATCA GAAGAATCCT CTAGACAGTC CTGTATGAGT 1620
GTACCTGGAA TGTCTACTGG AACAGTACTT GGTGCTGGTA GTATTGGAAC AGATATTGCA 1680
AATGAAATTC TGGATTTACA GAGGTTACAG TCTTCTTCAT ACCTTGATGA TTCGAGTCCA 1740
ACAGGTTTAA TGAAAGATAC TCATACTGTA AACTGCAGGA GTATGTCAAA TAAGGAGTTG 1800
TTTCCACCAA GTCCTTCTGA AATGGGGGAG CTAGAGGCTA CCAGGCAAAA CCAGAGTACT 1860
GTTGCTGTCC ACAGCCATGA GCCACTCCTC AGTGATGGTG CACAGTTGGA TTTCGATGCC 1920
CTATGTGACA ATGATGACAC AGCCATGGCT GCATTTATGA ATTACTTAGA AGCAGAGGGG 1980
GGCCTGGGAG ACCCTGGGGA CTTCAGTGAC ATCCAGTGGA CCCTCTAGCC TTTGATTTTT 2040
AACTCCAAAA ATGAGAAACA TTTTAAAGCA TTATTTACGA AAAAACTGTC TCAACTATTC 2100
TTAAGTACTG TATTGATATT GTTTGTATCT TTTATTAATG TTCTACCACT TTTTATAGAT 2160
TTGCATCTTC CTGTCACAGG GATGTGGGGA AATACGTTTT CCTCCCAAGA GAACCAAGTT 2220
TATTATAGAC TCCTTTATTC AGTGAAATGG CTTATAATCC ACTAGTTGCC ATATTTTTGC 2280
TAAAATATTT CTAACCAAGA ATACTACTTA CATATTGTTT TGGCTTTGrT TTATTTTTGA 2340
TGCAGTTTTT TTTAGTTGAG GTAATGTAAT ATATTGATGT TTTCCTTTGT GTCTAAGATT 2400
GATTTATAAT AGTAGGTTTG TATAATTTGG AACATTTTCC ATGCCTTGCG AATTTCCTTA 2460
ATTGAGGATA GGGCTTACAC ACTTTAAGAA AACAGTGAGT ACTTGAACAT TTAAAGGGAC 2520
AGTGCAATTT ATAGTCATAA TCACATTGAA TACTGTATTT GATCTTTGGA GACTTAGGCA 2580
AGCACAGAGC TGGGATATTT ATGCTCAGTT GAGCACTTTA AGATGAATTT TAAGTGAGAT 2640
GATTTCTTGC TTAAAACTCA GAAAGTCAAA AGAGTTTCAG CTTTCCTTAC AGAAAAGGAA 2700
GGATCTTGGG CCCTAGATCT TGGGGATTAA CCTCTGCATA TAAGATTTAC TCTTAATAGG 2760
CCAGACGTGG TGCTCACGCC TGTAATCCCA GTACTTTGGG AGGCTGAGAC GGGCAOATCA 2820
CTTGAGGTCA GGAGTTCAAG ACCAGCCTGG CCAATATGGT GAAACCCCGT TTCTACTAAA 2880
AATACAAAAA AAATTACCCA GGCACTCACT CTTGAGGTAA CTAACCAACT CCCACGATAA 2940
TGACAGTCCA TTCATGAGCG CAAAGGCCTC ATGACCTAAT GGCACACACC TGTAATCCCA 3000
ACTGCTTGGG AGGCTGAGGC GAGAGGATTG CTTGAACCTG GGAGGCAGAG GTTGCAGTGA 3060
GCCGAGATCG CACCACTGCA CTCCAGTCTG GGCAACAGAG TGAGACTTCA TCTCAAAAAA 3120
AGTAAAAAAA AAGATTTAAT ATAATCACTG AAGATCTCTA TTATAGATAG ATTAGGTTTT 3180
TGACATTGGA AACATACTTA GGGATAGATT TGTCCTAAAG GAAAAAAGTA GGCCCGGGCA 3240
GATTAAATGT CTTGTGTAAA GTCACACATT AAATTCAGTC ACACATTAAA TTCATAGAGT 3300
TTTAAATGTT TAATGTATAT AAACCAGTTT CTTTATACAC ATTTGGGAAA ACATTGGTCT 3360
CACAGATTAA ATGATTAACT AACTGACCCA GGAACTAGTT GTAGCTTTCT AAGTAATTAG 3420
GCAATTACAG TTATTGCCTG TAACCAAAGG TAATAAAACA AAATGACAAG TACATGTTTA 3480
AAATTATGAG GCAATGAGAA ATAATTTAAA AACCAATTTT CTAGTTATAA TTTAAAATTT 3540
GGAGAGCATT TTTAACAGTA ATTAATCCAG AGGTGGCTCA AATTGAGTAT AAGAATTAAG 3600
ATTATTTAAA ATACTGCATG TCTACCTTCT CGGGGATCAT ACTTTATAAC ACTTTCTGCT 3660
TCAGTAGCTC TTCATAGCTT GCCAAGTATG CTCCCATATT TTCTCTCTCG TGCCTCGCAA 3720
ATGAAAGTCA GATAGGCTGG GAACTCATGG GGCAGCCCTC AGACTTCAAT GTGGGCTTCA 3780
AATCCAGTTT CCTGTTCTAT ATGGTGCTAC ATCTTTCCAG AAAATTTCCC TCAGAGCCCC 3840
TCGCCAAAAC AAAGCATTAT TTTGACCCTG CATGCTATTT CTTTAGCTGT AGGTGATAGA 3900
TTAGAACTTC TGTCAGACAT GTTAATGACA AACATACCAA CAGACAATAA CCAAAGCAAA 3960
TGTTTCCTTC AAGTGTGAAA TGTGCAGGGG CTCGTGGGCA AGGATGTATT GGCACACTGT 4020
CCTCTTGAAC TGATAGTGTC CCAGCAATGT TGGAGGTTGG CACCATTCCT GGTCCGACAC 4080
TTGAGGACCT GAGAGACATC AGGTTTAGAA TGAGCCAAAG AAATCCTACA AGATGGGGAG 4140
AATTGGTGTG CAGCAGCCTA AGTGTTATAG TTAAGTCTAA AGAAGTATGA AAGATCCCCT 4200
GTGTTCTCTA AATTGAGCAG AGGGGCCTGC CTACCAATAT CACTTTTTAG GGGACTGAAC 4260
CATTGCAGGT TAGACTTGGC TTCCAAAGAG TCTGCCTAAG CCAGGGGTGG CAGGGTAGGC 4320
CATCATAGCT GGATGGCCTC AAAAGCAGAT GGGGGCAGAC TTGCCCTCGT GATGCCAGGA 4380
TTTGAGAGGC AGAGTTTCTA GAGGGAGACC AGTGCTGCCT CTCACAGTGG CAGTTTTTTC 4440
TCTTTGCAAG AGGAGGGGCT GTTCAATTCC ATAGACCAGT GGGCAGATAG CCAGTTGAAT 4500
ACTCTGTGCA TGGTTTGATC CTTTATTAGT TOGCTCTAAT ATTTTTCTGT AGATCCTTTT 4560
GTCCTGGACT CAAAATCTAA TCCATGCATT GTATGATACC GTAGCTCTCC TAAGGTTTGT 4620
GTrTCCTTCA AAATGTTTTA GTTTTCTTCA ACTAAATTTG ATTTTTGCTG TTAGAAGTGA 4680
CATATTTTTA TGGTATACAC TATGTTCCTT TTTTCTACTG CGAGTCAATT TTTTGAATTT 4740
TCGTGAGAAA GAATATATCT ACAAATTGCA CGAAAGTATC ATAAAAACAG TACTCTAGAG 4800
CAGCGCTGTC CAATAGAAAT ATAATCTGAG CCACATGTAT AATTTTATTT TCTTCTAGCC 4860
ACATTAAAGA AGTAAAAAGA TACAAGTAGA ACTAATTTTA ATGTTTTAAT TCAGTATATC 4920
CAAAATATCA TTTGAACATG TAATTAATAT AAAATTATTA ATGTGATATT TTACATTCTT 4980
TTGGTAATAC TAGTCTTCAA AATCTGGTAT GTATCTTACA TTGATAGCAC ATCTCACTTT 5040
GTACTAGCCA CATTGCAAGT GCTCAGTAGC CACATGTGGC TAGTGGCTAC TGCACTGGAC 5100
AGCACAGTTC TAGGTTCCAC CCTAACACCC AAGTCCTGTG GATTAGAATC CCAGAATCAG 5160
AGCTGGAAGT AAACATAGAG ATCAAACCTC CTTTTAAAAA TGAGGACGCT GAGGCACAGA 5220
GTTTAAATGG CTTGCATGAG GTCATACAGC TAAATTCAGC CTCAACAGGG TCTTCTGATT 5280
CCAGGCACTC TTCCCACTCC ACTACATTAC TGTAGTGGTA ATTCTTAGGG TTAAAAAAAG 5340
TGTAGAGTAG GCCGGGCGCA GTGGCTCATG CCTGTAATCC CAGCACTTTG GGAGGCCGAA 5400
GTGGGCGGAT CACGAGGTCA GGAGATCGAG ACCATCCTGG CCAACATGGT GAAACCCCGT 5460
CTCTACTGAA AATACAAAGC AAAATTAGCC AGGTGTGGTG GCGGGCGCCT GTGGTCCCAG 5520
CTGCTCTGGA GGCTGAGGCA GAATGGCGTG AACCCAGGAG GCAGAGATGG CAGTGAGCCA 5580
AGATCGCGCC ACTGCACCCC AGCCTGGGCX5 ACAGAGCGAG ACTCCATCTC AAAAAAAAAA 5640
AAAAAAAAAA AAGAAAAGAA AAGAAAAGTC TAGAGAACAT TATATTAAGT GGTTATTATT 5700
GAAGTAGACC AAAGTTTATA CCATAAGGAT ATTTTTCCTT AAATACCATG TTTGAAGAAC 5760
AATTATTTAT TGATCCTTGA ATCTGTAAGA TCAAATAACA AGTCTCTATC CATGTTACCA 5820
AATTTAACCT TTTGAAAATA ATAAACTTTA AAATATCAGA TGTGTTATTA CAGGATGATA 5880
CTTGGAATCA AGTGAAATGA GTTATATGGT CATCACTAAA TTTAGAAATC TATTGTGAAA 5940
CAAAGACAAA CAGGAAAGTA CAGAATAGAG ACTTTTAGTA AATAAATGGA ATTTAAAAGA 6000
AAGTGTTTAT TTACAGTGTC ACGACAGAAA AGGATGTCTT TGTTGTCATA GTCTTTGAGG 6060
GATCTCOGTA AAATCTGGGG CACAGGTACA AGAAATAGCC AATATTTAGT TCCCAGACCA 6120
TGTTTAGTAG TGTCCAGTTT CAGATCATGC TGCCAAGAGG TATCTCCCCC TCAGGTGGGT 6180
CATCACTGAG CCCTGGAATT GGAGACTCAT ACTTGCCCAG CACAATGTTA CGGGCAGACA 6240
GGCCGACATC TATGATTAGC TAGAAGCCAT AAAGAAAAGC TGCTAAGTGG CCACTAGGTG 6300
CCACTTTTCT GTTTTTGTAA TGCTTTCATT AGCAGATCTT TTTTTTCCAA GCTCCATGGG 6360
GCCTATGAGA GGCATTTATG ATTTTTGTGC CTACAATAAG TCAGCCTGTC TGGTGTGAGT 6420
TGTTTTATGA GAAATGCTTT CCAAGGGAGG TCTAGGAAGA TCCTGACACA TAAGAACTTT 6480
GGCTTAGAGA GCTTTCCAGG TGTAGTGCCA ATAAAAACTG ACCTGGAAAG AAAACCTGCC 6540
CAGCACGGAA CATGCTTTCT GAACTCACTT GAGAGTGTAT GGTGTATGTC ACTTCTCATA 6600
TATTCTTGAG TTTAGATTTG TCTTTTATAC AATTTTTAGC TCTTTTCCAG TTCACTTGTG 6660
CTCGTCTGTA TATTGGTATT TTTAAATTTT TGTGGTAAAT AATGAAAAGA GTGAAATTAT 6720
ATTTTATAAT TACTCATTTG TAGTTTTTTT TTTTAATTTA ATAAACTTCC TCCAAAAAGT 6780
GCTCCCTTAA AA

Seq ID NO: 166 Protein sequence:
Protein Accession #: AAG34652

1
I

MAAEEEAAAG
QVEDGEHQVK
QHLRSLKGLT
LNYDQASLTG
YSGSRRSPFC

11

21

GKVLREENQC IAPWSSRVS
MKAFREAHSQ TEKRRRDKMN

LPQBLLGTSC
TNPWTKELEY
SIGTDIANEI
TRQNQSTVAV
TL

QSIiFDFIiHPK
RIKSCKISVK
CLVAIGRLQP
YEYFHQDDHN
IVSVNTLVLG
LDLORLQSSS

DVAKVKEQLS
EEHGCLPNSK
YIVPQNSGEI
NLTDKHKAVL
HSEPGEASFL
YLDDSSPTGL
AQLDFDALCD

31

I

PGTRPTAMGS
NL I EEL SAM I
LILKTAEGFL
SFDISPREKL
KKEHRKFYTI
NVKPTEFITR
QSKEKILTDS
PCSSQSSEES
MKDTHTVNCR
NDDTAMAAFM

41
I

FSSHMTEPPR
PQCNPMARKL
FWGCERGKI
IDAKTGLQVH
HCTGYLRSWP
FAVNGKFVYV
YKFRAKDGSF
SRQSCMSVPG



51 
I 

KRKGSDSDPS 
DKLTVUIMAV 
LFVSKSVSKI 
SNLHAGRTRV 
PNIVGMEEER 
DQRATAILGY 
VTUCSQWFSF 
MSTGTVLGAG 



Seq ID NO: 167 DNA sequence
Nucleic Acid Accession #: NM_014400
Coding sequence: 86-1126



GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGAT 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 

CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 
GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 



11 

I 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGOGTGCAGA 
CCGGGCGTGG 
TCGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT 
GATGAATTCT 
TGCCAGGGGT 
CCACCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 
GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CATATGTCTT 
TGTGTGATCA 



21 
I 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 
GACAGCTCCT 
AGAGTGAGAG 
CACTTTCTCC 
CTCTAAGCAC 
CCTTACTAGA 
GTTTCTGGCA 



31 
I 

CCCGAGCTCG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTCGAC 
CTGTGTGGGC 
CAAOGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 
GTATCCTTCT 
AAGTCAGCTG 
TAGCCAGCCT 
TGCCTCCCCT 
CTGTGAGCTC 
CATAAATGCC 



41 
I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
COGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 
CATCCTTGTC 
TCACGGGGAA 
GGACTTTGGA 
ACTCCCCGCA 
CTCGAGGGCA 
TCAATAAAGA 



4860 
4920 
49B0 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 






NYLEAEGGLG DPGDFSDIQW 



51 
I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACX3CTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCGGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 
TCTCOGCTTG 
GGTGAGAGAG 
GCGTGGGGTG 
TCTTTGGGGA 
GGGACCGTGC 
TTTAATTACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 




TTGTATAGTG AAAAAAAA 

Seq ID NO: 168 Protein sequence:
Protein Accession #: NP_055215

1 11 21 31 41 51

| | | | | |

MDPASKAGAQ AMIWTAGWLL LLLLRGGAQA LBCYSCVQKA DDGCSPNKMK TVKCAPGVDV 
CTEAVGAVET IHG0FSLAVX GOGSGIiPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 
SRALDPAGN8 SAYPPNGVEC YSCVGbSREA CQGTSPPWS CYKASDHVYK GCPDGHVTUT 
AANVTVSLPV RGCVQDEPCT RDGVTGPGFT LSGSCCQGSR CNSDLRKKTY FSPRIPPLVR 
UPPEPTTVA STTSVTTSTS APVRPTSTTK PKPAPTSQTP RQGVEHEASR DEEPRLTGGA 
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGXiAALLLAV AAGVLL 






Seq ID NO: 169 DNA sequence
Nucleic Acid Accession #: NM_006875
Coding sequence: 186-1190



GAATTCGGCA 
CACCAGTTTC 
CCCGGGCGTC 
CCTCCATGTT 
CGCCAGGAGG 
GTAAGGGGGG 
CCATCAAAGT 
CATGCCCACT 
TGATCCGCCT 
CTTT6CC08C 
CAAGCCGCTG 
TTGTCCATCG 
AACTCATTGA 
6GACAAG66T 
CCACTGTCTG 
AGAGGGACCA 
GCTGTGCCCT 
AGATCCTGCT 
AAAGGAGGCC 
TGGCCCCCAA 
GTTGACTTGG 
ATTGAGGATC 
CAAAGGAGCC 
CTCATTTTGC 
TTATTTTGAT 
TCATATGCTT 
AGTAAAGGGA 
TCAGCCCAGG 
TTTTTTTTTT 
CTGGTGAGAA 
CACCACCAGA 
GCTTGCTGTT 
CTGAGCCGGG 
TCCAAGTGTG 
CCACTATTTA 



11 
I 

CGAGCGCGCG 
TCTGCTTTCC 
CACGCCCTGC 
GACCAAGCCT 
CAAGGATCGG 
CTTTGGCACC 
GATTCCCCGG 
CGAAGTCGCA 
GCTTGACTGG 
CCAGGATCTC 
CTTCTTTGGC 
TGACATCAAG 
TTTTGGTTCT 
GTACAGCCCC 
GTCACTGGGC 
GGAGATTCTG 
AATCCGCCGG 
GGACCCCTGG 
CTGCCCCTTT 
TGGTCAGAAG 
TTTTACAGGT 
AGGGGTTAGA 
TTCCTCCCAG 
TAAGGAAGTT 
GATGTGTCAC 
TTACTTGGGC 



ATTTTTTATT 
TTTTTTTTTG 
GAACCTTAAT 
CAATAGGATG 
TGTTTTCCTG 
ATTGTCCAAT 
CCCTCCTTTT 
ATAAAAGTAA 



21 • 
I 

GCGAATCTCA 
ACCCTGGCGC 
GGGCTTAGCG 
CTACAGGGGC 
GAAGCGTTCG 
GTCTTCGCAG 
AATCGTGTGC 
CTGCTATGGA 
TTTGAGACAC 
TTTGACTATA 
CAAGTAGTGG 
GATGAGAACA 
GGTGCCCTGC 
CCAGAGTGGA 
ATCCTCCTCT 
GAAGCTGAGC 
TGCCTGGCCC 
ATGCAAACAC 
GGCCTGGTCC 
AGCCATCCCA 
CATTACCAGT 
AGACATAAAC 
AACCTGTGGT 
TATTTTGGTG 
CCCACATTGG 
AAGGGTGCTT 
TAGCCTAGGG 
TTGGGGGAGG 
GGTGAGGGGA 
TCCATAATTT 
GGATGGATGG 
GGGCGCTCCC 
TACTAAAATG 
TTTTCCTGCC 
TAGAATCAGA 



31 

I 

ACGCTGCGCC 
CCCCCAGCCC 
GGTTCAGTGG 
CTCCOGCGCC 
AGGCCGAGTA 
GACACCGCCT 
TGGGCTGGTC 
AAGTGGGTGC 
AGGAAGGCTT 
TCACAGAGAA 
CAGCCATCCA 
TCCTGATAGA 
TTCATGATGA 
TCTCTCGACA 
ATGACATGGT 
TCCACTTCCC 
CCAAACCTTC 
CAGCCGAGGA 
TTGCTACCCT 
TGGCCATGTC 
CATTAAAGTC 
CAAGT TTGCC 
CCCTGATTTT 
AAGTTGTTCC 
CACCTCCTAC 
TCCTTCCAAT 
TCCCATATTG 
TAATGCCCTG 
CCCTACTTTO 
GGGAAGGAAT 
TTTTTTGGGG 
TCCAATTTTG 
TAAATAATCA 
TGGATTATTT 



41 
I 

GTCTGCGGGC 
TGGCTCCCCA 
GCTCAATCTG 
CCCCGGGACC 
TCGACTCGGC 
CACAGATCGA 
CCCCTTGTCA 
AGGTGGTGGG 
CATGCTGGTC 
GGGCCCACTG 
GCACTGCCAT 
CCTACGCCGT 
ACCCTACACT 
CCAGTACCAT 
GTGTGGGGAC 
AGCCCATGTC 
TTCCCGACCC 
TGTTACCCCT 
AAGCCTGGCC 
ACAGGGATAG 
CAGTATTACT 
CAGTTCCCTT 
GGAGGGGGAA 
CATTTTGAGC 
TACCACCACA 
ACCCCAGTAG 
GGTCAAGCTG 
TTGTTACCCC 
TTATCCCAAG 
GGAAGATGGA 
GATGGGCTAG 
CAGATTTTTG 
CGTATTGTGG 
AAAAAGCCAT 



51 
I 

GCTTCCGGGC 
GCTGCGCTGC 
CGCAGCGCCA 
CCCACGCCGC 
CCCCTCCTGG 
CTCCAGGTGG 
GACTCAGTCA 
CACCCTGGCG 
CTCGAGCGGC 
GGTGAAGGCC 
TCCOGTGGAG 
GGCTGTGCCA 
GACTTTGATG 
GCACTCCCGG 
ATTCCCTTTG 
TCCCCAGACT 
TCACTGGAAG 
CAACCCCTCC 
TGGCCTGGCC 
ATGGACATTT 
AAGGTAAGGG 
CCCAATCCTA 
CTTCTTGCTT 
CCCGGGACTC 
CAAACTTAGT 
CTTTTATTTT 
CTTACCTGCC 
AAGGCTTCTT 
T6CTCTTATT 
CACCACCGGA 
GGGAAATAAG 
CAACCTCCTC 
GGAGGGGAGT 
GTGTGGAAAC 



Seq ID NO: 170 Protein sequence: 
Protein Accession #: NP_006866 

1 11 21 31 41 51

| | | | | |

MLTKPLQGPP APPGTPTPPP GGKDREAFEA EYRLGPLLGK GGPGTVFAGH RLTDRLQVAI 
KVIPRNRVLG WSPLSDSVTC PLEVALLWKV GAGGGHPGVI RLLDWFETQE GFMLVLERPL 
PAQDLFDYIT EKGPLGBGPS RCPFGQWAA IQHCHSRGW HRDIKDENIL IDLRRGCAKL 
IDFGSGALLH DEPYTDFDGT RVYSPPEWIS RHQYHALPAT VWSLGILLYD MVCGDIPPER 
DQEILEAELH FPAHVSPDCC ALIRRCLAPK PSSRPSLEEI LLDPWMQTPA EDVTPQPLQR 
RPCPFGLVLA TLSLAWPGIA PNGQKSHPMA MSQG 

Seq ID NO: 171 DNA sequence
Nucleic Acid Accession #: NM_003646
Coding sequence: 89-2875



GCGGCGCGGA 
GCCCTCCGCC 
GAGCAGCGAC 
CGAGCCGGAC 
CGGGCACAGG 

GTCAGCGACA 
CGTTGGGGAG 
AGCCTGCAAG 
CTGTAAGCOG 
GCACCACTGG 
CCAGCAGAAG 



11 
I 

GCGGGCGTGC 
GGCCGGGGCT 
TCCGAGTCGG 
AAGGCGCCGC 
AAAGCCATCA 
CCGTGCAGCG 
TATGGGGAGC 
CAGTACTGTG 
ATTGTGGTGC 
TCCTTCCGTG 
GTACACAGAC 
TTCACCTTCC 



21 

I 

TGAGCCCCGG 
AGGGCCGGAT 
CTTCCGCCTC 
GGCGACTCAA 
CCAAGTCGGG 
AGTCAGAGCG 
ACATCTGGTT 
TAGCCAGGAT 
ACAOGCCCTG 
AATCAGGCTC 
GACGCCAGGA 
ACAGCAAGGA 



31 
I 

CCGCCGGCCC 
GGAGCCGCGG 
GTCCAGCGGC 
CAAGCGGCGC 
CCTCCAGCAC 
GCAGATCCGG 
CGAGACCAAC 
GCTGAAGTCA 
CATCGAGCAG 
CAGGAATGTC 
CGGCAAGTGT 
GATTGTGGCC 



41 

1 

GGCATGGGCG 
GACGGTAGCC 
TCCGAGCGCG 
TTCCCGGGGC 
CTGGCCCCCC 
AGTACAGTGG 
GTGTCCGGGG 
GTGTCTCGAA 
CTGGAGAAGA 
CGCGAGCCAA 
CGGCACTGTG 
ATCAGCTGCT 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



60 
120 
1B0 
240 
300 



51 
1 

TCTCCCGCGG 
CCGAGGCCCG 
ACGCCGGTCC 
TGCGGCTCTT 
CTCCGCCCAC 
ACTGGAGCGA 
ACTTCTGCTA 
GAAAGTGCGC 
TAAATTTCCG 
CCTTTGTAOG 
GGAAGGGATT 
CGTGGTGCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 




GCAGGCATAC CACAGCAAGG TGTCCTGCTT 
GCTGGGGGTC CACGCAGCCG TGGTCATCCC 
CCAGAATACT CTGAAAGCAA GCAAGAAGAA 
CAAGAAAGGG CCTGAGGAGG GCOGCTGGAG 
GCTCATGAAG CCCCTGCTGG TGTTTGTGAA 
GATCATCCAG TCTTTCCTCT GGTATCTCAA 
AGGGCCCAAG GAGGCGCTGG AGATGTACCG 
CGGGGGCGAC GGCACGGTGG GCTGGATCCT 
GCCACCCCCT GTTGCCATCC TGCCCCTGGG 
CTGGGGTGGG GGCTACACAG ATGAGCCTGT 
GAACGTGGTA CAGCTGGACC GCTGGGACCT 
TGAGGACCGA GATGAAGGCG CCACCGACCG 
CAGCCTGGGC TTTGACGCCC ACGTCACCCT 
AGAGAAATTC AACAGCCGCT TTCGGAATAA 
CTTCCTGATG GGCAGCTCCA AGGACCTGGC 
GGACTTGACT CCCAAGATCC AGGACCTGAA 
CAGGTACTGT GCGGGCACCA TGCCCTGGGG 
CCAGCGGCAT GACGACGGCT ACCTCGAGGT 
GCTGCAGGTG GGCGGACACG GCGAGCGGCT 
ATCCAAGGCC ATCCCGGTGC AGGTGGATGG 
CCGCATCGCC CTGCGCAACC AGGCCACCAT 
CCCCCTGCAC AGOGACCAGC AGCCGGTGCC 
CAGCATGCAC GACTATGAGG CCCTGCACTA 
GCCGCTGGGC ACTGTGGTGG TCCCAGGAGA 
TGAGAGACTC CAGCAGGAGC CCGATGGTGC 
GTCCCCCAAG TGGTGCTTCC TGGACGCCAC 
AGCCCAGGAG CACCTCAACT ATGTGACTGA 
CCCTGAGCTG CTGGGGGCAT CGGCCCGGCC 
CACCTCACCC TGCTCACCCA CGCCCCGGTC 
TGAAGAGCTG ATTGAGGCTG CCAAGAGGAA 
AGCTGGGGGC GACCTCATGC ACCGAGACGA 
CAGCACTGGC AGCAAGGATG TGGTCCGCTA 
TGATGCGGTG GAGGAAAACG GGGAGACCTG 
CACCATCTGC CACTACATCG TGGAGGCCGG 
CGACACTCCC CGGCAGCGGG CTGAGAAGGC 
GAACOGGCAG CACTACCAGA TGATCCAGCG 

Seq ID NO: 172 Protein sequence:
Protein Accession #: NP_003637






CATGCTGCAG 
GCCCACCTGG 
GAAGAGGGCA 
ACCCTTCATC 
CCCCAAGAGT 
TCCCCGACAA 
CAAAGTGCAC 
CTCCACCCTG 
TACTGGCAAC 
GTCCAAGATC 
CCACGCTGAG 
GTTGCCCCTG 
GGAGTTCCAC 
GATGTTCTAC 
CAAGCACATC 
ACCCCAGTGT 
CCACCCTGGG 
CATTGGCTTC 
GACGCAGTGT 
CGAGCCCTGC 
GGTGCAGAAG 
AGAGCAGTTG 
CGACAAGGAG 
CAGTGACCTA 
TGGAGCCAAG 
CACTGCCAGC 
GATCGCACAG 
TGACCTCCCA 
ACTGCAAGGG 
CGACTTCTGT 
GCAGAGTCGC 
CCTGCTGGAC 
TTTGCACCAA 
GGCCTCGCTC 
TCAGGACACC 
GGAGGACCAG 



MEPRDGSPEA 
GLQHLAPPPP 
ML.KSVSRRKC 
DGKCRHCGKG 
PPTVJILRARR 



11 
I 



21 
1 



LSTLDQLRLK 
LHAEPNPEAG 
KMFYAGTAFS 
GHPGEHHDFE 
GEPCKLAASR 
YDKEQliKEAS 
TTASRFYRID 
SLQGDAAPPQ 
YIiLDHAPPEI 
AQDTELAAYL 



TPGAPCSESE 
AACKIWHTP 
FQQKFTFHSK 
PQNTLKASKK 
KIIQSPLWYL 
PPPPVAILPL 
PEDRDEGATD 
DFLMGSSKDL 
PQRHDDGYLE 
IRIALRNQAT 
VPLGTWVPG 
RAQEHLNYVT 
GEELIEAAKR 
LDAVEENGET 
ENRQHYQMIQ 



RQIRSTVDWS 
CIEQLEKINF 
EIVAISCSWC 
KKRASFKRKS 
NPRQVFDLSQ 
GTGNDIARTL 
RLPLDVFNNY 
AKHIRWCDG 
VIGFTMTSLA 
MVQKAKRRSA 
DSDLELCRAH 
EIAQDEIYIL 
NDFCKItQELH 
CLHQAAALGQ 
REDQETAV 



31 
1 

PEPDKAPRRL 
ESATYGEHIW 
RCKPSFRESG 
KQAYHSKVSC 
SKKGPEEGRW 
GGPKEALEMY 
NWGGGYTDEP 
FSLGFDAHVT 
MDLTPKIQDL 
ALQVGGHGER 
APLHSDQQPV 
IERLQQEPDG 
DPELLGASAR 
RAGGDLMHRD 
RTICHYIVEA 



Seq ID NO: 173 DNA sequence
Nucleic Acid Accession #: AF232772
Coding sequence: 1-1662 



1 
I 

ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGOGCAAGT 
GTGGTGGATG 
GO CGGCACCG 
GGTGAGACGG 
AGCACCTTCT 
TTCAAGGCCC 
GATCCAGCCT 
GTCGGGGGAG 
GTGCGGTACT 
CAGTGTATTA 
GACTGGTACC 
ACCAACCGAG 
ACAGAGACCC 
TACTTCCGGG 
TACGAGTCAG 
TTCTACCGGG 
ATTATCAAGG 



11 

I 

AGCTGACGAC 
GCATCCTGGC 
CCTTCGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 
TCGGCGATTC 
GCACCATCGA 
ATGTCCAGAT 
GGATGGCCTT 
GTGGGCCCTT 
ATCAGAAGTT 
TCCTGAGCCT 
CCACTAAGTA 
AGTGGCTCTA 
TGGTCACGGG 
GCCGCATCTG 
CCACCTACGC 



21 
I 

AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 
GGTGGACTAC 
GATGCTTCGA 
CCTCAACAAG 
CAACGTGGAG 
GGGCATGTAC 
CCTAGGCAGC 
TGGCTACCGA 
CCTCCGGTGG 
CAACTCTCTG 
TTTCTTCCCC 
GAACATTCTC 
CTGCTTCCTT 



31 
I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG 
ATTGCCGCAT 
ATCTCCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACGGTG 
GGAGGCAAGC 
ATCCAGGTGT 
GTCCTGGAGG 
TACGACTCAT 
CGGGCCTGCC 
CGCAACAGCC 
AAGTGCAGCT 
ACTAAGTATA 
CTCAACCAGC 
TGGTTCCATA 
TTCTTCCTCA 
CTCTTCCTGC 
CGGGGCAATG 



CAGATCGAGG 
ATCCTCCGCG 
TCCTTCAAGA 
ATCAGGCCCA 
GGGGGCAACC 
GTCTTCGACC 
AACCTGCGGA 
GACCAGCTAC 
GACTTGGCCC 
CTCTCCCACG 
CCCAACCCCG 
GATGTCTTCA 
GAGTCTCGAG 
GCCGGGACAG 
CGAGTGGTGT 
GTTGTTTTCC 
GAGCACCACG 
ACCATGACGT 
CGCGAGGTGG 
AAGCTTGCAG 
GCCAAGCGGC 
CGCATCCAGG 
CAGCTCAAGG 
GAGCTCTGCC 
TCCCCGACAT 
CGCTTCTACA 
GATGAGATTT 
ACCCCCACTT 
GATGCTGCAC 
AAGCTCCAGG 
ACGCTCCTGC 
CACGCCCCCC 
GCAGCGGCCC 
ATGAAGACAG 
GAGCTGGCCG 
GAGACGGCTG 



41 

I 

NKRRFPGLRL 
FETNVSGDPC 
SRNVREPTPV 
FMLQQIEEPC 
RPFIIRPTPS 
RKVHNLRILA 
VSKILSHVEE 
LEFHESREAN 
KPQCWFLNI 
LTQCREWLT 
PEQLRIQVSR 
AGAKSPTCQK 
PDLPTPTSPL 
EQSRTLLHKA 
GASLMKTDQQ 



41 
I 

CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGOGGGATGT 
GCGAGGTCAT 
GCGACTCTGA 
AGGATCCCCA 
GGATTTCCTT 
AGTCCTACTT 
TCCTCCAGCA 
TCGGGGATGA 
CCGCGCGCTC 
AAACCCGCTG 
AGCACCACCT 
TTGCCACGGT 
TGACGGTGCA 
CAGAGATGAT 



AGCCGTGCTC 
CCCGGAGGCC 
GGAAGTCCAG 
CCCCCTCCCC 
AGGGTGCAAA 
TGAGCCAGGG 
TCCTGGCGTG 
GCCTGAAGCC 
GAACCCTCAA 
TGGAGGAGGG 
AGGCAGGGCC 
ACAACTACTT 
AGGCCAACCC 
CTTTCTCTGA 
GTGATGGAAT 
TGAACATCCC 
ACTTTGAGCC 
CGTTGGCCGC 
TGCTCACCAC 
CCTCACGCAT 
GGAGCGCCGC 
TGAGTCGCGT 
AGGCCTCTGT 
GTGCCCACAT 
GCCAGAAACT 
GGATCGACCG 
ATATCCTGGA 
CCCCTCTCCC 
CCCCTCAAGG 
AGCTGCACCG 
ACCACGCAGT 
CAGAGATCCT 
TGGGCCAGCG 
ACCAGCAGGG 
CCTACCTGGA 
TGTAGCGGGC 



51 
I 

FGHRKAITKS 
YVGEQYCVAR 
RHHWVHRRRQ 
SLGVHAAWI 
PLMKPLLVPV 
CGGDGTVGWI 
GNWQLDRWD 
PEKFNSRFRN 
PRYCAGTMPW 
TSKAIPVQVD 
VSMHDYEALH 
LSPKWCFLDA 
PTSPCSPTPR 
VSTGSKDWR 
GDTPRQRAEK 



51 
I 

TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 
CACTGTGCTG 
AGTAGGGGGA 
CCTGAGCAGC 
TGGCTGTGTG 
GTTCCTGGAG 
CCGGCACCTC 
CAAGTGCCTC 
GAGCAAGTCT 
CTGGATGACC 
TATACAGCTT 
GCTGGTGGGC 
CTTCATGTCC

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 
ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 
CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA CACAGCTTAT 
TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 
GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GACATGGCCC CCAAGCAGAG 
CGGGTAAAGT GCAATGGGTA AGGGAGGGAA GGGGAATGGA AGAGAAAAGA CAGGGTGGGA 
GGGAGGAGGG AGTGCTGTGT TTTAGTCTCT TAATGGTCCA AAGGACAAAT CTAAAATGCA 
AAGAACGGTG ATGTAGTATG GCCTGACAGC TCTGTTTAGA GGAGGCAACA CTGATCCCCC 
AGATGCAGGG CTGCAGGGGA TTCTGTGTTT TCAGACTGCC TGTCTGCTTG CATCTGCACA 
TAGGCAGTAG CCTCCTCCTG GGCTCCAGAG GGCACTCAGA AGTTGTGCTA AACCAAGTTA 
AGTCCCATTC AGTGGCAACT TGTGATAGGT ACCTGAGTGA CGGCAACCTG CGGAAGGAGG 
TTCTCCCAGC CCATCTGAAC ACAACCAGAG GTGGCAGGAG AATTTCTACT GAGCGAGGTG 
GGCCGGTTAG TGTATGTCAC CCCCACCCCA CCCATAAGTA GTCATCAATG CAATAAGATT 
GCGCGTGAGA TACAAGGCCC AGAAGCCTGA TCTTTGGGCA TCAGAAAACA GGGTCCAGGA 
ATGGTGCTTT ATGTGAGATA CCCCACTCCA CATCAACATT CCAGGGATGA GCCAAACCAG 
CAGGGAGTTA GCACTGAACT GCTTTTAAAA GTGCACATTA AAAAGGAAAG TTTGCCAGGA 
GGAACAAAGA GATTGTGGTG GTGCTAAAGG AGGCCATAAG CTACACAGAG GCCTTGGGTG 
TTCCACCTGG AAACTGCTCA GACGTCTAGA TGGGTTCTTA GCTTGTCTGT GATCTCTGCT 
GGGGAGATAA AAAGATTAAG CCCCAACATG TTCAGAAAAG AAGTGAAGTC TTGGGTATTT 
TAACCTGTAT ACTCTTGAAT TCCTCTCAAA TTCAGCTCTG ATCTGAGGCT AAGACACACT 
CCCCACTTCA CTTTCTTCAA AGCCACATTT TTTGAGGTAT CACTGCAGTC ACCTCTTCTA 
CCCTCATCAT CATAGGTAAG GTTTTCAAGG TGGCAATTGG GGCGGAGCCC CGGCTTCTTA 
TAGAAGCTTC AGCAGGAGGC AAGCGTGTTC TCAGCACATA TGGGAACTAT GAGGAGCCTC 
TGATCAAATT GGCTACAATC TTGGAGCTGC TTGGAGGGAT TCCTTGGCAG CCGGGTTAGC 
ATGTGTGACT TTCAGGCTAC TGTTCTTGAC AATCATCTCC AATGGAAAGC TTTTCAGTGT 
TCCCAAAGTG AACTCTCAAA TCCAAAATGG TTATCTTTGA GACCATCCAT TCTCCTCAGT 
GGCTTCTCCA GGGAATTCTT ACAGCCAAGT TGTGACAGTC ACTGCATTTG CCTGCTTCTT 
TCCAGAAACC AAACTAGGAG ATGAAACTGG TTCCTACATC CTAAGGTTCT TGCTTTCTCT
CTCATGCCTC CTGAGGCTGT TTTTGGCTGT TTTCCCTCTG CTGCTTTTGG GGAATGAGGG 
GAAGCCATTT TCCAAGTGAC TTGCAATCCA GGCTGTTCTC AGCGTTTTGA GTTTAAAACC 
TGGGATCCTG ACTAAGCCTT TGACTTAAGG GTTGCTTGCT TGCCCTCCAA ATGTCCTTTC 
TCAAAGGGGC CAACTAACCC GTGCAGAACC AGCACTAAGG TGGACAGCAG ACAAGAGGGC 
AAGCCTCTAA TGTACCAAGT GCTTCCTACA AAGACGCAAG GTGTGCTCCG AACCACAGAT 
GGGCAAACCC TGGTGCTTTC CTTCATCTCC CACGAACTCA AGGGTTTTCC AAGTGTAGCT 
AACAGTTGCC ACATCACACA GACCTCCAGT TTCTGGTAAG ACTGCTGGTT GACATCAGAC 
CCAACCCATT GAAGGCTGGA AGGCAGCAGG CATTTGCTAA GGCAGCTGAT CCAGGCAATC 
GTTCTGCTGG CCAAGAAGTT AAACTATTTT GAGCATTAGA ATGGAGGAAA TCCGGTCAGC 
CAAGTGCAGA GTTCAGACTT OGCTAAGGGC TTGTTTTTCT TCAGCATTTA CTTGAAGATT 
AATGTAGGAT GACAGGCTCT CCTGGCTGTC CTACCATCAG CTCTGCCTTG CACTGTGGTC 
GTCAACTTTC CTCAAATCAA AAACAGGCAG GTACAGGTAG TGGGCTCACA ACGTTTGACC 
TCGACTGGTT TTTCTAAGTT ATTTTGTACA TTTTTCAGCA GCAAAACCAA ACTGGGTCTT 
CAGCTTTATC CCCGTTTCTT GCAAGGGAAG AGCCTTTATA CAATTGGACG CATTTTGGTT 
TTTCCTCATT GAGAATTCAA ATCCTCTTTT GTATTGTTTC TACAATAATT TGTAAACATA 
TTTATTTTTA CCTGCTTTTT TTTTTTTTTT TAATTTTCAG GTCAAGTTTT TTATACTGCA 
CTTATTTGTC AAAATAAAGA TTCTCACAT 






Seq ID NO: 174 Protein sequence: 
Protein Accession #: AAF36984



MPVQLTTALR 
LFAFLEHRRM 
WDGNRQEDA 
STFSCIMQKW 
VGGDVQILNK 
DWYHQKFLGS 
YPREWLYNSL 
I IKATYACFL 
LIPVSIWVAV 
KKPEQYSLAF 



11 

I 

WGTSLPALA 
RRAGQALKLP 
YMLDIFHEVL 
GGKRBVMYTA 
YDSWISFLSS 



WFHKHHJjWMT 
RGNAEMIFMS 
LLEGLAYTAY 
ABV 



21 

I 

VliGGILAAYV 
SPRRGSVALC 
GGTEQAGFFV 
FKALGDSVDY 
VRYWMAFNVE 
TNRVLSLGYR 
YESWTGPPP 
LYSLLYMSSL 
CQDLFSETEL 



31 
I 

TGYQPIHTBK 
IAAYQEDPDY 
WRSNFHEAGB 
IQVCDSDTVL 
RACQSYFGCV 
TKYTARSKCL 
PFLIATVIQL 
LPAKIFAIAT 
APLVSGAILY 



41 

I 

HYLSFGLYGA 
IiRKCLRSAQR 
GBTEASLQEG 
DPACTIEMLR 
QCISGPLGMY 
TBTPTKYLRW 
FYRGRIWNIL 
INKSGWGTSG 
QCYWVALLML 



Seq ID NO: 175 DNA sequence 
Nucleic Acid Accession #: NM_000691
Coding sequence: 43-1404



CCAGGAGCCC 
GCCGTGAAGC 
TTCCAGCAGC 

GTCCTAGAGG 
GTGGAGAAGA 
GTGGTCCTCG 
GGCGCCA7CG 
GCGAGCCTGC 
AATGGGGGTG 
ACGGGCAGCA 
GTCACGCTGG 
GTGGCCTGCC 
CCAGACTACA 
TCACTGAAAG 
AGTGCCCGGC 
GGCACCGGGG 



11 
I 

CAGTTACCGQ 
GCGCCCGCGC 
TGGAGGCGCT 
CAGACCTGCA 
AGATCGAGTA 
CGCCCCAGAC 
TCATTGGCAC 
CTGCAGGGAA 
TGGCTACCAT 
TCCCTGAGAC 
CGGGGGTGGG 
AGCTGGGAGG 
GACGCATCGC 
TCCTCTGTGA 
AGTTCTACGG 
ACTTCCAGAG 
ATGCCGCCAC 



21 
1 

GAGAGGCTGT 
CGCCTTCAGC 
GCAGCGCCTG 
CAAGAATGAA 
CATGATCCAG 
TCAGCAGGAC 
CTGGAACTAC 
CGCAGTGGTC 
CATCCCCCAG 
CACGGAGCTG 
GAAGATCATC 
GAAGAGTCCC 
CTGGGGGAAA 
CCCCTCGATC 
GGAAGATGCT 
GGTGATGGGC 
TOGCTACATA 



31 

I 

GTCAAAGGCG 
TCGGGCAGGA 
ATCCAGGAGC 
TGGAACGCCT 
AAGCTCCCTG 
GAGCTCTACA 
CCCTTCAACC 
CTCAAGCCCT 
TACCTGGACA 
CTCAAGGAGA 
ATGACGGCTG 
TGCTACGTGG 
TTCATGAACA 
CAGAACCAAA 
AAGAAATCCC 
CTGATTGAGG 
GCCCCCACCA 



41 

I 

CCATGAGCAA 
CCCGTCCGCT 
AGGAGCAGGA 
ACTATGAGGA 
AGTGGGCCGC 
TCCACTCGGA 
TCACCATCCA 
GGGAGCTGAG 
AGGATCTGTA 
GGTTCGACCA 
CTGCCAAGCA 
ACAAGAACTG 
GTGGCCAGAC 
TTGTGGAGAA 
GGGACTATGG 
GCCAGAAGGT 
TCCTCACGGA 



1380 
3840
3900 
3960 
4020 



51 
I 

ILGLHLLIQS 
ISFPDLKWM 
MDRVRDWRA 
VLEHDPQVGG 
RNSLLQQFLE 
LNQQTRWSKS 
LFLLTVQLVG 
RKTIWNFIG 
YLAIIARRCG 



60 
180
420
480 
540 



51 
I 

GATCAGOGAG 
GCAGTTCCGA 
GCTGGTGGGC 
GGTGGTGTAC 
GGATGAGCCC 
GCCACTGGGC 
GCCCATGGTG 
TGAGAACATG 
CCCAGTAATC 
TATCCTGTAC 
CCTGACCCCT 
TGACCTGGAC 
CTGCGTGGCC 
GCTCAAGAAG 
AAGAATCATT 
GGCTTATGGG 
CGTGGACCCC 
CAGTCCCCGG
AGCCTGGAGG 
TTCTCCAGCA 
GCGGCCAACG 
AACAGCGGCA 
TCTTGCCTGG 
CCGGCCAAGA 
CCCATCGGAG 
CCCCAGCCCA 
GGGCCCAGGC 
CAATTTTCTA 



TGATGCAAGA 
AGGCCATCCA 
ACGACAAGGT 
ATGTCATCGT 
TGGGATCCTA 
TGAGGCCTCT 
TGACCCAGCA 
TGCGGACCAC 
GCCCCACTCC 
CTCACCATTC 
ACTCGG 



GGAGATCTTC 
GTTCATCAAC 
GATTAAGAAG 
CCACATCACC 
CCATGGCAAG 
GATGAATGAT 
CTGAGGAGGG 
CCTCACTGGC 
TCTGCTGACC 
CAAGTCTCCA 



CAGGGTGAGA 
ATGATTGCAG 
TTGCACTCTC 
AAGAGCTTCG 
GAAGGCCTGA 
GTTGCTCCGC 
TCTCCTGGCC 
TGCTGACCTG 
CCCCTTTCTA 



TGCCCATCGT 
AGCCCCTGGC 
AGACATCCAG 
TGCCCTTCGG 
AGACTTTCTC 
AGGTCAGATA 
CTGGCCTGGC 
CTGGAGAATC 
TGCACACCCC 
GACCAATAAA 



GTGCGTGCGC 
CCTCTACATG 
TGGTGGGGTG 
GGGCGTGGGG 
TCACCGCCGC 
CCCCCCGAGC 
CATACTGTGT 
GCTCCTGCAG 
ACTCCCACAT 
GAGACAAATA 



Seq ID NO: 176 Protein sequence:
Protein Accession #: NP_000682 



1 

I 

MSKI SEAVKR 
YEEWYVLEE 
TIQPMVGAIA 
FDHILYTGST 
GQTCVAPDYI 
QKVAYGGTGD 
PLAIjYMFSSN 
TFSHRRSCLV 



11 
I 

ARAAFSSGRT 
IEYMIQKLPE 
AGNAWLKPS 
GVGKI IMTAA 
LCDPSIQNQI 
AATRYIAPTI 
DKVIKKMIAE 
RPLMNDEGLK 



21 
I 

RPLQFRFQQL 
WAADEPVEKT 
ELSENMASLL 
AKHLTPVTLE 
VEKLKKSLKE 
LTDVDPQSPV 
TSSGGVAAND 
VRYPPSPAKM 



31 

I 

EALQRLIQEQ 
PQTQQDELYI 
ATIIPQYLDK 
LGGKSPCYVD 
FYGEDAKKSR 
MQEE1PGPVL 
VIVHITLHSIi 
TQH 



41 

I 

EQELVGAIAA 
HSEPLGWLV 
DLYPVINGGV 
KNCDLDVACR 
DYGRIISARH 
PIVCVRSLEE 
PFGGVGNSGM 



51 
I 

DLHKNEWNAY 
IGTWKYPFNL 
PETTELLKER 
RIAWGKFMNS 
FQRVMGLIEG 
AIQPIKQREK 



Seq ID NO: 177 DNA sequence

Nucleic Acid Accession #: NM_001067.1

Coding sequence: 108-4703 



CTAACCGACG 
CTGTTTAGTC 
CATTGCAGCC 
AGAAAAGACT 
TCCGCCCAGA 
ATGAAGATGT 
TTGATGAGAT 
TTAGAGTCAC 
TTCCTGTTGT 
TCCTAACTTC 
ATGGAGCCAA 
AATACAAGAA 
AACTCAAGCC 
AGTTTAAAAT 
ATATTGCTGG 
AAGGATTTCG 
CCTTGAAAGT 
AAAAAGGCTT 
ATGTTGATTA 
ACAAGGGTGG 
ATGCCTTAAT 
CCAAGAGCTT 
GTGGTATTGT 
AGAAGTGTTC 
ATGATGCAGG 
CCAAAACTTT 
CTCTTAGAGG 
CTGAGATTAA 
AAGATTCATT 
ATGGTTCCCA 
TGCGACATCG 
AAGAAATGGC 
ATAAAAAATG 
AAGAATACTT 
ATGATGCTGC 
TAACTAATTT 
TGTATGGACA 
TGTTCTCAAA 
GTCAGAGAAA 
CCCAATTAGC 
TGATGACCAT 
AGCCCATTGG 
ACATCTTTAC 
CGTTGAAGTT 
TTCCCATGGT 
CCAACTTTGA 
CTTTGCCAAT
ATCAATATGT 
AGCTTCCCGT 
ATGGCACCGA 
CTGTGAAATT 
TACACAAAGT 
ACGTAGGCTG 
TCAGACTTAA 
CTGCTAAACT 



11 

I 

CGCGTCTGTG 
GCTTTCAGGG 
TGTAAATGAA 
GTCTGTTGAA 
CACCTACATT 
TGGCATTAAC 
TCTAGTTAAT 
AATTGATCCG 
TGAACACAAA 
TAGTAACTAT 
ATTGTGTAAC 
AATGTTCAAA 
CTTCAATGGA 
GCAAAGCCTG 
ATCCACCAAA 
TAGTTATGTG 
AATACATGAA 
TCAGCAAATT 
TGTAGCTGAT 
TGTTGCAGTA 
TGAAAACCCA 
TGGATCAACA 
AGAAAGCATA 
AGCTGTAAAA 
GGGCCGAAAC 
GGCTGTTTCA 
AAAAATACTC 
CAATATCATC 
GAAGACGCTT 
CATCAAAGGC 
TTTTCTGGAG 
ATTTTACAGC 
GAAAGTCAAA 
TGCAGATATG 
TATCAGCCTG 
CATGGAGGAT 
AACTACCACA 
TTCTGATAAC 
GGTTTTGTTT 
TGGATCAGTG 
TATCAATTTG 
TCAGTTTGGT 
AATGCTCAGC 
TTTATATGAT 
GCTGATAAAT 
TGTGCGTGAA 
GCTTCCAAGT 
GATTAGTGGT 
CAGAACATGG 
GAAGACACCT 
TGTTGTGAAG 
CTTCAAACTC 
TTTAAAGAAA 
ATATTATGGA 
GAATAATCAG 



21 
I 

GAGAAGCGGC 
TTCTTGAGCC 
AATATGCAAG 
AGAATCTATC 
GGTTCTGTGG 
TATAGGGAAG 
GCTGCGGACA 
GAAAACAATT 
GTTGAAAAGA 
GATGATGATG 
ATATTCAGTA 
CAGACATGGA 
GAAGATTATA 
GACAAAGATA 
GATGTCAAAG 
GACATGTATT 
CAAGTAAACC 
AGCTTTGTCA 
CAGATTGTGA 
AAAGCACATC 
ACCTTTGACT 
TGCCAATTGA 
CTAAACTGGG 
CATAATAGAA 
TCCACTGAGT 
GGCCTTGGTG 
AATGTTCGAG 
AAGATTGTGG 
CGTTATGGGA 
TTGCTGATTA 
GAATTTATCA 
CTTCCTGAAT 
TATTACAAAG 
AAAAGACATC 
GCCTTTAGCA 
AGAAGACAAC 
TATCTGACAT 
GAGAGATCTA 
ACTTGCTTCA 
GCTGAAATGT 
GCTCAGAATT 
ACCAGGCTAC 
TCTTTGGCTC 
GACAACCAGC 
GGTGCTGAAG 
ATTGTAAATA 
TACAAGAACT 
GAAGTAGCTA 
ACCCAGACAT 
CCTCTCATAA 
ATGACTGAAG 
CAAACTAGTC 
TATGACACGG 
TTAAGAAAAG 
GCTCGCTTTA 



31 

I 

TTGGTCGGGG 
CCTTCACGAC 
TCAACAAAAT 
AAAAGAAAAC 
AATTAGTGAC 
TCACTTTTGT 
ACAAACAAAG 
TAATTAGTAT 
TGTATGTCCC 
AAAAGAAAGT 
CCAAATTTAC 
TGGATAATAT 
CATGTATCAC 
TTGTTGCACT 
TCTTTCTTAA 
TGAAGGACAA 
ACAGGTGGGA 
ACAGCATTGC 
CTAAACTTGT 
AGGTGAAAAA 
CTCAGACAAA 
GTGAAAAATT 
TGAAGTTTAA 
TCAAGGGAAT 
GTACGCTTAT 
TGGTTGGGAG 
AAGCTTCTCA 
GTCTTCAGTA 
AGATAATGAT 
ATTTTATCCA 
CTCCCATTGT 
TTGAAGAGTG 
GTTTGGGCAC 
GTATCCAGTT 
AAAAACAGAT 
GAAAGTTACT 
ATAATGACTT 
TCCCTTCTAT 
AACGGAATGA 
CTTCTTATCA 
TTGTGGGTAG 
ATGGTGGCAA 
GATTGTTATT 
GTGTTGAGCC 
GAATCGGTAC 
ACATCAGGCG 
TCAAGGGTAC 
TTCTTAATTC 
ACAAAGAACA 
CAGACTATAG 
AAAAACTGGC 
TCACATGCAA 
TGTTGGATAT 
AATGGCTCCT 
TCTTAGAGAA 



41 
I 

GTGGTCTCGT 
CGTCACCATG 
AAAGAAAAAT 
ACAATTGGAA 
CCAGCAAATG 
TCCTGGTTTG 
GGACCCAAAA 
ATGGAATAAT 
AGCTCTCATA 
GACAGGTGGT 
TGTGGAAACA 
GGGAAGAGCT 
CTTTCAGCCT 
AATGGTCAGA 
TGGAAATAAA 
GTTGGATGAA 
AGTGTGTTTA 
TACATCCAAG 
TGATGTTGTG 
TCACATGTGG 
AGAAAACATG 
TATCAAAGCT 
GGCCCAAGTC 
TCCCAAACTC 
CCTGACTGAG 
AGACAAATAT 
TAAGCAGATC 
CAAGAAAAAC 
TATGACAGAT 
TCACAACTGG 
AAAGGTATCT 
GAAGAGTTCT 
CAGCACATCA 
CAAATATTCT 
AGATGATCGA 
TGGGCTTCCT 
CATCAACAAG 
GGTGGATGGT 
CAAGCGAGAA 
TCATGGTGAG 
CAATAATCTA 
GGATTCTGCT 
TCCACCAAAA 
TGAATGGTAC 
TGGGTGGTCC 
TTTGATGGAT 
TATTGAAGAA 
TACAACCATT 
AGTTCTAGAA 
GGAATACCAT 
AGAGGCAGAG 
CTCTATGGTG 
TCTAAGAGAC 
AGGAATGCTT 
AATAGATGGC 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



51 

I 

GGGGTCCTGC 

GAAGTGTCAC 

GAAGATGCTA 

CATATTTTGC 

TGGGTTTACG 

TACAAAATCT 

ATGTCTTGTA 

GGAAAAGGTA 

TTTGGACAGC 

CGAAATGGCT 

GCCAGTAGAG 

GGTGAGATGG 

GATTTGTCTA 

AGAGCATATG 

CTGCCAGTAA 

ACTGGTAACT 

ACTATGAGTG 

GGTGGCAGAC 

AAGAAGAAGA 

ATTTTTGTAA 

ACTTTACAAC 

GCCATTGGCT 

CAGTTAAACA 

GATGATGCCA 

GGAGATTCAG 

GGGGTTTTCC 

ATGGAAAATG 

TATGAAGATG 

CAGGACCAAG 

CCCTCTCTTC 

AAAAACAAGC 

ACTCCAAATC 

AAGGAAGCTA 

GGTCCTGAAG 

AAGGAATGGT 

GAGGATTACT 

GAACTTATCT 

TTGAAACCAG 

GTAAAGGTTG 

ATGTCACTAA 

AACCTCTTGC 

AGTCCACGAT 

GATGATCACA

ATTCCTATTA 

TGCAAAATCC 

GGAGAAGAAC 

CTGGCTCCAA 

GAAATCTCAG 

CCCATGTTGA 

ACAGATACCA 

AGAGTTGGAC 

CTTTTTGACC 

TTTTTTGAAC 

GGTGCTGAAT 

AAAATAATCA 



60 
180
360
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 










TTGAAAATAA GCCTAAGAAA GAATTAATTA AAGTTCTGAT TCAGAGGGGA TATGATTCGG 
ATCCTGTGAA GGCCTGGAAA GAAGCCCAGC AAAAGGTTCC AGATGAAGAA GAAAATGAAG 
AGAGTGACAA CGAAAAGGAA ACTGAAAAGA GTGACTCCGT AACAGATTCT GGACCAACCT 
TCAACTATCT TCTTGATATG CCCCTTTGGT ATTTAACCAA GGAAAAGAAA GATGAACTCT 
GCAGGCTAAG AAATGAAAAA GAACAAGAGC TGGACACATT AAAAAGAAAG AGTCCATCAG 
ATTTGTGGAA AGAAGACTTG GCTACATTTA TTGAAGAATT GGAGGCTGTT GAAGCCAAGG 
AAAAACAAGA TGAACAAGTC GGACTTCCTG GGAAAGGGGG GAAGGCCAAG GGGAAAAAAA 
CACAAATGGC TGAAGTTTTG CCTTCTCCGC GTGGTCAAAG AGTCATTCCA CGAATAACCA 
TAGAAATGAA AGCAGAGGCA GAAAAGAAAA ATAAAAAGAA AATTAAGAAT GAAAATACTG 
AAGGAAGCCC TCAAGAAGAT GGTGTGGAAC TAGAAGGCCT AAAACAAAGA TTAGAAAAGA 
AACAGAAAAG AGAACCAGGT ACAAAGACAA AGAAACAAAC TACATTGGCA TTTAAGCCAA 
TCAAAAAAGG AAAGAAGAGA AATCCCTGGC CTGATTCAGA ATCAGATAGG AGCAGTGACG 
AAAGTAATTT TGATGTCCCT CCACGAGAAA CAGAGCCACG GAGAGCAGCA ACAAAAACAA 
AATTCACAAT GGATTTGGAT TCAGATGAAG ATTTCTCAGA TTTTGATGAA AAAACTGATG 
ATGAAGATTT TGTCCCATCA GATGCTAGTC CACCTAAGAC CAAAACTTCC CCAAAACTTA 
GTAACAAAGA ACTGAAACCA CAGAAAAGTG TCGTGTCAGA CCTTGAAGCT GATGATGTTA 
AGGGCAGTGT ACCACTGTCT TCAAGCCCTC CTGCTACACA TTTCCCAGAT GAAACTGAAA 
TTACAAACCC AGTTCCTAAA AAGAATGTGA CAGTGAAGAA GACAGCAGCA AAAAGTCAGT 
CTTCCACCTC CACTACGGGT GCCAAAAAAA GGGCTGCCCC AAAAGGAACT AAAAGGGATC 
CAGCTTTGAA TTCTGGTGTC TCTCAAAAGC CTGATCCTGC CAAAACCAAG AATCGCCGCA 
AAAGGAAGCC ATCCACTTCT GATGATTCTG ACTCTAATTT TGAGAAAATT GTTTCGAAAG 
CAGTCACAAG CAAGAAATCC AAGGGGGAGA GTGATGACTT CCATATGGAC TTTGACTCAG 
CTGTGGCTCC TCGGGCAAAA TCTGTACGGG CAAAGAAACC TATAAAGTAC CTGGAAGAGT 
CAGATGAAGA TGATCTGTTT TAAAATGTGA GGCGATTATT TTAAGTAATT ATCTTACCAA 
GCCCAAGACT GGTTTTAAAG TTACCTGAAG CTCTTAACTT CCTCCCCTCT GAATTTAGTT 
TGGGGAAGGT GTTTTTAGTA CAAGACATCA AAGTGAAGTA AAGCCCAAGT GTTCTTTAGC 
TTTTTATAAT ACTGTCTAAA TAGTGACCAT CTCATGGGCA TTGTTTTCTT CTCTGCTTTG 
TCTGTGTTTT GAGTCTGCTT TCTTTTGTCT TTAAAACCTG ATTTTTAAGT TCTTCTGAAC 
TGTAGAAATA GCTATCTGAT CACTTCAGCG TAAAGCAGTG TGTTTATTAA CCATCCACTA 
AGCTAAAACT AGAGCAGTTT GATTTAAAAG TGTCACTCTT CCTCCTTTTC TACTTTCAGT 
AGATATGAGA TAGAGCATAA TTATCTGTTT TATCTTAGTT TTATACATAA TTTACCATCA 
GATAGAACTT TATGGTTCTA GTACAGATAC TCTACTACAC TCAGCCTCTT ATGTGCCAAG 
TTTTTCTTTA AGCAATGAGA AATTGCTCAT GTTCTTCATC TTCTCAAATC ATCAGAGGCC 
AAAGAAAAAC ACTTTGGCTG TGTCTATAAC TTGACACAGT CAATAGAATG AAGAAAATTA 
GAGTAGTTAT GTGATTATTT CAGCTCTTGA CCTGTCCCCT CTGGCTGCCT CTGAGTCTGA 
ATCTCCCAAA GAGAGAAACC AATTTCTAAG AGGACTGGAT TGCAGAAGAC TCGGGGACAA 
CATTTGATCC AAGATCTTAA ATGTTATATT GATAACCATG CTCAGCAATG AGCTATTAGA 
TTCATTTTGG GAAATCTCCA TAATTTCAAT TTGTAAACTT TGTTAAGACC TGTCTACATT 
GTTATATGTG TGTGACTTGA GTAATGTTAT CAACGTTTTT GTAAATATTT ACTATGTTTT 
TCTATTAGCT AAATTCCAAC AATTTTGTAC TTTAATAAAA TGTTCTAAAC ATTGC 

Seq ID NO: 178 Protein sequence: 
Protein Accession #: NP_001058.1
MEVSPLQPVN
MWVYDEDVGI 
NGKGIPWEH 
TASREYKKMF 
RRAYDIAGST 
LTMSEKGFQQ 
WIPVNALIEN 
VQLNKKCSAV 
YGVPPLiRGKI 
DQDQDGSHIK 
STPNHKKWKV 
RKEWLTNFME 
GLtKPGQRKVL 
LNLLQPIGQP 
YIPIIPMVLI 
ELAPNQYVIS 
HTDTTVKFW 
DFFELRLiKYY 
GYDSDPVKAW 
KDELCRLRNE 
KGKKTQMAEV 



11 
I 

ENMQVNKIKK 
NYREVTFVPG 
KVEKMYVPAL 
KQTWMDNMGR 
KDVKVFLNGN 
ISPVNSIATS 
PTFDSQTKEN 
KHNRIKGIPK 
LNVREASHKQ 
GLLINFIHHN 
KYYKGLGTST 
DRRQRKLLGL 
FTCFKRNDKR 
GTRLHGGKDS 
NGAEGIGTGK 
GEVAILNSTT 
KMTEEKLAEA 
GLRKEWLLGM 
KEAQQKVPDS 
KEQELDTLXR 
LPSPRGQRVI 
GTKTKKQTOi 



21 

I 

NEDAKKRLSV 
LYKIFDEILV 
IFGQLLTSSN 
AGEMELKPFN 
KLPVKGFRSY 
KGGRHVDYVA 
MTLQPKSFGS 
LDDANDAGGR 
IMENAEINNI 
WPSLLR^RFL 
SKEAKEYPAD 
PEDYLYGQTT 
EVKVAQLAGS 
ASPRYIFTML 
SCKIPNFDVR 
IEISELPVRT 
ERVGLHKVFK 
LGAESAKLNN 



ATKTKFTMDL 
ADDVKGSVPL 
TKRDPAItNSG 
DFDSAVAPRA 



SSSPPATHPP 
VSQKPDPAKT 
KSVRAKKPIK 



KSPSDLWKED 
PRITIEMKAE 
AFKPZKKGKK 
EKTDDEDFVP 
DETEITNPVP 
KNRRKRKPST 
YLEESDEDDL 



31 

I ' 

ERIYQKKTQL 
NAADNKQRDP 
YDDDEKKVTG 
GEDYTCITFQ 
VDMYLKDKLD 
DQIVTKLVDV 
TCQliSEKFIK 
NSTECTLILT 
IKIVGLQYKK 
EEFITPIVKV 
MKRHRIQFKY 
TYLTYNDFIN 
VAEMSSYHHG 
SSIARLLFPP 
EIVNNIRRLM 
WTQTYKEQVL 
LQTSLTCNSM 
QARPILEKID 
ETEKSDSVTD 
LATPIEELEA 
AEKKNKKKIK 
RNPWPDSESD 
SDASPPKTKT 
KKNVTVKKTA 



41 

I . 

EHILLRPDTY 
KMSCIRVTID 
GRNGYGAKLC 
PDLSKFKMQS 
ETGNSLKVIH 
VKKKNKGGVA 
AAIG0GXVE5 
EGDSAKTLAV 
NYEDEDSLKT 
SKNKQEMAFY 
SGPEDDAAIS 
KELILFSKSD 
EMSLMMTIIN 
KDDHTLKFIiY 
DGEEPLPMLP 
EPMLNGTEKT 
VLFDHVGCXiK 
GKIIIENKPK 
SGPTFNYLLD 
VEAKEKQDEQ 



51 
I 

IGSVELVTQQ 
PENNLISIWN 
NIFSTKFTVE 
LDKDIVALMV 
EQVNHRWEVC 
VKAHQVKNHM 
ILNWVKFKAQ 
SGLGWGRDK 
LRYGXIMIMT 
SLPEFEEMKS 
IAFSKKQIDD 
NERSIPSMVD 
LAQNFVGSNN 
DDNQRVEPEW 
SYKNFKGTIE 
PPLITDYREY 
KYDTVLDILR 
KELIKVLIQR 
MPLWYLTKEK 
VGLPGKGGKA 
DGVELEGLKQ 
PPRETEPRRA 



SPKLSNKELK 
AKSQSSTSTT 
rVSKAVTSKK 



GAKKRAAPKG 
SK3ESDDFHM 



Seq ID NO: 179 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-7095 



CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 



11 
I 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 




TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 



TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440 
1500 



41 51 

I I 
GGAGGATTAA AACAAACAAA 
TGAGAAGCAG AGGAGCCGCA 
AGCGTTTCCT CGCTTGCATT 
ACTACAGACA ACAGAGAAAA 
ATCAAAAAAA TTGGGGAAAG 
ATATTGATGA AGATCTTACA 
GGGATAAAAC ATCATTGGAA 



60 
120 
180 
240 
300 
360 
420 







AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTGGCAGAC AACCCAACCG 2400

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 
AAAGGAAGGA GAAAATGTGA
TTTCTGGTCA CTCAGAAGAG 
CTAAGAAACA CAAAAATAAA 
ACACAGTATC ACTACACGCA 
CTGACCTTTG TGAGAAAGGC 
CACTGCAGTG CTGGAGTTGG 
CAGATTCAAC ACGAAGGAAC 
AGAAATTATT TGGTACAAAC 
GCCATACTTA GTAAAGAAAC 
CTCCTCATTC CTGGACCAGC 
CAGTCAAATA TACAGCAGAG 
AATCGAACTT CTTCTATCAT
GGAGAAGGCA CAGACTACAT 
TTCATCATTA CCCAGCACCC 
GACCATAATG CCCAACTGGT 
TTTGTTTACT GGCCAAATAA 
ATGGCTGAAG AACACAAATG 
TTAGAAGCTA CACAGGATGA 
CCAAATCCAG ATAGCCCCAT 
GCTGCCAATA GGGATGGGCC 
ACTTTCTGTG CTCTGACAAC 
TACCAGGTAG CCAAGATGAT 
TATCAGTTTC TCTACAAAGT 
TCCACCTCTC TGGACAGTAA 
GAGTCTTTAG TTTAACACAG 
TTCCTAAAAT TAGGCAGGAA 
GACAGTAACT TTCATGACAT 
CTTTTTGCAA GACTTGTAAT 
TATTTCTAAG AATGGAATTG 
TTTATAGAGG TTAGGAATTC 
GCTGTATTTG TAGCAATTAT 
AAATAAAACA CTCTTCCATA 
AGAAATAATC TGTTACTTAT 
TTTATAATTG TAGATTTTTA 
GTTTAGTTTA ATGAOGTAGT 
TTGTGTTACC TAAGTCATTA 
GAAATACCTT CATTTTGAAA 
AATGGTTTTT ATCCAAGGAA 
AAAAAAAAAA AAAAAAAAAA 



TCAGTACTGG 
TGTGCAAGTG 
AAAGGGCTCC 
GTGGCCTGAC 
AGCCTATGCC 
AAGAACAGGC 
TGTCAACATA 
TGAGGAGCAA 
TGAGGTGCTG 
AGGCAAAACA 
TGACTATTCT 
CCCTGTGGAA 
CAATGCCTCC 
TCTCCTTCAT 
GGTTATGATT 
AGATGAGCCT 
TCTATCTAAT 
TTATGTACTT 
TAGTAAAACT 
TATGATTGTT 
CCTTATGCAC 
CAATCTGATG 
GATCCTCAGC 
TGGTGCAGCA 
AAAGGGGTGG 
AATCAGTCTA 
AGGATTCTGC 
TTACTTATTA 
TGGTATTTTT 
CAAACTACAG 
CAGGTTTGCT 
TGATATTCAA 
TGTAAATACT 
TATTTTACTA 
TCATTAGCTG 
ACTTTGTTTC 
GAAGTTTTTA 
TTGCAAAAAT 



CCTGCCGATG 
CTTGCCTATT 
CAGAAAGGAA 
ATGGGAGTAC 
AAGCGCCATG 
ACATATATTG 
TTTGGCTTCT 
TATGTCTTCA 
GACAGTCATA 
AAGCTAGAGA 
GCAGCCCTAA 
AGATCAAGGG 
TATATCATGG 
ACCATCAAGG 
CCTGATGGCC
ATAAATTGTG 
GAGGAAAAAC 
GAAGTGAGGC 
TTTGAACTTA 
CATGATGAGC 
CAACTAGAAA 
AGGCCAGGAG 
CTTGTGAGCA 
TTGCCTGATG 
GGGGACTCAC 
GTTCTGTTAT 
CGCCAAATTT
TGTTTGAACT 
TTCTGTATTG 
AAAATGTTTG 
AGAAATATAA 
CATTTTACAA 
GCCCTAGTGT 
CTGAGTCAAG 
GTCTTACTCT 
AGCATGTAAT 
TGAGAATAAC 
AAATATAAAT 



Seq ID NO: 180 Protein sequence:
Protein Accession #: Eos sequence



MRILKRFLAC 
QSPINIDEDL 
FKASXITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HBFLTDGYQD 
LIGTEEIIKB 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
U3TTPAASSS 
ILPQVTSATB 
KTLMFSOVEP 
SLFSGPSHIP 
PVSVAEPTYT 
LNASLQETSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASEKYE 
EPLNTLINKL 
PHRDGSVTST 
KCMSCSSYRE 
SPGKSPSAKG 
ENETSTDFSF 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
SAALKQCNRE 
HTIKDFWRMI 
NEEKLIIQDF 
VHDEHGGVTA 
SLVSTRQBEN 



11 
1 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGRDIEEG 
GXGDVPNTSI, 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
DSALHATPVF 
SDKVPLHASL 
PSSDAMMHAR 
IPKSSLITPT 
TSVFGDDNKA 
SISSTKGMFP 
ASSDPA8SEM 
TPKVDKISST 
PVLLKSESSH 
IHSDEILTST 
KLI>FP SKATS 
SQEKVMNDSD 
LSQKHNDOKE 
ADTNEKDADG 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
KNRTSSUPV 
WDHNAQLWM 
ILEATQDDYV 
GTFCALTTLM 
PSTSliDSNGA 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EKSLBGQKFP 
BSVSRFGKQA 
AVFCEVIiTMQ 
DPENYTSUjV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
PSVDVSFESI 
PVAGGDLLLB 
SSGPEPSYAL 
ASL^QPTHAL 
LSKSEI IYGN 
GSLAHTTTKV 
LSPSTQLLFY 
MLKLIVSNSA 
QWPSLYSND 
KSSVTGKVFA 
ELSHSAKSDA 
THENSIiMDQN 
ENDIOTGSAL 
IIiAAGDSEJT 
IiVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
IjDSHIHAYVN 
ERSRVGISSL 
IPDGQNMAED 
liEVRHFQCPK 



31 
1 

KLVBEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRVVYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



GGAGTGAGGA 
ATACTGTGAG 
GACCCAGTGG 
CAGAGTACTC 
CAGTGGGGCC 
TGCTAGACAG 
TAAAACACAT 
TTCATGATAC 
TTCATGCCTA 
AACAATTCCA 
AGCAATGCAA 
TTGGCATTTC 
GCTATTACCA 
ATTTCTGGAG 
AAAACATGGC 
AGAGCTTTAA 
TTATAATTCA 
ACTTTCAGTG 
TAAGTGTTAT 
ATGGAGGAGT 
AAGAAAATTC 
TCTTTGCTGA 
CAAGGCAGGA 
GAAATATAGC 
ATCTGAGCAT 
CTGTTGATTT 
ATATCATTAA 
AAAATGATTG 
ATTTTAACAG 
TTTTTAGTGT 
CTTTTAATAC 
CTGCAGTATT 
CTCCATGGAC 
TTTTCTAGTT 
ACCAGTTTTC 
TTTAACTTTT 
ACCTTACCAA 
ATTGCCATTA 



41 

I 

TGALNQKNMG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKPAVLY 
SDQLIVDMPT 
ISTTTHYKRI 
QTVTELPPHT 
TGAEDSSGSS 



AliPDGNIAES 



KSFSAGPVMS 
PVYNGETPLQ 
LSSYDGAPtiL 
PSLAQYSDVL 
SDNEGSQHIP 
SGDGEWSGAS 
ETEIiQIPSPN 
FDHEISQVPE 
ETSASFSTEV 
SSENMLHSTS 
ELFQTANLEI 
GIPTVASDTP 
GLVGGGEDGD 
NPISYSLSEN 
LPLSPESKAW 
PGFPQSPTSS 
CLWLVGIbl 
LHASSGFTBE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYINA 
EFVYWPNKDE 
WPNPDSPISK 
VYQVAKMINL 
LESLV 



QGPSVTDLEM 
PSYSSEVFPL 
PFS8ASF8SE 
STTHAASETL 
TVSYSSA1PV 
SDSEFLLPDT 
EMVYPSESTV 
NNFSVQPTHT 
LLQPSPQASD 
VPVFDVSPTS 
KQABPPKGRH 
VSTDHSVPIG 
TDDDGDDDDD 
SEEDNRVTSV 
AVLTSDEESG 
VTSENSEVFH 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPA0GSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQLI* 
SYIMGYYQSN 
PINCESPKVT 
TFELISVIKE 
MRPGVFADIE 



GTACGGGAAC 
GAATTTTACT 
ACGTGTGGTC 
CCTGCCAGTG 
TGTTGTCGTC 
TATGTTGCAG 
CCGTTCACAA
ACTGGTTGAG 
TGTTAATGCA 
GCTCCTGAGC 
CAGGGAAAAG 
ATCCCTGAGT 
GAGCAATGAA 
GATGATATGG 
AGAAGATGAA 
GGTCACTCTT 
GGACTTTATC 
TCCTAAATGG 
AAAAGAAGAA 
GACGGCAGGA 
CGTGGATGTT
CATTGAGCAG 
AGAGAATCCA 
TGAGAGCTTA 
TGTTTTCCTC 
CCCATCACCT 
CAATGTGTGC 
AATTTTACAG 
AAAATTTCAA 
CAAATTTTTA 
AGTAGCCTGT 
CACCTAAAGT 
CAAATTTATA 
CTGTGTAATT 
TGACATTGTA 
GTGGAAAATA 
ACATTGTTCA 
AAAAAAAAAA 



51 
1 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDOTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
VTPIiLLDNQI 
LFRHLHTVSQ 
EFGSESGVLY 
HDSVGVTYOG 
DGLTALNISS 
MPNMYDNVNK 
VSQASGDTSL 
VDTLLKTVLP 
HMHSASLQGL 
VFATPVLSID 
NGHVAITAVS 
DRGSDGLSIH 
SSDSQTGMDR 
SGQGTSDSLN 
VSEAEASNSS 
FYLSDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVGTEB 
SQSNIQQSDY 
EFIITQHPLL 
LMAEEHKCLS 
EAANRDGPMI 
QYQFLYKVIL 



5700 
5760 
5820 
5880
5940 
6000 
6060 
6180
Seq ID NO: 181 DNA sequence

Nucleic Acid Accession #: Eos sequence
1
I 

CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 
AACACATTCA 
GTCAGCGGAG 
AAATGCAATA 
GAGATGCAAA 
GGAAAAGGGA 
GATTTCAAAG
TTAGATCCAT 
AATGGCTCAT 
ACAGTTAGCA 
TCTGGTTATG 
TTCTCTAGAC 
AGTTCAGAAC 
TGGGAAAGAC 
CAGTTGGATG 
GGTGCTATTC 
TGCACTAATG 
AATCCTGAAC 
GAAGAGGGAA 
AACCAAATCA 
ACGAAATACA 
AAGGGTGATG 
ACAGAAAAAG 
GAAGGTACTT 
AACTTGTCGG 
AGTTTATTGA 
GCAACTTCTG 
GAAAACCCAG 
GAAGATTCAA 
GTGTGGTTTC 
AGCTTTCTCC 
TCCTTTTCTG 
CATTATTCTA 
TCCAGACAAC 
GTATACAATG 
TTGGAATCCG 
TGTCTAGTGG 
TTTTACTTAG 
CCAATTTCAG 
TTACATGCAA 
GTGCAGAGCT 
AAGCACAAGA 
CAGCTTGCTG 
TACAACAGAC 
TTCTGGAGAA 
GAGAAAGGAA 
AACTTTCTGG 
ACTCTAAGAA 
GTCACACAGT 
GTGCTGACCT 
GTCCACTGCA 
CAGCAGATTC 
CAAAGAAATT 
GAGGCCATAC 
GCACTCCTCA 
AGCCAGTCAA 
AAGAATCGAA 
AGTGGAGAAG 
GAATTCATCA 
TGGGACCATA 
GAATTTGTTT 
CTTATGGCTG 
ATCTTAGAAG 
TGGCCAAATC 
GAAGCTGCCA 
GGAACTTTCT 
GTTTACCAGG 
CAGTATCAGT 
CCATCCACCT 
TTAGAGTCTT 
CTCTTCCTAA 
CCTGACAGTA 
TGCCTTTTTG 
CAGTATTTCT 
CAATTTATAG 
TTAGCTGTAT 
TGTAAATAAA 



11 
I 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG
GTGTTTGCCG
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 
TTCATAACAC 
GAGTTTCAGA 
TGTCATCTGA 
TCTACTGCTT 
AGTTAAGAGC 
CGATTATTGA 
TCATACTGTT 
TGACATCTCC 
TCTCTGAAAG 
TCATGCTGAT 
AGGTGTTTTC 
CAGAAAATGT 
CTCGAGTCGT 
GAGAGGACCA 
TCAATAATTT 
GCTTATATGG 
TTGATCTTTT 
AAGACATTGA 
GGAAAAAGGA 
ATGAAGCCAA 
TTCCCAATAC 
ATATTTCCTT 
CAGCCTCTTT 
GGACTGCAGA 
CCAGTTTCAA 
CTATCCCATT 
AGACAATAAC 
CTTCATCAGG 
CTAGCTCTAC 
AGACTAATTA 
CAGGCCCAGT 
CCTTTGCCTA 
AGGATTTGGT 
CAGAGGCCAG 
AGAAGAAGGC 
TTCTTGTGGG 
AGGACAGTAC 
ATGATGTCGG 
GTAGTGGGTT 
GTACTGTTGA 
ATCGATACAT 
AAAAGGATGG 
CAAAAGCTTA 
TGATATGGGA 
GGAGAAAATG 
TCACTCAGAA 
ACACAAAAAT 
ATCACTACAC 
TTGTGAGAAA 
GTGCTGGAGT 
AACACGAAGG 
ATTTGGTACA 
TTAGTAAAGA 
TTCCTGGACC 
ATATACAGCA 
CTTCTTCTAT 
GCACAGACTA 
TTACCCAGCA 
ATGCCCAACT 
ACTGGCCAAA 
AAGAACACAA 
CTACACAGGA 
CAGATAGCCC 
ATAGGGATGG 
GTGCTCTGAC 
TAGCCAAGAT 
TTCTCTACAA 
CTCTGGACAG 
TAGTTTAACA 
AATTAGGCAG 
ACTTTCATGA 
CAAGACTTGT 
AAGAATGGAA 
AGGTTAGGAA 
TTGTAGCAAT 
ACACTCTTCC 



21 
I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 
TGGGAAAACA 
AATGGTGTTT 
TGGATCAGAG 
TGATGCGGAC 
TTTATCCATT 
TGGAGTCGAA 
GAACCTTCTG 
TCCCTGCACA 
CCAGTTGGCT 
GGACTACTTA 
CTCATACACT 
TCAGGCTGAC 
TTATGATACC 
AACCAAGCAT 
GCTACCCAAT 
AAAATACAGC 
CCCTGAATTA 
AGAAGGCGCT 
ACCCCAGATT 
GACTAACCGA 
ATCTTTAAAT 
GACTTCTCAG 
AAATGATGGC 
ATCCTTAAAT 
GCTTGATACT 
CATCTCTGAG 
ATATGATGTC 
TTCAGAAGAA 
AGACATAACA 
CACTGAGATA 
GATGTCACAG 
CTTCCCAACT 
CTCCACGGTC 
TAATAGTAGC 
AGTTATACCC 
TATTCTCATC 
ATCCCCTAGA 
AGCAATTCCA 
TACTGAAGAA 
CTTAGGTATT 
AAATATCGTT 
CAAACTGACT 
TATTGCTGCC 
ACATAATGTG 
TGATCAGTAC 
GAGTGTGCAA 
AAAAAAGGGC 
GCAGTGGCCT 
GGCAGCCTAT 
TGGAAGAACA 
AACTGTCAAC 
AACTGAGGAG 
AACTGAGGTG 
AGCAGGCAAA 
GAGTGACTAT 
CATCCCTGTG 
CATCAATGCC 
CCCTCTCCTT 
GGTGGTTATG 
TAAAGATGAG 
ATGTCTATCT 
TGATTATGTA 
CATTAGTAAA 
GCCTATGATT 
AACCCTTATG 
GATCAATCTG 
AGTGATCCTC 
TAATGGTGCA 
CAGAAAGGGG 
GAAAATCAGT 
CATAGGATTC 
AATTTACTTA 
TTGTGGTATT 
TTCCAAACTA 
TATCAGGTTT 
ATATGATATT 



31 
I 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 
GTGGAAATTA 
AAAGCAAGCA 
CATAGTTTAG 
CGATTTTCAA 
TTGTTTGAGG 
AGTGTTAGTC 
CCAAACTCAA 
GACACAGTTG 
GTTTTTTGTG 
CAAAACAATT 
GGAAAGGAAG 
CCAGAGAATT 
ATGATTGAGA 
GAATTTTTGA 
ATGAGTTATG 
GACCAACTGA 
ATTGGAACTG 
ATTGTGAATC 
TCTACCACAA 
TCCCCAACAA 
TCCACTTCCC 
ACTGTGACTG 
TCTAAAACTG 
ACAGTTTCTA 
GGAGCTGAAG 
AACATATCCC 
CTTATACCAG 
TCACTAAAGG 
GCACAGCCCG 
CGTGTTGATG 
GGTCCCTCAG 
GAGGTAACAC 
AACGTGGTAT 
CATGAGTCTC 
CTTGTGATCG 
TACTGGAGGA 
GTTATATCCA 
ATAAAGCACT 
TTTGAGACAC 
ACAGCAGACA 
GCCTATGATC 
GATTATATCA
CAAGGCCCAC 
GAAGTTATTG 
TGGCCTGCCG 
GTGCTTGCCT 
TCCCAGAAAG 
GACATGGGAG 
GCCAAGCGCC 
GGCACATATA 
ATATTTGGCT 
CAATATGTCT 
CTGGACAGTC 
ACAAAGCTAG 
TCTGCAGCCC 
GAAAGATCAA 
TCCTATATCA 
CATACCATCA 
ATTCCTGATG 
CCTATAAATT 
AATGAGGAAA 
CTTGAAGTGA 
ACTTTTGAAC 
GTTCATGATG 
CACCAACTAG 
ATGAGGCCAG 
AGCCTTGTGA 
GCATTGCCTG 
TGGGGGGACT 
CTAGTTCTGT 
TGCCGCCAAA 
TTATGTTTGA 
TTTTTCTGTA 
CAGAAAATGT 
GCTAGAAATA 
CAACATTTTA 



41 
I

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 
ATCTCACTAA 
AGATAACTTT 
AAGGACAAAA 
GTTTTGAGGA 
TTGGGACAGA 
GTTTTGGGAA 
CTGACAAGTA 
ACTGGATTGT 
AAGTTCTTAC 
TTCGAGAGCA 
AGATTCATGA 
ATACCAGCCT 
AGTTTGCAGT 
CAGATGGCTA 
TTCTTCAGAT 
TTGTCGACAT 
AAGAAATAAT 
CTGGTAGAGA 
CACACTACAA 
GAGGAAGTGA 
AACCAGTCAC 
AACTGCCACC 
TTCTTAGATC 
TAACAGAATA 
ATTCTTCAGG 
AAGGGTATAT 
AATCTGCTAG 
ATCCTTCTAT 
ATGTTGGATC 
AATCTGAGAA 
TTACAGATCT 
CTCATGCTTT 
ACTCGCAGAC 
GTATTGGTCT 
TGTCAGCCCT 
AATGCTTCCA 
CACCTCCAAC 
TTCCAAAGCA 
TGAAAGAGTT 
GCTCCAACCA 
ATAGCAGGGT 
ATGCCAATTA 
TGAAATCCAC 
TCATGATAAC 
ATGGGAGTGA 
ATTATACTGT 
GAAGACCCAG 
TACCAGAGTA 
ATGCAGTGGG 
TTGTGCTAGA 
TCTTAAAACA 
TCATTCATGA 
ATATTCATGC 
AGAAACAATT 
TAAAGCAATG 
GGGTTGGCAT 
TGGGCTATTA 
AGGATTTCTG 
GCCAAAACAT 
GTGAGAGCTT 
AACTTATAAT 
GGCACTTTCA 
TTATAAGTGT 
AGCATGGAGG 
AAAAAGAAAA 
GAGTCTTTGC 
GCACAAGGCA 
ATGGAAATAT 
CACATCTGAG 
TATCTGTTGA 
TTTATATCAT 
ACTAAAATGA 
TTGATTTTAA 
TTGTTTTTAG 
TAACTTTTAA 
CAACTGCAGT 



51 
I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TTGGGGAAAG 
AGATCTTACA 
ATCATTGGAA 
TGACTACCGT 
TCACTGGGGA 
ATTTCCACTT 
AGCAGTCAAA 
AGAAAATTTG 
GCAGGCTGCT 
TTACATTTAC 
TTTTAAAGAT 
AATGCAACAA 
ACAGTACAAG 
AGCAGTTTGT 
TCTTGTTACA 
TTTGTACCAG 
TCAAGACTTG 
AGTAGCCATA 
GCCTACTGAT 
CAAGGAGGAG 
CAGTGCTACA 
TCGCATAGGG 
ATTCTCTGGA 
TAAATTAGCC 
TCACACTGTG 
TCCACATATG 
TGAGGAGGAG 
CTCCAGTCCC 
ATTTTCCTCC 
AAATGCTTCC 
GGAGGGAAAT 
AGGCAGAGAG 
GACAACCAAG 
GGAAATGCCA 
TACCCCATCC 
AACCCAACCG 
AGCTGAGGGG 
GACTTTTATC 
GACTGCACAC 
ACCTATCTTT 
TGTTGCAGAT 
TTACCAGGAA 
CCCAGACAAC 
TAAGCTAGCA 
TGTTGATGGC 
AGCTGAAGAT 
AAACCTCGTG 
GGAGTACGGG 
GAGGAATTTT 
TGGACGTGTG 
CTCCCTGCCA 
GCCTGTTGTC 
CAGTATGTTG 
CATCCGTTCA 
TACACTGGTT 
CTATGTTAAT 
CCAGCTCCTG 
CAACAGGGAA 
TTCATCCCTG 
CCAGAGCAAT 
GAGGATGATA 
GGCAGAAGAT 
TAAGGTCACT 
TCAGGACTTT
GTGTCCTAAA 
TATAAAAGAA 
AGTGACGGCA 
TTCCGTGGAT 
TGACATTGAG 
GGAAGAGAAT 
AGCTGAGAGC 
CATTGTTTTC 
TTTCCCATCA 
TAACAATGTG 
TTGAATTTTA 
CAGAAAATTT 
TGTCAAATTT 
TACAGTAGCC 
ATTCACCTAA 
Seq ID NO: 182 Protein sequence:
Protein Accession #: EOS sequence



1 
I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
PLVIVSALTF 
PIKHFPKHVA 
VAYDHSRVKL 
VEVIVMITNL 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 



TGTYIVLDSM 
VLDSHIHAYV 
VERSRVGISS 
MIPDGQNMAE 
VLEVRHFQCP 



AALPDGNIAE 



LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
ICLVVLVGIL
DLHASSGFTE 
AQLAEKDGKL 
VEKGRRKCDQ 
VVTQYHYTQW
LQQIQHEGTV 
NALLIPGPAG 
LSGEGTDYIN 
DEFVYWPNKD 
KWPNPDSPIS 
DVYQVAKMIN 
SLESLV 



21 
WANGYYRQQR
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNVVYSQTTQ
IYWRKCFQTA 
EFETLKEFYQ 
TDYINANYVD 
YWPADGSEEY 
PDMGVPEYSL 
NIFGFLKHIR 
KTKLEKQFQL 
ASYIMGYYQS 
EPINCESFKV 
KTFELISVIK 
LMRPGVFADI 



31 



KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRVVYD
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNAEASNS 
HFYLEDSTSP
EVQSCTVDLG 
GYKRPKAYIA 
GNFLVTQKSV 
PVLTFVRKAA 
SQRNYLVQTE 
LSQSNIQQSD 
NEFIITQHPL
TLMAEEHKCL
EEAANRDGPM 
EQYQFLYKVI 



Seq ID NO: 183 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 



CACACATACG 



CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 
AACACATTCA 
GTCAGCGGAG 
AAATGCAATA 
GAGATGCAAA 
GGAAAAGGGA 
GATTTCAAAG 
TTAGATCCAT 
AATGGCTCAT 
ACAGTTAGCA 
TCTGGTTATG 
TTCTCTAGAC 
AGTTCAGAAC 
TGGGAAAGAC 
CAGTTGGATG 
GGTGCTATTC 
TGCACTAATG 
AATCCTGAAC 
GAAGAGGGAA 
AACCAAATCA 
ACGAAATACA 
AAGGGTGATG 
ACAGAAAAAG 
GAAGGTACTT
AACTTGTCGG 
AGTTTATTGA 
GCAACTTCTG 
GAAAACCCAG 
GAAGATTCAA 
GTGTGGTTTC 
AGCTTTCTCC 
TCCTTTTCTG 
CATTATTCTA 
TCCAGACAAC 
GTATACAATG 



11 

I

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 
TTCATAACAC 
GAGTTTCAGA 
TGTCATCTGA 
TCTACTGCTT 
AGTTAAGAGC 
CGATTATTGA 
TCATACTGTT 
TGACATCTCC 
TCTCTGAAAG 
TCATGCTGAT 
AGGTGTTTTC 
CAGAAAATGT 
CTCGAGTCGT 
GAGAGGACCA 
TCAATAATTT 
GCTTATATGG 
TTGATCTTTT 
AAGACATTGA 
GGAAAAAGGA 
ATGAAGCCAA 
TTCCCAATAC 
ATATTTCCTT 
CAGCCTCTTT 
GGACTGCAGA 
CCAGTTTCAA 
CTATCCCATT 
AGACAATAAC 
CTTCATCAGG 
CTAGCTCTAC 
AGACTAATTA 
CAGGCCCAGT 
CCTTTGCCTA 
AGGATTTGGT 
AGGCCAGTAA 



21 

I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 
TGGGAAAACA 
AATGGTGTTT 
TGGATCAGAG 
TGATGCAGAC 
TTTATCCATT 
TGGAGTCGAA 
GAACCTTCTG 
TCCCTGCACA 
CCAGTTGGCT 
GGACTACTTA 
CTCATACACT 
TCAGGCTGAC 
TTATGATACC 
AACCAAGCAT 
GCTACCCAAT 
AAAATACAGC 
CCCTGAATTA 
AGAAGGCGCT 
ACCCCAGATT 
GACTAACCGA 
ATCTTTAAAT 
GACTTCTCAG 
AAATGATGGC 
ATCCTTAAAT 
GCTTGATACT 
CATCTCTGAG 
ATATGATGTC 
TTCAGAAGAA 
AGACATAACA 
CACTGAGATA 
GATGTCACAG 
CTTCCCAACT 
CTCCACGGTC 
TAGTAGCCAT 



31 
TCTATACACT
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 
GTGGAAATTA 
AAAGCAAGCA 
CATAGTTTAG 
CGATTTTCAA 
TTGTTTGAGG 
AGTGTTAGTC 
CCAAACTCAA 
GACACAGTTG 
GTTTTTTGTG 
CAAAACAATT 
GGAAAGGAAG 
CCAGAGAATT 
ATGATTGAGA 
GAATTTTTGA 
ATGAGTTATG 
GACCAACTGA 
ATTGGAACTG 
ATTGTGAATC 
TCTACCACAA 
TCCCCAACAA 
TCCACTTCCC 
ACTGTGACTG 
TCTAAAACTG 
ACAGTTTCTA 
GGAGCTGAAG 
AACATATCCC 
CTTATACCAG 
TCACTAAAGG 
GCACAGCCCG
CGTGTTGATG 
GGTCCCTCAG 
GAGGTAACAC 
AACGTGGTAT 
GAGTCTCGTA 



41 

I

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
SHESRIGLAE 
RVISTPPTPI 
ITADSSNHPD 
AQGPLKSTAE 
QVLAYYTVRN 
YAKRHAVGPV 
EQYVFIHDTL 
YSAALKQCNR 
LHTIKDFWRM 
SNEEKLIIQD 
IVHDEHGGVT 
LSLVSTRQEE 



41 
GGAGGATTAA
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 
ATCTCACTAA 
AGATAACTTT 
AAGGACAAAA 
GTTTTGAGGA 
TTGGGACAGA 
GTTTTGGGAA 
CTGACAAGTA 
ACTGGATTGT 
AAGTTCTTAC 
TTCGAGAGCA 
AGATTCATGA 
ATACCAGCCT 
AGTTTGCAGT 
CAGATGGCTA 
TTCTTCAGAT 
TTGTCGACAT 
AAGAAATAAT 
CTGGTAGAGA 
CACACTACAA 
GAGGAAGTGA 
AACCAGTCAC 
AACTGCCACC 
TTCTTAGATC 
TAACAGAATA 
ATTCTTCAGG 
AAGGGTATAT 
AATCTGCTAG 
ATCCTTCTAT 
ATGTTGGATC 
AATCTGAGAA 
TTACAGATCT 
CTCATGCTTT 
ACTCGCAGAC 
TTGGTCTAGC 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
GLESEKKAVI 
FPISDDVGAI 
NKEKNRYINI 
DFWRMIWEHN 
FTLRNTKIKK
VVHCSAGVGR
VEAILSKETE 
EKNRTSSIIP 
IWDHNAQLVV
FILEATQDDY 
AGTFCALTTL
NPSTSLDSNG 



51 
I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TTGGGGAAAG 
AGATCTTACA 
ATCATTGGAA 
TGACTACCGT 
TCACTGGGGA 
ATTTCCACTT 
AGCAGTCAAA 
AGAAAATTTG 
GCAGGCTGCT 
TTACATTTAC 
TTTTAAAGAT 
AATGCAACAA 
ACAGTACAAG 
AGCAGTTTGT 
TCTTGTTACA 
TTTGTACCAG 
TCAAGACTTG 
AGTAGCCATA 
GCCTACTGAT 
CAAGGAGGAG 
CAGTGCTACA 
TCGCATAGGG 
ATTCTCTGGA 
TAAATTAGCC 
TCACACTGTG 
TCCACATATG 
TGAGGAGGAG 
CTCCAGTCCC 
ATTTTCCTCC 
AAATGCTTCC 
GGAGGGAAAT 
AGGCAGAGAG 
GACAACCAAG 
GGAAATGCCA 
TACCCCATCC 
AACCCAACCG 
TGAGGGGTTG 



WO 02/086443

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC 
GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 
ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 
CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 
GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 
AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 
CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 
GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 
AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 
TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 
GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC
AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 
GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 
GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 
GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 
GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 
GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 
CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 
AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 
CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 
GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 
GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 
CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 
TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 
AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG
ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 
CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 
AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 
ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 
GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 
AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT
ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 
GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 
TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 
GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 
AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 
AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 
GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 
GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 
ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 
CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 
CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 
AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 
TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 
CAGCCCTGAC
GCTTCCAGAC 
CTCCAACACC 
CAAAGCATGT 
AGAGCTGTAC 
ACAAGAATCG 
TTGCTGAAAA 
ACAGACCAAA 
GGAGAATGAT 
AAGGAAGGAG 
TTCTGGTCAC 
TAAGAAACAC 
CACAGTATCA 
TGACCTTTGT 
ACTGCAGTGC 
AGATTCAACA 
GAAATTATTT 
CCATACTTAG 
TCCTCATTCC 
AGTCAAATAT 
ATCGAACTTC 
GAGAAGGCAC 
TCATCATTAC 
ACCATAATGC 
TTGTTTACTG 
TGGCTGAAGA 
TAGAAGCTAC 
CAAATCCAGA 
CTGCCAATAG 
CTTTCTGTGC 
ACCAGGTAGC 
ATCAGTTTCT 
CCACCTCTCT 
AGTCTTTAGT 
TCCTAAAATT 
ACAGTAACTT 
TTTTTGCAAG 
ATTTCTAAGA 
TTATAGAGGT 
CTGTATTTGT 
AATAAAACAC 
GAAATAATCT 
TTATAATTGT 
TTTAGTTTAA 
TGTGTTACCT 
AAATACCTTC 
ATGGTTTTTA 
AAAAAAAAAA 



TTTTATCTGT 

TGCACACTTT 

TATCTTTCCA 

TGCAGATTTA 

TGTTGACTTA 

ATACATAAAT 

GGATGGCAAA 

AGCTTATATT 

ATGGGAACAT 

AAAATGTGAT 

TCAGAAGAGT 

AAAAATAAAA 

CTACACGCAG 

GAGAAAGGCA 

TGGAGTTGGA 

CGAAGGAACT 

GGTACAAACT 

TAAAGAAACT 

TGGACCAGCA 

ACAGCAGAGT 

TTCTATCATC 

AGACTACATC 

CCAGCACCCT 

CCAACTGGTG 

GCCAAATAAA 

ACACAAATGT 

ACAGGATGAT 

TAGCCCCATT 

GGATGGGCCT 

TCTGACAACC 

CAAGATGATC 

CTACAAAGTG 

GGACAGTAAT 

TTAACACAGA 

AGGCAGGAAA 

TCATGACATA 

ACTTGTAATT 

ATGGAATTGT 

TAGGAATTCC 

AGCAATTATC 

TCTTCCATAT 

GTTACTTATT

AGATTTTTAT 

TGACGTAGTT 

AAGTCATTAA 

ATTTTGAAAG 

TCCAAGGAAT 

AAAAAAAAAA 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE



GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD
KLAQLAEKDG 
NLVEKGRRKC 
GRVVTQYHYT
SMLQQIQHEG 
YVNALLIPGP 
SSLSGEGTDY 
AEDEFVYWPN 
CPKWPNPDSP 
SVDVYQVAKM 
AESLESLV 



11 

I 

IQLLCVCRLD 
TQVNVNLKKL
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE
SSRQQDLVST 
CLVVLVGILI
LHASSGFTEE 
KLTDYINANY 
DQYWPADGSE 
QWPDMGVPEY 
TVNIFGFLKH
AGKTKLEKQF 
INASYIMGYY 
KDEPINCESF 
ISKTFELISV 
INLMRPGVFA 



21 

I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNVVYSQTTQ
YWRKCFQTAH 
FEEVQSCTVD 
VDGYNRPKAY 
EYGNFLVTQK 
SLPVLTFVRK 
IRSQRNYLVQ 
QLLSQSNIQQ 
QSNEFIITQH 
KVTLMAEEHK 
IKEEAANRDG 
DIEQYQFLYK 



31 

I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRVVYD
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 



FYLEDSTSPR 
LGITADSSNH 
IAAQGPLKST
SVQVLAYYTV 
AAYAKRHAVG 
TEEQYVFIHD 
SDYSAALKQC 
PLLHTIKDFW 
CLSNEEKLII 
PMIVHDEHGG 
VILSLVSTRQ 



QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF
PDNKHKNRYI
AEDFWRMIWE
RNFTLRNTKI 
PVVVHCSAGV
TLVEAILSKE 
NREKNRTSSI 
RMIWDHNAQL 
QDFILEATQD 
VTAGTFCALT 
EENPSTSLDS 






51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI
PHYSTFAYFP
LESEKKAVIP
PISDDVGAIP 
NIVAYDHSRV 
HNVEVIVMIT 
KKGSQKGRPS 
GRTGTYIVLD 
TEVLDSHIHA 
IPVERSRVGI 
VVMIPDGQNM
DYVLEVRHFQ 
TLMHQLEKEN 
NGAALPDGNI



21



31 



51 
CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 
AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 
ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 
TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540
AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 
AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 
GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 
ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 
TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840
ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 
CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 
CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020
TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 
GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140
GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 
AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 
GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320
GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 
ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500
ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 
CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 
AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 
CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 
ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 
CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980
AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040
AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 
TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 
GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 
CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280
ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 
AATCGGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 
TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140
CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 
CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260
CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 
ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 
ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440
CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 
AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560 
TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 
ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 
TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740
ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800
TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860
CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 
AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 
GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040
TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 
TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 
TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220
AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280
ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340
AAAAAAAAAA AAAAAAAAAA AAA

Seq ID NO: 186 Protein sequence:
Protein Accession #: EOS sequence

1

I 

MVFKASKITF
LSILFEVGTE 
PCTDTVDWIV 
SYTGKEEIHE 
TKHEFLTDGY 
PELIGTEEII 
TNRSPTRGSE 
NDGSKTVLRS 
ISENISQGYI 
DITAQPDVGS 
FPTEVTPHAF 
IPLVIVSALT 
IPIKHFPKHV 
IVAYDHSRVK 
NVEVIVMITN 
KGSQKGRPSG 
RTGTYIVLDS 
EVLDSHIHAY 
PVERSRVGIS
VMIPDGQNMA 
YVLEVRHFQC 



11 
I 

HWGKCNMSSD 
ENLDFKAIID 
FKDTVSISES 
AVCSSEPENV 
QDLGAILNNL 



GAALPDGNIA 



PSGKGDVPNT 
PHMNLSGTAE 
FSSENPETIT 
GRESFLQTNY 
TPSSRQQDLV 
FICLVVLVGI
ADLHASSGFT 
LAQLAEKDGK 
LVEKGRRKCD
RVVTQYHYTQ
MLQQIQHEGT 
VNALLIPGPA 
SLSGEGTDYI 
EDEFVYWPNK 
PKWPNPDSPI 
VDVYQVAKMI 
ESLESLV 



21 
I 

GSEHSLEGQK 
GVESVSRFGK 
QLAVFCEVLT 
QADPENYTSL 
LPNMSYVLQI 
EGAIVNPGRD 
SLNSTSQPVT 
SLNTVSITEY 
YDVLIPESAR
TEIRVDESEK
STVNVVYSQT
LIYWRKCFQT 
EEFETLKEFY 
LTDYINANYV 
QYWPADGSEE 
WPDMGVPEYS 
VNIFGFLKHI 
GKTKLEKQFQ 
NASYIMGYYQ 
DEPINCESFK 
SKTFELISVI 
NLMRPGVFAD 



31 
I 

FPLEMQIYCF 
QAALDPFILL 
MQQSGYVMLM 
LVTWERPRVV
VAICTNGLYG 
SATNQIRKKE 
KLATEKDISL 
EEESLLTSFK 
NASEDSTSSG
TTKSFSAGPV 
TQPVYNEASN
AHFYLEDSTS 
QEVQSCTVDL 
DGYNRPKAYI 
YGNFLVTQKS 
LPVLTFVRKA 
RSQRNYLVQT 
LLSQSNIQQS
SNEFIITQHP 
VTLMAEEHKC
KEEAANRDGP 
IEQYQFLYKV 



41 

I

DADRFSSFEE 
NLLPNSTDKY 
DYLQNNFREQ 
YDTMIEKFAV 
KYSDQLIVDM
PQISTTTHYN 
TSQTVTELPP 
LDTGAEDSSG 
SEESLKDPSM 
MSQGPSVTDL 
SSHESRIGLA 
PRVISTPPTP 
GITADSSNHP 
AAQGPLKSTA 
VQVLAYYTVR 
AYAKRHAVGP 
EEQYVFIHDT 
DYSAALKQCN 
LLHTIKDFWR
LSNEEKLIIQ 
MIVHDEHGGV
ILSLVSTRQE 



51 

I 

AVKGKGKLRA 
YIYNGSLTSP 
QYKFSRQVFS 
LYQQLDGEDQ 
PTDNPELDLF
RIGTKYNEAK 
HTVEGTSASL 
SSPATSAIPF 
EGNVWFPSST 
EMPHYSTFAY 
EGLESEKKAV 
IFPISDDVGA 
DNKHKNRYIN
EDFWRMIWEH 
NFTLRNTKIK 
VVVHCSAGVG
LVEAILSKET
REKNRTSSII
MIWDHNAQLV 
DFILEATQDD 
TAGTFCALTT 
ENPSTSLDSN 



Seq ID NO: 187 DNA sequence

Nucleic Acid Accession #: EOS sequence

Coding sequence: 148-4632 



CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 
AACACATTCA 
GTCAGCGGAG 
AAATGCAATA 
GAGATGCAAA 
GGAAAAGGGA 
GATTTCAAAG 
TTAGATCCAT 
AATGGCTCAT 
ACAGTTAGCA 
TCTGGTTATG 
TTCTCTAGAC 
AGTTCAGAAC 
TGGGAAAGAC 
CAGTTGGATG 
GGTGCTATTC 
TGCACTAATG 
AATCCTGAAC 
GAAGAGGGAA 
AACCAAATCA 
ACGAAATACA
AAGGGTGATG 
ACAGAAAAAG 
GAAGGTACTT
AACTTGTCGG 
AGTTTATTGA 
GCAACTTCTG 
GAAAACCCAG 
GAAGATTCAA 

AGCTTTCTCC 
TCCTTTTCTG 
CATTATTCTA 
TCCAGACAAC 
GTATACAATG 
GAATCCGAGA 
CTAGTGGTTC 
TACTTAGAGG 
ATTTCAGATG 
CATGCAAGTA 
CAGAGCTGTA 



11 
I 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 
TTCATAACAC 
GAGTTTCAGA 
TGTCATCTGA 
TCTACTGCTT 
AGTTAAGAGC 
CGATTATTGA 
TCATACTGTT 
TGACATCTCC 
TCTCTGAAAG 
TCATGCTGAT 
AGGTGTTTTC 
CAGAAAATGT 
CTCGAGTCGT 
GAGAGGACCA 
TCAATAATTT 
GCTTATATGG 
TTGATCTTTT 
AAGACATTGA 
GGAAAAAGGA 
ATGAAGCCAA 
TTCCCAATAC 
ATATTTCCTT 
CAGCCTCTTT 
GGACTGCAGA 
CCAGTTTCAA 
CTATCCCATT 
AGACAATAAC 
CTTCATCAGG 
CTAGCTCTAC 
AGACTAATTA 
CAGGCCCAGT 
CCTTTGCCTA 
AGGATTTGGT 
AGGCCAGTAA 
AGAAGGCAGT 
TTGTGGGTAT 
ACAGTACATC 
ATGTCGGAGC 
GTGGGTTTAC 
CTGTTGACTT 



21 

I

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 
TGGGAAAACA 
AATGGTGTTT 
TGGATCAGAG 
TGATGCGGAC 
TTTATCCATT 
TGGAGTCGAA 
GAACCTTCTG 
TCCCTGCACA 
CCAGTTGGCT 
GGACTACTTA 
CTCATACACT 
TCAGGCTGAC 
TTATGATACC 
AACCAAGCAT 
GCTACCCAAT 
AAAATACAGC 
CCCTGAATTA 
AGAAGGCGCT 
ACCCCAGATT 
GACTAACCGA 
ATCTTTAAAT 
GACTTCTCAG 
AAATGATGGC 
ATCCTTAAAT 
GCTTGATACT 
CATCTCTGAG 
ATATGATGTC 
TTCAGAAGAA 
AGACATAACA 
CACTGAGATA 
GATGTCACAG 
CTTCCCAACT 
CTCCACGGTC 
TAGTAGCCAT 
TATACCCCTT 
TCTCATCTAC 
CCCTAGAGTT 
AATTCCAATA 
TGAAGAATTT 
AGGTATTACA 



31 
I 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 
GTGGAAATTA 
AAAGCAAGCA 
CATAGTTTAG 
CGATTTTCAA 
TTGTTTGAGG 
AGTGTTAGTC 
CCAAACTCAA 
GACACAGTTG 
GTTTTTTGTG 
CAAAACAATT 
GGAAAGGAAG 
CCAGAGAATT 
ATGATTGAGA 
GAATTTTTGA 
ATGAGTTATG 
GACCAACTGA 
ATTGGAACTG 
ATTGTGAATC 
TCTACCACAA 
TCCCCAACAA 
TCCACTTCCC
ACTGTGACTG 
TCTAAAACTG 
ACAGTTTCTA 
GGAGCTGAAG 
AACATATCCC 
CTTATACCAG 
TCACTAAAGG 
GCACAGCCCG 
CGTGTTGATG 
GGTCCCTCAG 
GAGGTAACAC 
AACGTGGTAT 
GAGTCTCGTA 
GTGATCGTGT 
TGGAGGAAAT 
ATATCCACAC 
AAGCACTTTC 
GAGACACTGA 
GCAGACAGCT 



41 



I 



GGAGGATTAA 
TGAGAAGCAG 
AACGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 
ATCTCACTAA 
AGATAACTTT 
AAGGACAAAA 
GTTTTGAGGA 
TTGGGACAGA 
GTTTTGGGAA 
CTGACAAGTA 
ACTGGATTGT 
AAGTTCTTAC 
TTCGAGAGCA 
AGATTCATGA
ATACCAGCCT 
AGTTTGCAGT 
CAGATGGCTA 
TTCTTCAGAT 
TTGTCGACAT 
AAGAAATAAT 
CTGGTAGAGA 
CACACTACAA 
GAGGAAGTGA 
AACCAGTCAC 
AACTGCCACC 
TTCTTAGATC 
TAACAGAATA 
ATTCTTCAGG 
AAGGGTATAT 
AATCTGCTAG 
ATCCTTCTAT 
ATGTTGGATC 
AATCTGAGAA 
TTACAGATCT 
CTCATGCTTT 
ACTCGCAGAC
TTGGTCTAGC 
CAGCCCTGAC 
GCTTCCAGAC 
CTCCAACACC 
CAAAGCATGT 
AAGAGTTTTA 
CCAACCACCC 






51 
I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TTGGGGAAAG 
AGATCTTACA 
ATCATTGGAA 
TGACTACCGT 
TCACTGGGGA 
ATTTCCACTT 
AGCAGTCAAA 
AGAAAATTTG 
GCAGGCTGCT 
TTACATTTAC 
TTTTAAAGAT 
AATGCAACAA 
ACAGTACAAG 
AGCAGTTTGT 
TCTTGTTACA 
TTTGTACCAG 
TCAAGACTTG 
AGTAGCCATA 
GCCTACTGAT 
CAAGGAGGAG 
CAGTGCTACA 
TCGCATAGGG
ATTCTCTGGA 
TAAATTAGCC 
TCACACTGTG 
TCCACATATG 
TGAGGAGGAG 
CTCCAGTCCC 
ATTTTCCTCC 
AAATGCTTCC 
GGAGGGAAAT 
AGGCAGAGAG 
GACAACCAAG 
GGAAATGCCA 
TACCCCATCC 
AACCCAACCG 
TGAGGGGTTG 
TTTTATCTGT 
TGCACACTTT 
TATCTTTCCA 
TGCAGATTTA 
CCAGGAAGTG 
AGACAACAAG 







CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA 
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA 
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA 
CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC 
GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG 
GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT 
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA 
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT 
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA 
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 188 Protein sequence:
Protein Accession #: EOS sequence 



GCAGGGTTAA 
CCAATTATGT 
AATCCACAGC 
TGATAACAAA 
GGAGTGAGGA 
ATACTGTGAG 
GACCCAGTGG 
CAGAGTACTC 
CAGTGGGGCC 
TGCTAGACAG 
TAAAACACAT 
TTCATGATAC 
TTCATGCCTA 
AACAATTCCA 
ACTGCAACCT 
GGACTATACT 
AATGCAACAG 
GCATTTCATC 
ATTACCAGAG 
TCTGGAGGAT 
ACATGGCAGA 
GCTTTAAGGT 
TAATTCAGGA 
TTCAGTGTCC 
GTGTTATAAA 
GAGGAGTGAC 
AAAATTCCGT 
TTGCTGACAT 
GGCAGGAAGA 
ATATAGCTGA 
TGAGCATTGT 
TTGATTTCCC 
TCATTAACAA 
ATGATTGAAT 
TTAACAGAAA 
TTAGTGTCAA 
TTAATACAGT 
CAGTATTCAC 
CATGGACCAA 
TCTAGTTCTG 
AGTTTTCTGA 
AACTTTTGTG 
TTACCAAACA 
GCCATTAAAA 



GCTAGCACAG 
TGATGGCTAC 
TGAAGATTTC 
CCTCGTGGAG 
GTACGGGAAC 
GAATTTTACT 
ACGTGTGGTC 
CCTGCCAGTG 
TGTTGTCGTC 
TATGTTGCAG 
CCGTTCACAA 
ACTGGTTGAG 
TGTTAATGCA 
GGGTCTCACT 
TCCTCTCCCT 
CCTGAGCCAG 
GGAAAAGAAT 
CCTGAGTGGA 
CAATGAATTC 
GATATGGGAC 
AGATGAATTT 
CACTCTTATG 
CTTTATCTTA 
TAAATGGCCA 
AGAAGAAGCT 
GGCAGGAACT 
GGATGTTTAC 
TGAGCAGTAT 
GAATCCATCC 
GAGCTTAGAG 
TTTCCTCTTC 
ATCACCTGAC 

TTTACAGTAT 
ATTTCAATTT 
ATTTTTAGCT 
AGCCTGTAAA 
CTAAAGTAGA 
ATTTATATTT 
TGTAATTGTT 
CATTGTATTG 
GAAAATAGAA 
TTGTTCAAAT 
AAAAAAAAAA 



I 

MRILKRFLAC
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK
TGKEEIHEAV 
HEFLTDGYQD
LIGTEEIIKE
RSPTRGSEFS
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
SRVAGTILLS
YIMGYYQSNE 
INCESFKVTL 
FELISVIKEE
RPGVFADIEQ 



11 

I 

IQLLCVCRLD 
TQVNVNLKKL 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 



LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA
LGAILNNLLP 



GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLVVLVGILI
LHASSGFTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCLSN 
AANRDGPMIV
YQFLYKVILS



ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL
NTVSITEYEE 
VLIPESARNA
IRVDESEKTT 
VNVVYSQTTQ
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQGL 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAG 
LVGTRQEENP 



31 

I 

KLVEEIGWSY
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY
TWERPRVVYD
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



KSFSAGPVMS 



FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
TLSPRLECRG 
NRTSSIIPVE 
DHNAQLVVMI
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPVV
QYVFIHDTLV 
TISAHCNLPL 
RSRVGISSLS 
PDGQNMAEDE
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL



Seq ID NO: 189 DNA sequence 

Nucleic Acid Accession #: NM_002820

Coding sequence: 304..831






PCT/US02/12476 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
PGLTDPPTSA 
GEGTDYINAS 
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 
ESLV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



1 11 21 31 41 51 

I I I I I I 

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60 
CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120 
CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT 180 




TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT 
GTTTGGAGAA AGCACAGTTG GAGTAGCCGG 
ACGATGCAGC GGAGACTGGT TCAGCAGTGG 
GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT 
GAACATCAGC TCCTCCATGA CAAGGGGAAG 
CTTCACCATC TGATCGCAGA AATCCACACA 
CCTAACTCCA AGCCCTCTCC CAACACAAAG 
GAGGGCAGAT ACCTAACTCA GGAAACTAAC 
AAGACACCTG GGAAGAAAAA GAAAGGCAAG 
AAACGGCGAA CTCGCTCTGC CTGGTTAGAC 
GACCACCTGT CTGACACCTC CACAACGTCG 
CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA 
GCTTGGACAA ACCTAGAATT TTCTCCCTTT 
CAGAGAATAA CTCAGAATAT TGTCTGCCTT 
TGTCCTCCAG CACCATAGAG AGGCGCTAGA 
CATCAATCCT TTACCACTCT ACCAAATAAT 
ATCTTCATAA TTTGCTGGAG AAGTGTATTT 
TTCTTCAGTG TTTTTCATTT CTTACGTTCT 
GATATTATCT ACAAACACTG CAGAACAGCA 
ACTTTTTATT TAATTAAATG TATTTAATTA 
TAAATTATGT TTTAAACACA TGCCTTAAAT 
CCAGCTCATA CAAAATAAAT GGTTTCTGAA 
GGTTTTTCTC ATGTATCTTT TTGTTCATTG 
CGGTAGGAAA AATAAAACTT CACATTTAAA 



Seq ID NO: 190 Protein sequence: 
Protein Accession #: NP_002811 






TTTTCCCTTT 
TTGCTAAATA 
AGCGTCGOGG 
CTCAGCCGCC 
TCCATCCAAG 
GCTGAAATCA 
AACCACCCCG 
AAGGTGGAGA 
CCCGGGAAAC 
TCTGGAGTGA 
CTGGAGCTCG 
GCTGGGTTTT 
ATGTATCTCT 
AAAGCAGTAC 
GCCCATTCCT 
TTCATATTCA 
CTTCCCCTTA 
TTCACTTCAA 
TCATGTCATA 
AATCTCAAAT 
TTGTTTAATT 
AATGTTTAAG 
GCAAGATGAA 



TTGCTCTTTC 
AGTCCCGAGC 
TGTTCCTGCT 
GCCTCAAAAG 
ATTTACGGCG 
GAGCTACCTC 
TCOGATTTGG 
CGTACAAAGA 
GCAAGGAGCA 
CTGGGAGTGG 
ATTCACGGTA 
GGAGCCTCCC 
ATCGATTGTG 
GCCCCTACCA 
CTTTCTCCAC 
AGCTTCAGAA 
CTCTCACACC 
GGGAGAATAT 
AACGATTCTG 
TTATTTTAAT 
AAATTTAACT 
TATTAACTTA 
ATAATTTTTC 



TGGCTGTGTG 
GCGAGCGGAG 
GAGCTACGCG 
AGCTGTGTCT 
AOGATTCTTC 
GGAGGTGTCC 
GTCTGATGAT 
GCAGCCGCTC 
GGAAAAGAAA 
GCTAGAAGGG 
ACAGGCTTCT 
TTCTGCCTTG 
TAGCAATTGA 
CACACACCCC 
CGTCACCCAA 
GCTAGTGACC 
TGGGCAAACT 
AGAAGCATTT 
AGCCATTCAC 
GTAAAGAACT 
CTGGTTTCTA 
CAAGGATATA 
TAGGGTAATG 



11 



1 
I 

GGGCTGTCCG 
CCTCGCATGC 
GGTATCGTGG 
GCGGCCACGG 
GCCAACCTGT 
GGCCCGGAAG 
GTGGATTTGC 
CTGGGCAGGG 
TGCCAGGTGT 
GAGCGCGGTG 
GTGGAGCGCG 
GACGCCTTCG 
GCCCTGGCGC 
GCGCACCCCG 
TTCTCTTGCT 
GCCCAGCTTG 
CCAGGGGAGG 
TGACCCCCTA 
TTAGTCCTGG 
GGACACTGCC 
AGCCTTCTTG 
TGGTGCCCCT 
CTATATTAAT 



11 

I 

GCCCACTCCC 
GCCTGGTGGT 
AGGCCTTTCT 
AGAGCGCGGC 
CCGAGGGCCG 

CTCAGGTGCG 
CCCCCACGCA 
TCGCCGAGGC 
TGGGTGGCTG 
ACGCCCGGGC 
TGGGCCTGAG 
GGGTCCTGGA 
GCTACCCCAG 
CTTGGGAGCG 
CCCAGGATGG 
AGGTCCCCTG 
CAGACAACCA 
CCCAGCCCAG 
ACCTCTGGGC 
GCTGCAGGCA 
CCATGTTGCA 
AAAATAACGT 



21 
i 

CTGGGAGCGC 
CACCGCGGAC 
GGCCGGGGCT 
GGAGCTGGCC 
CCCCGTGGGT 
TGGCAAGATG 
GGAGGAGCTC 
CGOGGACGGG 
GCTGCAGGCC 
CACTTGGCTG 
CGCCGTGGGC 
CACTTGCGGC 
AGGTACCCTA 
TGTGCCTCCC 
GCTGCATGAG 
CGTGCAGCTT 
TGAGCCCACT 
AGCACTAATC 
AGCTGGGACC 
TCAGGTCCTC 
GGCCTAGOCT 
ATGCAAACAC 
GTGTCTTTC 



31 
I 

GAGCGGTGGA 
GACTTTGGTT 
GTGACCAGGG 
CGCAGGCACA 
CCGGCCCGOC 
GGATTCCGGG 
GAGGCCCAAC 
CACCAGCAOG 
TATGGGGTGC 
GAGGCCCCOG 
COCTTCTCCC 
CGGCACATGT 
GCGGGCCACA 
ACCGGCGGCT 
CTGCGCGTCC 
TGCGCCCTOG 
CTGGAACCCT 
CCCTTAGTAC 
TGGAGCACGA 
ATGCCTCCAA 
GTGGCAGCGG 
CTTCACCACT 



41 
I 

CCCAGGCGGC 
ACTGCCOGCG 
TGTCCCTGCT 
GCATCCCCAC 
GTGGCGCCTC 
AGGOGGTGGC 
TAAGCTGCTT 
TGCACGTGCT 
GCTTTACGCG 
OGCGTGCCTT 
GCCACGGCCT 
CCGCTCACCG 
CCCTGACAGC 
GCGGTGAAGG 
TCACCGCGCC 
AOGACCTGGA 
TCCTGGAACC 
CAAGAAAGGG 
TCTGTTGACT 
ATGGCATCTA 
GCTAGGGCCC 
GGGGCAGTGG 



51 
I 

CATGTCCCGC 
AOGCGATGAG 
GGTCAACGGT 
GGGCCTCCAC 
ATCGCTGCTC 
GGCOGGAGAC 
CCGGGAGCTG 
CCCAGGCGTG 
ACTGCCGCTG 
CGCCTGCGCC 
GCGGTGGACA 
OGTGTCCGGG 
CGAGCTGATG 
CCCCGACGCT 
CACGCTGCGG 
CTCCAAGAGG 
CTCCCTACTC 
GAGCCAGGAT 
TCCCTGGGTA 
GAGTTTGAGC 
GCAGAGCATT 
GGAGAGATGG 



Seq ID NO: 192 Protein sequence: 
Protein Accession #: XP_059328 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



21 31 41 51 

I I I I I I 

MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 
TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

Seq ID NO: 191 DNA sequence 
Nucleic Acid Accession #: XM_059328 
Coding sequence: 52..1023 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



1 11 21 31 41 51 

I I I I I I 

MSRPRMRLVV TADDFGYCPR RDEGIVEAFL AGAVTSVSLL VNGAATESAA ELARRHSIPT 
GLHANLSEGR PVGPARRGAS SLLGPEGFFL GKMGFREAVA AGDVDLPQVR EELEAQLSCF 
RELLGRAPTH ADGHQHVHVL PGVCQVFAEA LQAYGVRFTR LPLERGVGGC TWLEAPARAF 
ACAVERDARA AVGPFSRHGL RWTDAFVGLS TCGRHMSAHR VSGALARVLE GTLAGHTLTA 
ELMAHPGYPS VPPTGGCGEG PDAFSCSKER LHELRVLTAP TLRAQLAQDG VQLCALDDLD 
SKRPGEEVPC EPTLEPFLEP SLL 

Seq ID NO: 193 DNA sequence 

Nucleic Acid Accession #: NM_005688.1 

Coding sequence: 126..4439 



60 
120 
180 
240 
300 



1 11 21 31 41 51 

I I I I I I 

CCGGGCAGGT GGCTCATGCT OGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 



AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA GGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 
TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTGGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCOGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

COCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGOGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 
GTTGGTTCCA AGCCCTGGAG 
ATTCCCACAC CTCCACAGTT 
CTCACCGCAG TCGTCGCACA 
CAGCTCTTGC TAATCAGTGT 
ACCTCAGGTT OCTGGTTGCT 
GGGQCTGOTA GCTCAGGTGG 
ATGTCGTGAC CAACTAGACA 
CAAAAATCTG AAAATGTGAA 
AAAAAAAAAA AAAAAAAA 



Seq ID NO: 194 Protein sequence: 
Protein Accession #: NP_005679.1 



CCAACTGCTG 
CAGTGGCAGG 
GTCTCTCTCT 
CTCACACTGG 
GTGTGGTTTG 
GCGTGGTCAC 
TTCTGTCGCC 
TAAAATTATT 



CTTTTTGAGG 
GCTCAGGATT 
CTCTCTCCCC 
CGTAGAAGTT 
GTGTGTTCCC 
TGCTGTCATC 
TTAGCATGTT 
TTGGATTTTG 



TGGCACTTTT 
TCGTGGGTCT 
TCAAAGTCTG 
TTTGTACTGT 
GCAAACCCCC 
AGTTGAATGG 
TGCTGAACAC 
TAAAAAAAAA 



TCATTTGCCT 
GTTTTCCTTT 
CAACTTTAAG 
AAAGAGACCT 
TTTGTGCTGT 
TCAGOGTTGC 
CTTGTGGAAG 
AAAAAAAAAA 



5400 
5460 
5520 
5580 
5640 
5700 
5760 



MKDIDIGKEY 
LDASMHSQLR 
VAHKKGELSM 
LSIVCLMITQ 
ALNYRTGVRL 
PWAILGMIY 
VLTYIKFIKM 
HMTIjGFDLTA 
NKPASPHIKI 
VLAEQKGHXJi 
KTSLISAILG 
CCliRPDLAIIi 
NHIFNSAIRK 
FNNIiLLGETP 
VPWSVYGVYI 
VSDSMKDNPH 
KFFDTTPTGR 
LVILFSVLHI 
LDDNQAPFFL 
GLFQFTVRLA 
RYRENLPLVL 
IGLADLRSKL 
SEVMENGDNF 
MLTIAHRlxHT 



11 
I 

IIPSPGYRSV 
ILDEEHPKGK 
EDVWSLSKHE 
LAGPSGPAFM 
RGAILTMAFK 
NVIILGPTGF 
YAWVKAFSQS 
AQAPTWTVF 
EMKNATLAWD 
LDSDERPSPE 
QMTLLEGSIA 
PSSDLTEIGE 
HLKSKTVIiFV 
PVEINSKKET 
QAAGGPLAPL 
MQYYASIYAI* 
ILNRFSKDMD 
VSRVLIRELK 
FTCAMRWLAV 
SETEARFTSV 
KKVSFTIKPK 
SIIPQEPVLF 
SVGERQLLCI 
VLGSDRIMVL 



21 

1 

RERTSTSGTH 
YHHGLSALKP 
SSDVNCRRLE 
VKHLLEYTQA 
KILKLKNIKE 
LGSAVFILFY 
VQKIREEERR 
NSMTFALKVT 
SSHSSIQNSP 
EEEGKHIHLG 
ISGTFAYVAQ 
RGANLSGGQR 
THQLQYLVDC 
SGSQKKSQDK 
VIMALFMLNV 
SMAVMLILKA 
EVDVRLPPQA 
RLDNITQSPF 
RLDLISIALI 
ERINKYIKTL 
EKIGIVGRTG 
SGTVRSNLDP 
ARALLRHCKI 
AQGQWEFDT 



31 
1 



IRTTSKHQHP 
RLWQEELNEV 
TESNLQYSLL 
K5LGELINIC 
PAMMFASRLT 
ILEKAGYFOG 
PFSVKSLSEA 
KLTPKMKKDK 
HLRLQRTLHS 
QAWILNATLR 
QRISLARALY 
DEVIFMKEGC 
GPKTGSVKKE 
GSTAFSTWWL 
IRGWPVKGT 
EMFIQNVILV 
LSHITSSIQG 
TTTGLMIVLM 
SLEAPARIKN 
SGKSSIiGMAL 
FNQYTEDQIW 
LILDEATAAM 
PSVLLSNDSS 



41 
I 

TRPLECQDAL 
VDNAGLFSCM 
GPDAASLRRV 
LVLGLLLTEI 



AYFRRKCVAA 
ITVGVAPIW 
SVAVDRFKSIi 
RASRGKKEKV 
IDLEIQEGKL 
DNILFGKEYP 
SDRSIYILDp 
ITERGTHEEIj 
KAVKPEEGQI* 
SYWIKQGSGN 
LRASSRLHDE 
FFCVGMIAGV 
LATIHAYNK6 
HGQIPPAYAG 
KAPSPDWPQE 
FRLVELSGGC 
DALERTHMKE 
DTETDLLIQE 
RFYAMFAAAE 



51 
I 

ETAARAEGXiS 
TFSWLSSLAR 
VWIFCRTR1»I 
VRSWSLALTW 
AAVGSLLAGG 
TDERVQKMNE 
VIASWTFSV 
FLMEEVHMIK 
RQLQRTEHQA 
VGICGSVGSG 



PLSALDAHVG 
MNLNGDYATI 
VQLEEKGQGS 
TTVTRGNETS 
LFRRILRSPM 
FPWFLVAVGP 
QEFLHRYQEL 
IiAISYAVQI.T 
GEVTFENAEM 
IKIDGVRISD 
CIAQLPLKIiE 
TIREAFADCT 
NKVAVKG 



Seq ID NO: 195 DNA sequence 
Nucleic Acid Accession #: NM_006470 
Coding sequence: 228..1922 



GCTGTCCTGA 
CGCCAGCACA 
TTGCAGCAGC 
TGGGCCAAGG 
ATCTAATGGC 
CAGACTCTGG 
TGGGCTCCTC 
AGGGGGATCC 
GAAGAGTGAA 
TGCAGCCGCA 
ACCACAACTG 
ATCAGCAGTG 
TGGATGCAGC 
GGAAACTCAA 
TGGTGTCGGT 
CTGTGAGGAA 
TGAGCCAGGC 
GCAAGCAGGA 
ACTGCAAGTT 
ATAAACTCTC 
TGGAGAACTA 
CTCAAGTGTC 
GGGAACAGTT 
ATCTCCGGCT 
ACCCGGACCT 
ACCTGCACAG 
CCTGCAAAGG 
TCTCCTGGAG 
CCCCACTCAA 
TCCTTTCCTT 
AATTTTCAGA 
TTGTAGATCT 
AGACTCCAGG 
GGTGATTTGT 
TCTGAATGAA 
GTTTGCAGTA 
AGAGCAGTGG 
CTGCCTCAGC 
TTTGTATTTT 



11 

I 

GCCTGAGTAC 
CAGTAATGAG 
TGCAATCATC 
GACAGAAGAA 
TCCAGGGCCA 
GTCACCCAGC 
GGAGAAGCTT 
TGCTGGTGAG 
GGCAGTGAAG 
TCAGGTGAAC 
GCGATACTGC 
CATCTGCCAG 
CCGCAGGGAC 
GTTGAATGAA 
GTCAGAGGTC 
GGCCCAGGCC 
CAACGGTATC 
GCTGGAGAGG 
TAAGAACACT 
GGGCATCCGC 
TAAGAAAAAG 
TGCCGTTGTT 
CCTCCAATAT 
GCAGGAGGAG 
CCCCAGCAGG 
GTACTATTTT 
CATCGACCGG 
CCTCCAATGG 
AGCTGGCCCT 
CTATGGCGTA 
ACCAGTCTAT 
GGGAGAGGAA 
AGCCATATCC 
GGGCAGAAAT 
AACATTCTCC 
ATTCTTTTTT 
CGCGATCTTG 
CTCCCGAGTA 
TAGTAGAGAT 



21 

! 

TCTAGCTGCC 
TGGCCGAGCT 
TAGGCGTGGT 
AGACAGCCTA 
CTGCCCAGGG 
CCAGATTCTG 
GGCAGGGAGA 
GGGAAAGAGG 
TCCTGTCTAA 
ATCAAACTGC 
CCTGCCCACC 
GACTGTTGCC 
AAGGAGGCTG 
AATGCCATCT 
AAAGCGGTGG 
AATGTGATGC 
AAGGCCCACC 
ATGGCGGCCA 
GAAGACATCA 
AAAGTTATCA 
CTCCAGGAGT 
CAGCGCAAAT 
GCGTATGACA 
AACCGCAAGG 
TTCCTGCACT 
GAGGTGGAGA 
AAAGGGGAGG 
AACGGGAAGG 
TTCCGGAGGC 
GAGTATGATA 
GCTGCCTTCT 
CCCGAGAAGC 
CAGACCTTTG 
AACTGCTGAT 
AGCTGCTCTC 
TTTTTTTTGA 
GCTCACTGCA 
GCTGGGATTA 
GGGGTTTCAC 



31 

1 

TTGTCGCCAT 
TCCTCTGGGA 
TCTCTTGTCT 
GGAGCAGAGC 
CCACTGCTCA 
GGTCAGCCAG 
CGGAGGAACA 
TCCTGTGTGA 
CCTGCATGGT 
AAAGCCACCT 
ACAGCCCACT 
AGGAGCACAG 
AACTCCAGTG 
CCAGGCTCCA 
CTGAAATGCA 
TCTTCTTAGA 
TGGAGTACAG 
TCAGCAACAC 
CCTTCCCTAG 
CGGAATCCAC 
TTTCCAAGGA 
ATTGGACTTC 
TCACGTTTGA 
TCACCAACAC 
GGCGGCAGGT 
TCTTCGGGGC 
AGCGCAACAG 
AGTTCACGGC 
TCGGGGTCTA 
CCATGACTCT 
GGCTTTCCAA 
CAGCACCGTC 
CCAGCTACAG 
GGTAGCTGGC 
TTTTGCTCCA 
GACGGAGTCT 
AGCTCCGCCT 
CAGGTGCCTG 
CATGTTGGCC 



41 
I 

CGCATCTGGC 

GGGAGGAAAC 

GACTTGGGCT 

CTCCCAGATG 

GCCCCCAGCC 

CCCAGTGGAA 

GGACAGCGAC 

CTTCTGCCTT 

GAATTACTGT 

GCTGACCGAG 

GTCTGCTTTC 

TGGCCACACC 

CACCCAGTTA 

GGCTAACCAA 

GTTTGGGGAA 

GGAGAAGGAG 

GAGTGCCGAG 

TGTCCAGTTC 

TGTTTACGTA 

TGTACACTTA 

AGAGGAGTAT 

CAAACCTGAG 

CCCGGACACA 

CACGCCCTGG 

GCTGTCCCAG 

AGGCACCTAT 

TTGCATTTCC 

CTGGTACAGT 

TATCGACTTC 

GGTTCACAAG 

GAAGGAAAAC 

CTTGGGGGTG 

TGATGGGATT 

TTTTGAAATC 

TATGGTGCTG 

CGCACTGTTG 

CCCGAGTTCA 

CCACCACACC 

AGGCAGATCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



51 
1 

TGCCATCCAG 

AGTTAAAATC 

GCACAGATCC 

GCTGAGTTGG 

CCTCTCAGCC 

GAAGAGGACG 

TCTGCAGAGC 

GATGACACCA 

GAAGAGCACT 

CCAGTGAAGG 

TGCTGCCCTG 

ATAGTCTCCC 

GACTTGGAGC 

AAGTCTGTTC 

CTCCTTGCTG 

CAAGCTGCGC 

ATGGAGAAGA 

TTGGAGGAGT 

GGGCTGAAGG 

ATCCAGTTGC 

GACATCAGAA 

CCCAGCACCA 

GCACACAAGT 

GAGCATCCCT 

CAGAGTCTGT 

GTTGGCCTGA 

GGAAACAACT 

GACATGGAGA 

CCGGGAGGGA 

TTTGCCTGCA 

GCCATCCGGA 

ACTGCTCCCT 

TGCATTTTAG 

CTATGGGGTC 

TTCTCTATGT 

CCCAGGCTGG 

AGCAATTCTC 

CAGCTAATGT 

CAAACTCCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
ACCTCGTGAT GCACCCACCT CGGCCTCCCA AAGTGCTGGG ATTACATGCG TGAGCCACTG 2400 
CGCCCTGCCT GTTTGTAGTA ATTTTTAGGC ACCAAATCTC CCTCATCTTC TAGTGCCATT 2460 
CTCCTCTCTQ TTCAGGTAAA TGTCACACTG TGCCCAGAAT GGATGACCAQ GAACCTTAAA 2520 
GAGTGGCTGA AAAGATTGCA GAGTTATCAT AATAAATTGC TAACTTGCGT 



Seq ID NO: 196 Protein sequence: 
Protein Accession #: NP_006461 

1 11 21 31 41 51 

I I 1 I I I 

MAELDLMAPG PLPRATAQPP APLSPDSGSP SPDSGSASPV EEEDVGSSEK LGRETEEQDS 60 

DSAEQGDPAG EGKEVLCDFC LDDTRRVKAV KSCLTCMVNY CEEHLQPHQV NIKLQSHLLT 120 

EPVKDHNWRY CPAHHSPLSA FCCPDQQCIC QDCCQEHSGH TIVSLDAARR DKEAELQCTQ 180 

LDLERKLKLN ENAISRLQAN QKSVLVSVSE VKAVAEMQFG ELLAAVRKAQ ANVMLFLEEK 240 

EQAALSQANG IKAHLEYRSA EMEKSKQELE RMAAISNTVQ FLEEYCKFKN TEDITFPSVY 300 

VGLKDKLSGI RKVITESTVH LIQLLENYKK KLQEFSKEEE YDIRTQVSAV VQRKYWTSKP 360 

EPSTREQFLQ YAYDITFDPD TAHKYLRLQE ENRKVTNTTP WEHPYPDLPS RFLHWRQVLS 420 

QQSLYLHRYY FEVEIFGAGT YVGLTCKGID RKGEERNSCI SGNNFSWSLQ WNGKEFTAWY 480 

SDMETPLKAG PFRRLGVYID FPGGILSFYG VEYDTMTLVH KFACKFSEPV YAAFWLSKKE 540 
NAIHIVDLGE EPEKPAPSLG VTAP 

Seq ID NO: 197 DNA sequence 
Nucleic Acid Accession #: NM_004316 
Coding sequence: 433-1149 

1 11 21 31 41 51 

I I I I I I 

CCCGAGACCC GGOGCAAGAG AGOGCAGCCT TAGTAGGAGA GGAACGCGAG ACGCGGCAGA 60 

GCGCGTTCAG CACTGACTTT TGCTGCTGCT TCTGCTTTTT TTTTTCTTAG AAACAAGAAG 120 

GCGCCAGCGG CAGCCTCACA CGCGAGCGCC ACGCGAGGCT CCCGAAGCCA ACCCGCGAAG 180 

GGAGGAGGGG AGGGAGGAGG AGGCGGCGTG CAGGGAGGAG AAAAAGCATT TTCACCTTTT 240 

TTGCTCCCAC TCTAAGAAGT CTCCOGGGGA TTTTGTATAT ATTTTTTAAC TTOCGTCAGG 300 

GCTCCCGCTT CATATTTCCT TTTCTTTCCC TCTCTGTTCC TGCACCCAAG TTCTCTCTGT 360 

GTCCCCCTCG CGGGCCCCGC ACCTCGCGTC CCGGATCGCT CTGATTCCGC GACTCCTTGG 420 

CCGCCGCTGC GCATGGAAAG CTCTGCCAAG ATGGAGAGCG GCGGCGCCGG CCAGCAGCCC 480 

CAGCCGCAGC CCCAGCAGCC CTTCCTGCCG CCCGCAGCCT GTTTCTTTGC CACGGCCGCA 540 

GCCGCGGCGG CCGCAGCCGC CGCAGCGGCA GCGCAGAGCG CGCAGCAGCA GCAGCAGCAG 600 

CAGCAGCAGC AGCAGCAGCA GCAGGCGCCG CAGCTGAGAC CGGCGGCCGA CGGCCAGCCC 660 

TCAGGGGGCG GTCACAAGTC AGCGCCCAAG CAAGTCAAGC GACAGCGCTC GTCTTCGCCC 720 

GAACTGATGC GCTGCAAACG CCGGCTCAAC TTCAGCGGCT TTGGCTACAG CCTGCCGCAG 780 

CAGCAGCCGG CCGCCGTGGC GCGCCGCAAC GAGCGCGAGC GCAACCGCGT CAAGTTGGTC 840 

AACCTGGGCT TTGCCACCCT TCGGGAGCAC GTCCCCAACG GCGCGGCCAA CAAGAAGATG 900 

AGTAAGGTGG AGACACTGCG CTCGGCGGTC GAGTACATCC GCGCGCTGCA GCAGCTGCTG 960 

GACGAGCATG ACGCGGTGAG CGCCGCCTTC CAGGCAGGCG TCCTGTCGCC CACCATCTCC 1020 

CCCAACTACT CCAACGACTT GAACTCCATG GCCGGCTCGC CGGTCTCATC CTACTCGTCG 1080 

GACGAGGGCT CTTACGACCC GCTCAGCCCC GAGGAGCAGG AGCTTCTCGA CTTCACCAAC 1140 

TGGTTCTGAG GGGCTCGGCC TGGTCAGGCC CTGGTGCGAA TGGACTTTGG AAGCAGGGTG 1200 

ATCGCACAAC CTGCATCTTT AGTGCTTTCT TGTCAGTGGC GTTGGGAGGG GGAGAAAAGG 1260 

AAAAGAAAAA AAAAGAAGAA GAAGAAGAAA AGAGAAGAAG AAAAAAACGA AAACAGTCAA 1320 

CCAACCCCAT CGCCAACTAA GCGAGGCATG CCTGAGAGAC ATGGCTTTCA GAAAACGGGA 1380 

AGCGCTCAGA ACAGTATCTT TGCACTCCAA TCATTCACGG AGATATGAAG AGCAACTGGG 1440 

ACCTGAGTCA ATGCGCAAAA TGCAGCTTGT GTGCAAAAGC AGTGGGCTCC TGGCAGAAGG 1500 

GAGCAGCACA CGCGTTATAG TAACTCCCAT CACCTCTAAC ACGCACAGCT GAAAGTTCTT 1560 

GCTCGGGTCC CTTCACCTCC CCGCCCTTTC TTAGAGTGCA GTTCTTAGCC CTCTAGAAAC 1620 
GAGTTGGTGT CTTTC 



Seq ID NO: 198 Protein sequence: 
Protein Accession #: NP_004307 

1 11 21 31 41 51 

I I I I I I 

MESSAKMESG GAGQQPQPQP QQPFLPPAAC FFATAAAAAA AAAAAAAQSA QQQQQQQQQQ 60 
QQQQAPQLRP AADGQPSGGG HKSAPKQVKR QRSSSPELMR CKRRLNFSGF GYSLPQQQPA 120 
AVARRNERER NRVKLVNLGF ATLREHVPNG AANKKMSKVE TLRSAVEYIR ALQQLLDEHD 180 
AVSAAFQAGV LSPTISPNYS NDLNSMAGSP VSSYSSDEGS YDPLSPEEQE LLDFTNWF 

Seq ID NO: 199 DNA sequence 
Nucleic Acid Accession #: NM_007015 
Coding sequence: 1-1005 

1 11 21 31 41 51 

I I I I I I 

ATGACAGAGA ACTCCGACAA AGTTCCCATT GCCCTGGTGG GACCTGATGA CGTGGAATTC 60 

TGCAGCCCCC CGGGGTACGC TACGCTGACG GTGAAGCCCT CCAGCCCCGC GCGGCTGCTC 120 

AAGGTGGGAG CCGTGGTCCT CATTTCGGGA GCTGTGCTGC TGCTCTTTGG GGCCATCGGG 180 

GCCTTCTACT TCTGGAAGGG GAGCGACAGT CACATTTACA ATGTCCATTA CACCATGAGT 240 

ATCAATGGGA AACTACAAGA TGGGTCAATG GAAATAGACG CTGGGAACAA CTTGGAGACC 300 

TTTAAAATGG GAAGTGGAGC TGAAGAAGCA ATTGCAGTTA ATGATTTCCA GAATGGCATC 360 

ACAGGAATTC GTTTTGCTGG AGGAGAGAAG TGCTACATTA AAGCGCAAGT GAAGGCTCGT 420 

ATTCCTGAGG TGGGCGCCGT GACCAAACAG AGCATCTCCT CCAAACTGGA AGGCAAGATC 480 

ATGCCAGTCA AATATGAAGA AAATTCTCTT ATCTGGGTGG CTGTAGATCA GCCTGTGAAG 540 

GACAACAGCT TCTTGAGTTC TAAGGTGTTA GAACTCTGCG GTGACCTTCC TATTTTCTGG 600 

CTTAAACCAA CCTATCCAAA AGAAATCCAG AGGGAAAGAA GAGAAGTGGT AAGAAAAATT 660 

GTTCCAACTA CCACAAAAAG ACCACACAGT GGACCACGGA GCAACCCAGG CGCTGGAAGA 720 

CTGAATAATG AAACCAGACC CAGTGTTCAA GAGGACTCAC AAGCCTTCAA TCCTGATAAT 780 




CCTTATCATC AGCAGGAAGG GGAAAGCATG ACATTCGACC CTAGACTGGA TCACGAAGGA 840 
ATCTGTTGTA TAGAATGTAG GCGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 900 
GGGGGCTATT ACCCATGGCC TTATAATTAT CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 960 
ATGCCATGTA GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020 
CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 1080 
ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 1140 
TAACAGAATT TTTTTAATCQ TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 1200 
AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT TTTTATTTGC ATTGATTTGC 1260 
CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTCG AAATAAACAC GAAAATTTAC 1320 
AGTTTGCC 

Seq ID NO: 200 Protein sequence: 
Protein Accession #: NP_008946 

1 11 21 31 41 51 

I I I I I I 

MTENSDKVPI ALVGPDDVEF CSPPAYATLT VKPSSPARLL KVGAVVLISG AVLLLFGAIG 
AFYFWKGSDS HIYNVHYTMS INGKLQDGSM EIDAGNNLET FKMGSGAEEA IAVNDFQNGI 
TGIRFAGGEK CYIKAQVKAR IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 
DNSFLSSKVL ELCGDLPIFW LKPTYPKEIQ RERREVVRKI VPTTTKRPHS GPRSNPGAGR 
LNNETRPSVQ EDSQAFNPDN PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 
GGYYPWPYNY QGCRSACRVI MPCSWWVARI LGMV 



Seq ID NO: 201 DNA sequence 

Nucleic Acid Accession #: NM_000728.2 

Coding sequence: 112..495 



GTAATAAGAG 
GTCGACCGGC 
CGGAAGTTCT 
CAGGCGGCGC 
GAGGACGCGC 
GAGCTGAAGC 
AACACTGCCA 
GTGAAGAGCA 
GACCTTCAAG 
CATATCCTTA 
AAGGAGGCAC 
TGGAAGAAGA 
GAGAATAATT 
GGAAACTAAT 
GGTTATTTGG 
ACTGTACCAC 
GTATGTAGCA 
AGCATCTATT 
AAATTTTTGT 
TGGATGCAAG 
TTGCTTTTTC 
TCCAATTCAT 
TTGCCTAACT 
TATAGTTTTA 
TGAGAGGTGT 
TTTGTTAAAA 
CTGATCATAT 
ACCATTCTTT 
ACCAGATAAT 
CTAAAATATT 
TTCTGATGAG 
CATATTAATA 
GTTTTCTCTG 
ATATCTTGTT 
ATTTTTGTTT 
AAAAAAAAAA 



11 

I 

CGGGGTCTCC 
CGCTCGCGCT 
CCCCCTTCCT 
CATTCAGGTC 
GCCTCCTGCT 
AGGAGCAGGA 
CCTGTGTGAC 
ACTTCGTGCC 
CCTGAGCAGA 
TAAGAGATTC 
AAGCCAAGGA 
GCAGCCCTGC 
TCTGTTGTTT 
ACAATACATT 
AAAGTGTGTA 
TTCGCCTTCT 
GTATCTCATT 
TTACCATATG 
TGGCTTGCTT 
ATTGTTTTCA 
ATTTTCTTAG 
CTTTTTTTTT 
AAGGTCCCAA 
TATTTTATAT 
AGGTTGAAAT 
AGACTGTTAT 
TTGTGTGGGT 
TGCCAATGTC 
GTGGGTCTAC 
TTCTACATCT 
ATTTTTAATG 
ATATTAAGTC 
TTTTTTTTTT 
AGATTTTTAA 
TTAATTGTTC 
AAAAAAAAAA 



21 
I 

GCGGGGAAGG 
GCCCTGAAAC 
GGCTCTCAGT 
TGCCCTGGAG 
GGCTGCACTG 
GACACAGGGC 
TCATCGGCTG 
CACCAATGTG 
TGAATGACTC 
ACTCAGAAGA 
AGTCTGTGTC 
TGACACCTAG 
TAAGCCACAA 
TTCATTTATT 
TTTAACTCTG 
TGCCAGCCAC 
GCTGTTTTAA 
TTTATCACCT 
GCTTTATTAG 
GATATATAGT 
CAGTGTCTCT 
CTTTTATGTA 
GGTCACAATA 
GTAGATTAGT 
TCATACCTGT 
TTCACCATTT 
ATATTTCTGG 
ATACTGCCTT 
CAACATTGTT 
TTTATACATT 
GGATTGTGTT 
GTTCAATTCA 
TTTAACAGTG 
CTATTTTATT 
ATTGCTAGTA 
AAAAAAAAAA 



31 . 
I 

CGCCCACAGC 
TCTAGTCGCC 
ATCTTGGTCC 
AGCAGCCCAG 
GTGCAGGACT 
TCCAGCTCCG 
GCAGGCTTGC 
GGTTCCAAAG 
CAGGAAGAAG 
CACATGTGGA 
TACCAGAAGC 
AGTTTGGACT 
AGTTTGTGGT 
TTGGGTAAAT 
TAAGAAACTG 
ATATGAGAGC 
TTTGTATTTC 
TTATTGAAGG 
TGTTGAGTTT 
TTGGAAACTT 
CACAGAGAAA 
TTGTGCTTTT 
ACCTTATTCT 
GATCTATTTT 
GAATATAGAT 
AATTGCCCCT 
GTTCTCAATT 
GATTAGTGTA 
CATTCTTGTT 
TTAGAATCAG 
AAATCAGTGG 
TGAACACAAT 
TTCTCAGTTT 

GATAGAAATA 



41 

I 

AGGTGTGGTG 
AGAGAGGCGG 
TGTACCAGGC 
ACCCGGCCAC 
ATGTGCAGAT 
CTGCCCAGAA 
TGAGCAGATC 
CCTTTGGCAG 
GTGTGTCCTA 
GAAGGTGACA 
CAGAATCACA 
TCCAGCTTCC 
AATTTGTTAT 
GCCTTGGAGT 
CCAAACTATT 
TCTAGTATTT 
CCCAATGACT 
GTCTGTTTAA 
TTAGAGCTCT 
CCTTCCCCTG 
AAGTTGTAAT 
AGTTCATGTC 
ATACTTTCTT 
GAGTTAATTT 
ACCCAATTGT 
GCACCTTTGT 
CTGTCTCATT 
GTGTTAAAGT 
CAAAAAGATT 
TGTGTTACTA 
GTTAATTTTG 
ACATGTTTTC 
TCAACAGAAA 
AATGTAAATG 
CAATATTTAA 



51 
I 

TTCATCCCGG 
CATGGGTTTC 
GGGCAGCCTC 
ACTCAGTAAA 
GAAGGCCAGT 
GAGAGCCTGC 
AGGGGGCATG 
GCGCCGCAGG 
AATCCAATGA 
TGACAGAGGC 
GAACAGTCTC 
AGAACTGTGA 
GACAGCCCTA 
GGGATTGCTG 
TTCTGAAGTG 
CCACAAATAG 
AATGACGTTG 
ATCTTCTGCT 
TTATATGTTG 
AATCTGCGGA 
TTGAATAAGA 
TAAGAACTCT 
GTAAAAGTTT 
TTGTATAAGG 
TTCAGTGCCA 
CAAAAAGCAA 
GATTGATTTG 
GAATCTCAAA 
TTAGCTACAT 
TCTACAAAAT 
6GAGAATTAG 
ACTTATTTAG 
TATTCTACAC 
GTACTTAAAC 
AATATTAGGA 



Seq ID NO: 202 Protein sequence: 
Protein Accession #: NP_000719.1 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



1 11 21 31 41 51 

I I I I I I 

MGFRKFSPFL ALSILVLYQA GSLQAAPFRS ALESSPDPAT LSKEDARLLL AALVQDYVQM 
KASELKQEQE TQGSSSAAQK RACNTATCVT HRLAGLLSRS GGMVKSNFVP TNVGSKAFGR 
RRRDLQA 

Seq ID NO: 203 DNA sequence 
Nucleic Acid Accession #: NM_001741 
Coding sequence: 71..496 

1 11 21 31 41 51 

I I I I I I 

CTCTGGCTGG ACGCCGCCGC CGCCGCTGCC ACCGCCTCTG ATCCAAGCCA CCTCCCGCCA 
GAGAGGTGTC ATGGGCTTCC AAAAGTTCTC CCCCTTCCTG GCTCTCAGCA TCTTGGTCCT 
GTTGCAGGCA GGCAGCCTCC ATGCAGCACC ATTCAGGTCT GCCCTGGAGA GCAGCCCAGC 
AGACCCGGCC ACGCTCAGTG AGGACGAAGC GCGCCTCCTG CTGGCTGCAC TGGTGCAGGA 
CTATGTGCAG ATGAAGGCCA GTGAGCTGGA GCAGGAGCAA GAGAGAGAGG GCTCCAGCCT 



60 
120 



60 
120 
180 
240 
300 
GGACAGCCCC AGATCTAAGC GGTGCGGTAA TCTGAGTACT 
GCAGGACTTC AACAAGTTTC ACACGTCCCC CCAAACTGCA 
AAAGAAAAGG GATATGTCCA GCGACTTGGA GAGAGACCAT 
CCAGAATGCC AACTAAACTC CTCCCTTTCC TTCCTAATTT 
TAACTTGATG CATGTGGTTT GGTTCCTCTC TGGTGGCTCT 
TTCCTTGTGG CAGAGGATGT CTCAAACTTC AGATGGGAGG 
GTTGGAAGAG AATCACCTGG GAAAATACCA GAAAATGAGG 
GATGTCATCA GAGCTCCTCT GTCCTGCTTC TGAATOTGCT 
TATTTTTCCC C 



Seq ID NO: 204 Protein sequence: 
Protein Accession #: NP_001732 

11 






TGCATGCTGG 
ATTGGGGTTG 
CGCCCTCATG 
CCCTTCTTGC 
TTGGGCTGGT 
AAAGAGAGCA 
GCCGCTTTGA 
GATCATTTGA 



GCACATACAC 
GAGCACCTGG 
TTAGCATGCC 
ATCCTTCCTA 
ATTGGTGGCT 
GGACTCACAG 
GTCCCCCAGA 
GGAATAAAAT 



51 



21 31 41 

I I I I I I 

MGFQKFSPFL ALSILVLLQA GSLHAAPFRS ALESSPADPA TLSEDEARLL LAALVQDYVQ 
MKASELEQEQ EREGSSLDSP RSKRCGNLST CMLGTYTQDF NKFHTFPQTA IGVGAPGKKR 
DMSSDLERDH RPHVSMPQNA N 

Seq ID NO: 205 DNA sequence 
Nucleic Acid Accession #: NM_005361 
Coding sequence: 1-945 



ATGCCTCTTG 
GAGGCCCTGG 
TCCTCTTCTA 
CCTCCCCACA 
AGACAATCCG 
CTGGAGTCCG 
CTCCTCAAGT 
AGAAATTGCC 
GTCTTTGGCA 
TGCCTGGGCC 
CTCCTGATAA 
ATCTGGGAGG 
CATCCCAGGA 
GTGCCCGGCA 
ACCAGCTATG 
TACCCACCCC 



11 
I 

AGCAGAGGAG 
GCCTGGTGGG 
CTCTAGTGGA 
GTCCTCAGGG 
ATGAGGGCTC 
AGTTCCAAGC 
ATCGAGCCAG 
AGGACTTCTT 
TCGAGGTGGT 
TCTCCTACGA 
TCGTCCTGGC 
AGCTGAGTAT 
AGCTGCTCAT 
GTGATCCTGC 
TGAAAGTCCT 
TGCATGAACG 



21 
I 

TCAGCACTGC 
TGCGCAGGCT 
AGTTACCCTG 
AGCCTCCAGC 
CAGCAACCAA 
AGCAATCAGT 
GGAGCCGGTC 
TCCCGTGATC 
GGAAGTGGTC 
TGGCCTGCTG 
CATAATCGCA 
GTTGGAGGTG 
GCAAGATCTG 
ATGCTACGAG 
GCACCATACA 
GGCTTTGAGA 



31 

I 

AAGCCTGAAG 
CCTGCTACTG 
GGGGAGGTGC 
TTCTCGACTA 
GAAGAGGAGG 
AGGAAGATGG 
ACAAAGGCAG 
TTCAGCAAAG 
CCCATCAGCC 
GGCGACAATC 
ATAGAGGGCG 
TTTGAGGGGA 
GTGCAGGAAA 
TTCCTGTGGG 
CTAAAGATCG 
GAGGGAGAAG 



41 
I 

AAGGCCTTGA 
AGGAGCAGCA 
CTGCTGCCGA 
CCATCAACTA 
GGCCAAGAAT 
TTGAGTTGGT 
AAATGCTGGA 
CCTCCGAGTA 
ACTTGTACAT 
AGGTCATGCC 
ACTGTGCCCC 
GGGAGGACAG 
ACTACCTGGA 
GTCCAAGGGC 
GTGGAGAACC 
AGTGA 



Seq ID NO: 206 Protein sequence: 
Protein Accession #: NP_005352 

1 11 21 31 41 

I I 1 1 I 

MPLEQRSQHC KPEEGLEARG EALGLVGAQA PATEEQQTAS SSSTLVEVTL 

PPHSPQGASS FSTTINYTLW RQSDEGSSNQ EEEGPRMFPD LESEFQAAIS 

LLKYRAREPV TKAEMLESVL RNCQDFFPVI FSKASEYLQL VFGIEVVEVV 

CLGLSYDGLL GDNQVMPKTG LLIIVLAIIA IEGDCAPEEK IWEELSMLEV 

HPRKLLMQDL VQENYLEYRQ VPGSDPACYE FLWGPRALIE TSYVKVLHHT 
YPPLHERALR EGEE 

51
|
GGCCOGAGGA 60
GACCGCTTCT 120
CTCACCGAGT 180
CACTCTTTGG 240
GTTTCCCGAC 300
TCATTTTCTG 360
GAGTGTCCTC 420
CTTGCAGCTG 480
CCTTGTCACC 540
CAAGACAGGC 600
TGAGGAGAAA 660
TGTCTTCGCA 720
GTACCGGCAG 780
CCTCATTGAA 840
TCACATTTCC 900

51
|
GEVPAADSPS 60
RKMVELVHFL 120
PISHLYILVT 180
FEGREDSVFA 240
LKIGGEPHIS 300

360
420
480
540
600
660
720
780

60
120

Seq ID NO: 207 DNA sequence
Nucleic Acid Accession #: NM_021115
Coding sequence: 743-2893

1          11         21         31         41         51
|          |          |          |          |          |
AAAGGAAGGG AGGGAGGGAG AAAGGAGAAG TTGGTTTAGA GGCCAGCCGG ACGAGCTTTG 60
GGCACCGCCC TTAGGAGGGC CACCCTCAGA GTCTGACAGC AGGTGAAGGT CCTAAATCTC 120
CCCAAACTAA CTGGTGTCTT TTCTCCTCTT CCAAGATGCT CTTCCCGAGG GAGATGCTAG 180
CCCTTTGGGT CCTTACCTCC TGCCCTCAGG AGCCCCGGAG AGAGGCAGTC CTGGCAAAGA 240
GCACCCTGAA GAGAGAGTGG TAACAGCGCC CCCCAGTTCC TCACAGTCGG CGGAAGTGCT 300
GGGCGAGCTG GTGCTGGATG GGACCGCAOC CTCTGCACAT CACGACATCC CAGCCCTGTC 360
ACCGCTGCTT CCAGAGGAGG CCCGCCCCAA GCACGCCTTG CCCCCCAAGA AGAAACTGCC 420
TTCGCTCAAG CAGGTGAACT CTGCCAGGAA GCAGCTGAGG CCCAAGGCCA CCTCCGCAGC 480
CACTGTCCAA AGGGCAGGGT CCCAGCCAGC GTCCCAGGGC CTAGATCTCC TCTCCTCCTC 540
CACGGAGAAG CCTGGCCCAC CGGGGGACCC GGACCCCATC GTGGCCTCCG AGGAGGCATC 600
AGAAGTGCCC CTTTGGCTGG ACCGAAAGGA GAGTGCGGTC CCTACAACAC CCGCACCCCT 660
GCAAATCTCC CCCTTCACTT CGCAGCCCTA TGTGGCCCAC ACACTCCCCC AGAGGCCAGA 720
ACCCGGGGAG CCTGGGCCTG ACATGGCCCA GGAGGCCCCC CAGGAGGACA CCAGCCCCAT 780
GGCCCTGATG GACAAAGGTG AGAATGAGCT GACTGGGTCA GCCTCAGAGG AGAGCCAGGA 840
GACCACTACC TCCACCATTA TCACCACCAC GGTCATCACC ACCGAGCAGG CACCAGCTCT 900
CTGCAGTGTG AGCTTCTCCA ATCCTGAGGG GTACATTGAC TCCAGCGACT ACCCACTGCT 960
GCCCCTCAAC AACTTTCTGG AGTGCACATA CAACGTGACA GTCTACACTG GCTATGGGGT 1020
GGAGCTCCAG GTGAAGAGTG TGAACCTGTC CGATGGGGAA CTGCTCTCCA TCCGCGGGGT 1080
GGACGGCCCT ACCCTGACCG TCCTGGCCAA CCAGACACTC CTGGTGGAGG GGCAGGTAAT 1140
CCGAAGCCCC ACCAACACCA TCTCOGTCTA CTTCCGGACC TTCCAGGACG ACGGCCTTGG 1200
GACCTTCCAG CTTCACTACC AGGCCTTCAT GCTGAGCTGC AACTTTCCCC GCCGGCCTGA 1260
CTCTGGGGAT GTCACGGTGA TGGACCTGCA CTCAGGTGGG GTGGCCCACT TTCACTGCCA 1320
CCTGGGCTAT GAGCTCCAGG GCGCTAAGAT GCTGACATGC ATCAATGCCT CCAAGCCGCA 1380
CTGGAGCAGC CAGGAGCCCA TCTGCTCAGC TCCTTGTGGA GGGGCAGTGC ACAATGCCAC 1440
CATCGGCCGC GTCCTCTCCC CAAGTTACCC TGAAAACACA AATGGGAGCC AATTCTGCAT 1500
CTGGACGATT GAAGCTCCAG AGGGCCAGAA GCTGCACCTG CACTTTGAGA GGCTGTTGCT 1560
GCATGACAAG GACAGGATGA CGGTTCACAG CGGGCAGACC AACAAGTCAG CTCTTCTCTA
CGACTCCCTT CAAACCGAGA GTGTCCCTTT TGAGGGCCTG CTGAGCGAAG GCAACACCAT
CCGCATCGAG TTCACGTCCG ACCAGGCCCG GGCGGCCTCC ACCTTCAACA TCCGATTTGA
AGCGTTTGAG AAAGGCCACT GCTATGAGCC CTACATCCAG AATGGGAACT TCACTACATC
CGACCCGACC TATAACATTG GGACTATAGT GGAGTTCACC TGCGACCCCG GCCACTCCCT
GGAGCAGGGC CCGGCCATCA TCGAATGCAT CAATGTGCGG GACCCATACT GGAATGACAC
AGAGCCCCTG TGCAGAGCCA TGTGTGGTGG GGAGCTCTCT GCTGTGGCTG GGGTGGTATT
GTCCCCAAAC TGGCCCGAGC CCTACGTGGA AGGTGAAGAT TGTATCTGGA AGATCCACGT
GGGAGAAGAG AAACGGATCT TCTTAGATAT CCAGTTCCTG AATCTGAGCA ACAGTGACAT
CTTGACCATC TACGATGGCG ACGAGGTCAT GCCCCACATC TTGGGGCAGT ACCTTGGGAA
CAGTGGCCCC CAGAAACTGT ACTCCTCCAC GCCAGACTTA ACCATCCAGT TCCATTCGGA
CCCTGCTGGC CTCATCTTTG GAAAGGGCCA GGGATTTATC ATGAACTACA TAGAGGTATC
AAGGAATGAC TCCTGCTCGG ATTTACCOGA GATCCAGAAT GGCTGGAAAA CCACTTCTCA
CACGGAGTTG GTGCGGGGAG CCAGAATCAC CTACCAGTGT GACCCCGGCT ATGACATCGT
GGGGAGTGAC ACCCTCACCT GCCAGTGGGA CCTCAGCTGG AGCAGCGACC CCCCATTTTG
TGAGAAAATT ATGTACTGCA CCGACCCCGG AGAGGTGGAT CACTCGACCC GCTTAATTTC
GGATCCTGTG CTGCTGGTGG GGACCACCAT CCAATACACC TGCAACCCCG GTTTTGTGCT
TGAAGGGAGT TCTCTTCTGA CCTGCTACAG CCGTGAAACA GGGACTCCCA TCTGGACGTC
TCGCCTGCCC CACTGCGTTT CAGAAGCGGC AGCAGAGACG TCGCTGGAAG GGGGGAACAT
GGCCCTGGCT ATCTTCATCC CGGTCCTCAT CATCTCCTTA CTGCTGGGAG GAGCCTACAT
TTACATCACA AGATGTCGCT ACTATTCCAA CCTCCGCCTG CCTCTGATGT ACTCCCACCC
CTACAGCCAG ATCACCGTGG AAACCGAGTT TGACAACCCC ATTTACGAGA CAGGGGGAAC
CCAAAAGGTT TAGGGTTTCA TTTAAAAAGA GGTACCCTTT AAAAAGGGGC TTGTGAACTC
AACCCCAATT TCCCCGAGAC ATTTATCCAA AGGCCCTGGG GGCCTTGATT TAAACCCCCA
AAAGGCGGCT GTTTTTTGGT TAAACTTTTT AACAAAGGGT TACGGGTTTT TTCCCCGGAT
XTTATAAATT TTAAAAGTG

Seq ID NO: 208 Protein sequence:
Protein Accession #: NP_066938

1          11         21         31         41         51
|          |          |          |          |          |
MAQEAPQEDT SPMALMDKGE NELTGSASEE SQETTTSTII TTTVITTEQA PALCSVSFSN
PEGYIDSSDY PLLPLNNFLE CTYNVTVYTG YGVELQVKSV NLSDGELLSI RGVDGPTLTV
LANQTLLVEG QVIRSPTNTI SVYFRTPQDD GLGTPQLHYQ AFKLSCNFPR RPDSGDVTVM
DLHSGGVAHP HCHLGYELQG AKMLTCINAS KPHWSSQEPI CSAPCGGAVH NATIGRVLSP
SYPENTNGSQ PCIWTIEAPE GQKLHLHFER LLLKDKDRHT VHSGQTNKSA LLYDSLQTES
VPFEGLLSEG NTIRIEFTSD QARAASTFNI RFEAFEKGHC YEPYIQNGNF TTSDPTYNIG
TIVEFTCDPG HSLEQGPAII ECINVRDPYW NDTEPLCRAM CGGELSAVAG VVLSPNWPEP
YVEGEDCIWK IKVGEEKRIP LDIQFLNLSN SDILTIYDGD EVMPHILGQY LGNSGPQKLY
SSTPDLTIQF HSDPAGLIFG KGQGPIMNYI EVSRNDSCSD LPEIQNGWKT TSHTELVRGA
RITYQCDPGY DIVGSDTLTC QWDLSWSSDP PFCEKIMYCT DPGEVDHSTR LISDPVLLVG
TTIQYTCNPG FVLEGSSLLT CYSRETGTPI WTSRLPHCVS EAAAETSLEG GNMALAIFIP
VLIISLLLGG AYIYITRCRY YSNLRLPLMY SHPYSQITVE TEFDNPIYET GGTQKV



Seq ID NO: 209 DNA sequence 

Nucleic Acid Accession #: NM_001327.1

Coding sequence: 89-631 



1          11         21         31         41         51
|          |          |          |          |          |
AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60
CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120
GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180
TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240
AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300
GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360
CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420
GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480
CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540
CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600
GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660
GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA

Seq ID NO: 210 Protein sequence:
Protein Accession #: NP_001318.1

1          11         21         31         41         51
|          |          |          |          |          |
MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR

Seq ID NO: 211 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 52-459

1          11         21         31         41         51
|          |          |          |          |          |
CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60
GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180
GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGGGC CCCGCGGGGT 240
CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300
GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360
ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420
GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480
TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540
GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600
CTGAGCTA

Seq ID NO: 212 Protein sequence:
Protein Accession #: Eos sequence

1          11         21         31         41         51
|          |          |          |          |          |
MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC
FLPVFLAQAP SGQRR

Seq ID NO: 213 DNA sequence
Nucleic Acid Accession #: NM_000555
Coding sequence: 416..1498

CTTATTTTTT
AGCACAAAGA 
TCTGGGGGGA 
ACTCCCCCTT 
AACCTTGGGT 
TACAACTGTT 
CCCGAAGTTC 
ACTTGATTTT 
GATGAATGGG 
CTTGCAGGCA 
COGCTACTTC 
CTTGCTGGCT 
TTACATTTAC 
GGAAAGCTAT 
CAATCCCAAC 
GGCTAGCAGC 
TACCATCATC 
GACAGCCCAC 
CGGGGTTGTC 
CTTTGGTGAT 
TGATTTTTCT 
TGGCCCAAAG 
CCGAAGCAAG 
CAAGTCTAAG 
GGACCTGTAC 
GAGGGGAGAG 
TCTGCTCAAG 
TATTTTGAAA 
CACTGATCCA 
TCCAGGGATG 
TAAATTTGCC 
ATACAGCAAT 
ACAAACACAA 
GCAAGGCAGC 
CACCAATTCT 
TGCTTAGGAT 
ACATTTCCGA 
TTTGGAATCA 
TCTAGAAGGC 
TGTTCTATTG 
GGTCTGACAC 
CATGTCTGTA 
TTTTGTCTAA 
ATGTTATTTA 
AATAGCAGAG 
CATTAATAAT 
ATATTTTAAG 
CTTTCACTTG 
TAGTTTTCAG 
GCCCATAATG 
GCTTTNACCA 
CACAGCATCC 
CTAAATGGAA 
GTGTGTGTGT 
GTAATGGATT 
GTGGTGGTGG 
ACATTTTCGG 
ACCCCAAATG 
CCATTAATAA 
GTGCCTCCCA 
GGGCATAAAG 
CACCCATAGT 
TGGAGGCTGG 
ACACTAGCTC 
GGATATATTT 



11 
! 

ATGAATGTCG 
CACTGGCTGT 
GGGGATGCAC 
CATAGTCATT 
AGCTCCTTCT 
AGTCATGTGG 
CAATTTGATA 
GGACACTTTG 
TTGCCTAGCC 
CTGAGTAATG 
AAGGGGATTG 
GACCTGACGC 
ACCATTGATG 
GTCTGTTCCT 
TGGTCTGTCA 
AACAGTGCAC 
CGCAGTGGGG 
TCTTTTGAGC 
AAAAAACTCT 
GATGATGTGT 
CTGGATGAAA 
GCATCCCCAA 
TCTCCAGCTG 
CAGTCTCCCA 
CTGCCTCTGT 
TGCTCAGAGT 
TGTCCAACAG 
AACACATTGT 
CAGTTACCAA 
CAAAATGTGC 
CCGTTTAAAT 
TAAAAAGTTT 
GTGCCCCTTT 
TCCCCAGCCT 
GGCTGTCAAT 
CCTGGTGCTG 
AGAGTTTATA 
ATAAGCAATT 
TGTCTAACAT 
AATGCCTTGT 
CTTTCAGTCT 
TTTCAGGAGC 
AAAACACATG 
GAAATTATGC 
GCCAATTCAA 
TTCAATGTGG 
CAACTCTTTT 
TCTTTAACAT 
TCTTTTGAGA 
GCAAAAACAA 
AATATAAAAA 
AAACCAAGCT 
TGAGCTTGCT 
GTGTGCATCT 
GGTGGCAACT 
GGTATCTCAA 
TTCAAGAAAA 
ATGAGGATCT 
GCCCATTTTA 
NAACATTTTG 
AATGGTGGGA 
NTCACTTTAG 
TAAAGAGCAG 
TNTGAGTATT 
TCTTTAGGAT 



21 

I 

GATAGCTGCA 
TCCCTGGAGG 
ACATTAGAGT 
GTACTGAAAT 
GTTCTCTTCA 
GCATGTGTGA 
GGAGCCACTG 
ACGAAAGAGA 
CCACTCACAG 
AGAAGAAAGC 
TGTAOGCTGT 
GATCTCTGTC 
GATCCAGGAA 
CAGACAACTT 
ACGTAAAAAC 
AGGCCAGGGA 
TGAAGCCTCG 
AAGTCCTCAC 
ACACTCTGGA 
TTATTGCCTG 
ATGAATGCCG 
CACCTCAGAA 
ACTCAGCAAA 
TCTCTACGCC 
CCTTGGATGA 
CCAGAGTACA 
GGCTATTGGT 
AATATGTTGG 
TTATGAGAGA 
TAGTCCATGA 
TTGCCCAAAC 
GTGTGGGGAA 
TCTCTGGATC 
CACTCTTCAC 
GGGGAGAAAT 
GGTTAGCTAA 
AAGCACAGTG 
GATAATAGTT 
ACCACATGAT 
TAACAGCCAA 
CTTTTTATAG 
AAACTCTTCA 
AAGAAAATTT 
TGTCACTGCC 
TAGAATCAGT 
ACCAGACATT 
TATCTATAAT 
TAGAAAGGAT 
TACAGGTTTA 
CTAATTTTAA 
TTCCCTTATT 
GCTGTTTGGC 
GTGTGTGTGT 
GCAGCTGCTT 
GGGTGGCACT 
ATGCCCCTAG 
GTGAGATGAT 
CTTTTTGCCC 
CTAANCCCCT 
TAGTTAATTG 
GGCCTGATTT 
GTCTCATTTA 
GACCAGAGGA 
TCCTTGATTG 
AACCTTTGAA 



31 

I 

CCAGCTTGGT 
CTGTCCCTTT 
AGGAAAGAGG 
GCAAAGACTG 
AGGGGAATTT 
GGAAACAGAT 
TCAGTCTCTG 
TAAGACATCC 
CGCCCACTGT 
CAAGAAGGTA 
GTCCTCTGAC 
TGACAACATC 
GATCGGAAGC 
CTTTAAAAAG 
ATCTGCCAAT 
GAACAAGGAC 
GAAGGCTGTG 
TGATATCACA 
TGGAAAACAG 
TGGTCCTGAA 
AGTCATGAAG 
GACTTCAGCC 
CGGAACCTCC 
CACCAGTCCT 
CTCGGACTCG 
AATCCAAGCC 
GCTTTCAAGT 
GTTTATTTTC 
TAGATTGATA 
CCTTTCAATG 
AGTTTTCCTT 
AAAAAAAACT 
TCAAGAATGG 
TCCTGATTGA 
AAACCAACAA 
GAGAATAGAC 
AATTCCTGGT 
TGGAGTAAGG 
TACATGAACT 
CACTGAAAAC 
CAAGAAATCA 
GGCTCCTTTT 
ACCAGAAAAA 
AAACAGTAAC 
TTTTTGATAG 
CTAATTATAT 
CCTAATATTT 
TTCTCTTTAC 
TAACACTGCT 
TTGAAGGTCT 
CCTTGGTAAT 
TACTGAATGG 
GTGGTGGTGG 
CAAAATTAAG 
GCTGATGTGC 
ACAAGCTTCA 
GGTAGTACTG 

ATTTCTTTCT 
GGAAAAAGTG 
TAAAATTCAG 
GTCCATCACC 
AGAATCCAGA 
CGGTATATGT 
CCAACAATNT 



41 
I 

GGGGAAAGGG 
AAAGGAGAAT 
GCTTGGAATA 
CTTCCTAAGC 
TGTCAGGCTA 
GCCAGTTTTA 
AGGTTCCACC 
AGGAACATGC 
AGCTTCTACC 
CGTTTCTACC 
CGTTTTCGCA 
AACCTGCCTC 
ATGGATGAAC 
GTGGAGTACA 
ATGAAAGCCC 
TTTGTGCGCC 
CGTGTGCTTC 
GAAGCCATCA 
GTAACTTGTC 
AAATTTCGCT 
GGAAACCCAT 
AAGAGCCCTG 
AGCAGCCAGC 
GGCAGCCTCC 
CTTGGTGATT 
TATCATTGTA 
TTTTATTTTG 
CTGTGATTTC 
ACCATCCTTT 
GAAAGCTTAG 
TTGTAGAGGG 
CATTGGCAGA 
TGGAGGACCC 
GGCCCGGGTT 
CTTATAATTG 
AGAATTGGAA 
CAATCTCTCC 
GACTTCATAT 
GTATGGTATC 
ACTGTGAGAA 
ATATCCTTTT 
TTATAAACTG 
AAAAAAAAAG 
CTCCAGGAGA 
CTTTTTAACA 
TTTAAATGAA 
CATACTGAAG 
TAAGGACTGA 
TTTTTTTTCC 
TGCTTGCCAN 
GGTGCAAATN 
CTTGCAGTTG 
TGGGAGGGGG 
AAATACTACA 
ACTGTGTAGG 
GATGTCTGTA 
GTTTCTGGTG 
TTTTGTAAAC 
AGAAGCTCAG 
ATACTTGGAT 
GCCAGAACCC 
TTTATTTTAA 
TTTCCTTATG 
ACTACTAGAA 
TCAATAACAA 



51 

I 

TTTGATGAAT 
CTTAGTTTAT 
AAATGAAAAC 
TGGAGATGCT 
TGGATTCATT 
ATGTATTTAG 
AAAATATGGA 
GAGGCTCCCG 
GAACCAGAAC 
GCAATGGGGA 
GCTTTGACGC 
AGGGAGTGCG 
TGGAGGAAGG 
CCAAGAATGT 
CCCAGTCCTT 
CCAAGCTGGT 
TGAACAAGAA 
AACTGGAGAC 
TCCATGATTT 
ATGCTCAGGA 
CAGCCACAGC 
GTCCTATGCG 
TCTCTACCCC 
GGAAGCACAA 
CCATGTAAAG 
GTAGGGTACT 
TTGTTGTTGT 
TCCTCTGGGC 
GGGGCAGCAT 
GGGCCTGGGG 
GTGTTTAAAT 
TCCAAGAATG 
TGGAAGGACA 
TGTTGTCCAG 
TGACACCAGA 
AATACTGCAG 
ACTGAGGCAA 
ACCTGATTCC 
CATCTATCTC 
TTTGTTTTCA 
TATAAAAATT 
GTGATTTTTC 
CCGAAGAATA 
AAACAAGATG 
GTTATGCTTG 
ATGTTACAGC 
ACACAGAAAT 
TCATTTGAAA 
TGTAAACATA 
TCCTGTGTTG 
TTTGGAAAGG 
TTCCTCCACT 
TGGTGCATGT 
AGACACCCCT 
GGGGAACCCA 
GCTACCAAAA 
AAATTGAAAA 
CCATTCAAAA 
GGTTTNCTTA 
TAGGGGGTGT 
CCAATGACTC 
GTTGAGGAAG 
CTTGGGCCTC 
AATACCAAAT 
TAGTACATCT 



60 
120 
TCCATCTTAC TTTTAATCGA GTATAAGGAA ATGTTTCTTT ATGGCCATTT TGGAGGGAGC 3960

AGGGGATGAG GCTTGGCATA GTCCAAAATT TAAGNCTCCA ATAATTAATT GCATTTTAAA 4020 

TTGTTTTAAA TTGGCCCACT TTCAAGGCAA TTTTTTTTGT GTGTCTGTAA CTGAGCTCCT 4080 

CCACCCCTGT CATTCACTTC CAATTTTACC CAATCCAATT TTAOCACTCA AGTTCCATTG 4140 

TGTTAATTTT TGCACGGTCT ACACACATCA AGTCAGCAAG CATTTGCCAC CACTCCCTAT 4200 

ACTTCTCCCT CTTTTTTACA CACACACACA CACACACACA CACAATCCAT CTCTTGCTTG 4260 

TTCCTACCTC CCTGATTTTT CTTCCCTACA GAAATAGAAA TAGGGACAAA GAAGGGGAAA 4320 

ATGTATATAT TGGGGCTGGG CTGAACAACT AACTTCATAA GTAGTATTAA CTAGGGGTAA 4380 

ATTGAGAGAA AAGCTCCTTT TCTCTTCACT GTTTTGGAAA GGATAGCCAT TAGCATGACT 4440 

GCTTTGTGTC CTTATGGACT TTAGTATTAG CCTAGATTGA ATTATAGCGT TTTTCTAGCT 4500 

GAAGGAACCT TAAGATCACA TCATCTACTC CTCTACTCCA AATTTCTCAT TCTTCAGGCC 4560 

AGGAAACOGA GACACAGAGG TAAAGTAATT TCCCCAAGGT CACACAGCTG GCTGGGGCAG 4620 

GATTGGGTTT ACAACCCACA TCTCCTGGCT CTTATTCCAG GGCCTTTTCC CACTAAGTAG 4680 

TATTGCCTTC CATTAGGCTC CTGAGAGTTA TTTCTCAGGG TCATGTTGCA TCTTGGAGCC 4740 

ACATGCTGCT GCCCTGATCT CAGTGGGAAA TNCACCCAGC AACCTAATAC AGCCCCTTTT 4800 

CCCTGCATTC ACCTGGTTCC CATCCACATG GGTTGCAGAT GTCCTTGAAG AGAGTGAGGC 4860 

ATTGAGGGCC AATAGGAGCA ATGGGGTCCC TGGCCTTGTC CATCTGATTC AGGAGATCAC 4920 

TGCTCCATCG TGAGGAGCCC TCTGAATAGC CCCCCACTGA ATGCTTGCCT TGCCCAAATG 4980 

GAATGGAGGA AGATTGATTT TCTCCATCAG TTCACCTTGT GTCATCTCAT AATGGTTGGT 5040 

CTTTCCAGGC TGAGGGAAAT GTTTCTTGTT TCCANAGTAN AAAAAAGAAA GAGTGGAACA 5100 

ATANCTTTGT TCATCCTAAC TTTCTGAGAT GGCTTTTCAA CATTTAAAAA AAACTAGTGT 5160 

GGTACCATTC ACTGGCANGA TTTNTTTTAG AATATGGGAG TAAGATGAGG TAGAGAAAAT 5220 

AACCTGGTCT CACTGTGGTT GCCCTCATCC ACAATGTCCC CAAAGCCATC CTGCTNTGAT 5280 

GAGGACAATT TCCAGGTATA AGCAAGGGGC TTTGTGACAA AAATGTACCC TGGCTGATGT 5340 

TAAACATTGG CTCCTGTGTT TGCACCAAAA TAGCAAGCTG TGTGCTCTAT ACACTCTTCC 5400 

CATCGTCTTG TGTACACTGC TCCTGTGGCC TTCCACAGCA GAAACCAGGG CAAAAGGGTC 5460 

CAAACACATG GTTTTCCTTG CTGCAAGGCT NTTCCTGGGA ACTAAGGGGG TATTTATTAG 5520

TTCAGTTNTA AGAGACCTCC TTCTGGGCTT ACCCCACTCC TCAGGTACTT CTCTCTCCTT 5580 

CCTCCTTCTC CTCCACAGTC ACAAGTAACC AAGGAACCTG AAAGTGGATG TGTAGCTATT 5640 

TGAAGAAGGC AAGGAACCCT GAGATTCTTC TTTGAATCCT TTAGTCCAAG TCTTAGACCA 5700 

GTGATTGGTG CTTACCTTGA ACAAAATTTT GTCTGTGTTC CTAATCCCTT CAATACTNTG 5760 

GGTACAATGC TCCCAATCAC CCTGCACATT TGATTCTAAA TGGCTTTTAT TTTTTAAAAA 5820 

TCCATATCCC TAGGACAAGA NAACAGGATG CCTATATCCC CAAAATGAGC TCCAGGACAC 5880 

TGATGGGAAT GATCCCAANG ATCACCCCAC CTCAGAAAAC GTCTGTGCCA ANAGACTTCC 5940 

CCAGATAGAA NCACTGGGAC AGTGGTTTGA ACGACTTCTT TTATGGTTGT CCAGTTTGCT 6000 

ATGGAAATAA AAGGCATTGA TTTTTTAAAA AAGATGATTG GAACCTGTCT TTGGCCACAT 6060

AGGGCCACTT GGATCCATTT CCAGGCCTTA CTCATATATT GCCTTCACTG AAGGGCTTTG 6120 

GCTTTAAGTC CCAGACTGGT CTCCCAAGTG AACCATAAGT GTTTTGGAGC TCATCTGGGG 6180 

TGAGGCATGA GAATGTTGCC CCATCTATCC CTTCAGGAAA AGGTGCCTTC CCTCCCTTTC 6240 

TCCTAAAGCC TGGTCCCCAA AAATTGTTTT TGTCTOCAAA AGTCTAGTAT GGTCTTTATA 6300 

CACCCANACT CTTAGTGTTG CGTCCTGCCT TGTTTCCTTG TTAAGGATCT ATGCANACCT 6360 

CCCGCTTTGG CTTAGCTAGC GTGACATTGG CTATCATTTG ACAAGACTAA CTTTTTTTTT 6420 

TTTTTTTTTG ACTGAGTCTC CCTCTGTCAC CTAGGCTGGA GTGCAGTGGC ACAATCTTGG 6480 

CTCGCTGCAA CCTTCACCCT TCACCTCCCA GGTCGAAGCG ATTCTCCTGC CTCAGTCTCC 6540 

CGAGTAGCTG GGATTACAGG CGTGCGCCAC CAAATCTGGC TATTTTTTTA TTATTATTAT 6600 

TTTTAGTAGA GATGGGGTTT CACCATGTTG GCCAGACTGG TCTTGAACTC TTGGCCTCAA 6660 

ATTATCTGCC CACCTCGGCC TCCCAAAGTG CTGGGATTAC AGGCATGAGC ACCATGCCCA 6720 

GCTGACAAGA CTAATTTTTT ATCCCTTGGT TTATTGGCTT CAACATCTTC TGGAATCAGA 6780 

GGTGATTTTT TCTTACCTTG GATGCCTGAG AC7AGGGGAG TATAGAATTC CAATTGGTAA 6840 

TTAAGGCATC TTTCTGCTCC TGATCAGAAG GGCAGGTTAG TTGGGAGAGG TCAGATGGCA 6900 

CAACAGAAG7 CACCTTGTAA GTAAGGCAAA GACTTTGAAG GCATTAGOGT TTCTCATTAC 6960 

TTAGGTCAAT AACCTTGAGG GAATCAATGG CTTTTTTGCC GCTCTACCTC TTTGTGTATC 7020 

TCTTTGACTT TTCTTTCTCT GTCTAGTTTC CTCTGTTCTC AGTTTATATT CTATGTTATC 7080 

AGTCTCTCTT TCCACAGTAC AAACATCCAT CCTTTCTCCT GTGCAATTCT GTCTCTCCCT 7140 

CTTATTATCT TTATTTGTAC TTTTTCCTTC CTCCCTGTCT AGGCATTGGG CATGTGCCTC 7200 

TTCTTAGCCT GTGATTTTGC CTTGGGACTG ATGATAAATT ATTTCCAGAT TCAATCAGCC 7260 

CTGGTCCTAC CCCAGTCCAA TCAGAAGTAT GTTGGTGGGG AATCAACCTG ATCCTGGCCC 7320 

TTTCTTCTTC TCCATTTTCA TTOGTAATCC CCCTCAGCAG ATCTTTACAA GCAGTTTCCT 7380 

TATAGCTCAT GTATCTTTAG GTCTTTGCCT TCCAAGCACT GTACAGAATA CTTTGTGGTT 7440 

CCTTTTTAGT CTGACATTTT GTGGAGCAGT GAAGCGTGCT CAGAGACATA ATCAGCTGAA 7500 

GAGAAAAAAT CCACCCATGG ATTTATATCA GCTAAATACT AATAATTGAT TTTGTTTGAT 7560 

GTGCCCATAA TTTTTAAAGC TGCAATATAA TATAATGAGG GACCACAGGT AATTTCTCCT 7620 

GTCATTTGTT TTGGCTGGAT GGGGGTGGGG GAGTAATTGC TTAAAGTTTT ACCATTACAC 7680 

ATTAAACTCT CTATAATAAT CTTGTTTGGG GCTTGCTAAC TGTTGAGCTG TTTTAACTAA 7740 

ACTGGTAGGC AATCGGAGTT GATTTAAATG AAAAGATAAT TTAACAAATC TATACTATAA 7800 

AAAGAGACAT TTGCTTAATT GACATGTATT TTTTCCTTCT GAGTCACCTA AACATTTACT 7860 

CTTGACACCA ACTGTTCATG ATACTGAATA GACAGTCCAT ATAAGAGAAA TTAGTGGACC 7920 

TAAAGAAGCC AGATTGTAGG TGTTAATTTA TTAAACAGAA TTGCAAAGCC CTTGGAAATG 7980 

TCACTGCTTG GCAATACCAT ATGGCATGCC AAAATTTACA ATGACTTTTC TTTATAAGTT 8040

ATCCAAAAGG GATTTGAACA AGTAAGAGGT TATGCCAAAA TGTCTCCAAT GTATGGTCCT 8100 

GTAATATATT GCAGCTTGAA GCCAATGATC CCTTATGACT TGTATACAAC TAATGCATGT 8160 

TTTATTGAAT TTTGCATTTC CCACGTGTGG TAAGTCTTTA AAATGTTTTT GATCACCTTT 8220 

NTGTGCCATT AAACTTGTAC AGAAAATGTT TTTATGGCCA TTTTCAAAGG GAGAAAGTTT 8280 

AAAATGGAAA CAGCCCACCC TTTCTGCCCT ATAGCTGTAG TTAGAATTGA GTACCTGTAG 8340 

CAAAACAGCT GTAATTGGTG GTTGTAGTGT TAGAGGTGTT AGCTTGCTAG TGACTAGCTT 8400 

TGGAGAGTAA ATGCATGGTA TTGTACATCA CATTTCTTAA CTOGTTTTAA CCTCTGAAAA 8460 

GAATATATTC TTCTTTGTAG TCCTTCTTCC CACCCCCTTG CCCTCTCCCT CTCCCTGCTC 8520 

CCAGTTGTCT TACAGTTGTA AATATCTGAT TTGAGGCCCA ATAACTCTTG CCAAGTAAAG 8580 

TCAGCAAACA ACAAACAAAC CAAAATGTGG GGAAAAGGCA TTTCTCAACC ATCTCTCAGC 8640 

AGTTATTGAT CATTTCTTAA GGAACAGCAT TGTGATCAAA GACTCAACTT TACGTAAAAA 8700 

TCAGTGGTAA ATTGGGGTTG TATTGGCCAT TGATTACATT CAGGATTGAA TAGTTTTCAG 8760 

AATCACATGT AATCCAAAGA CAGTAGGTAG TGATGTCCCT TATCCCTGCA GCTGTTTTAA 8820 

GATAGAGACC TCAGAAGACT CTGCTTGACC GATGACCAAT AATTATTTGA AAAAAAAAGA 8880 

AAAAATGAGA GAAATAAAAC AGATATTTAA GAACTTTAGC CACCTATTTA GAATAGTTAT 8940 

AGCCAGAAAA AAAAACAAGG GCATGAGTTC AAATGCATTA CTATCAGTGT CCTAGGCAAT 9000 

ACCTAACCTA CTCTGAAATT GTGATTCAAA AGCAGTATTT CAAGAGGCAT TCTCCTTTTT 9060 

TGGTTTGCTG ACCCCACTTG GACTGGTAGG TTTGGTGAGG CCCCCATAAA CCAGCTGGAG 9120 
CAGACCCTTT TCATCTCCTG TGCCTGTAAC ACCCCTCTTC CCCCACCCCC TCCGCAATTC 9180
AATGAGGGCT TTCTTGGGTC AGAGGACTTC AAGGTTGTCT AGAGAAGTTT GCCATGTGTG 9240
TAAGGTGCTG TGAACTGTQA GTGCTGAAGA TTCGCAGCAT TCAATACCAG GCAGCCAAAG 9300
AGCTGCTCTT GCAATTATTT TGGCTCTCAA GCTCTGTTCT TCATOGCATT CTCATTTCTG 9360
TGTACATTTG CAAGATGTGT GTAATGTCAT TTTCCAAAAA TAAAATTTGA TTTCAAT

Seq ID NO: 214 Protein sequence:
Protein Accession #: NP_000546

1          11         21         31         41         51
|          |          |          |          |          |
MELDFGHFDE RDKTSRNMRG SRMNGLPSPT HSAHCSFYRT RTLQALSNEK KAKKVRFYRN 60
GDRYFKGIVY AVSSDRFRSF DALLADLTRS LSDNINLPQG VRYIYTIDGS RKIGSMDELE 120
EGESYVCSSD NFFKKVEYTK NVNPNWSVNV KTSANMKAPQ SLASSNSAQA RENKDFVRPK 180
LVTIIRSGVK PRKAVRVLLN KKTAHSFEQV LTDITEAIKL ETGVVKKLYT LDGKQVTCLK 240
DFFGDDDVPI ACGPEKFRYA QDDFSLDENE CRVMKGNPSA TAGPKASPTP QKTSAKSPGP 300
MRRSKSPADS ANGTSSSQLS TPKSKQSPIS TPTSPGSLRK HKDLYLPLSL DDSDSLGDSM

Seq ID NO: 215 DNA sequence
Nucleic Acid Accession #: NM_130467
Coding sequence: 312..644

1          11         21         31         41         51
|          |          |          |          |          |
GGCACGAGGC AGAGCTCTGC AAGGAGAGGT TGTGTCTTCG TTCTTTCOGC CATCTTCGTT 60
CTTTCCAACA TCTTCGTTCT TTCTCACTGA CCGAGACTCA GCOGGTAGGT CTGCAGAGTG 120
GTCTTCCTGG TAATTTAGTT GTGAGTGAAT GTGTGGAGGA GCCAGCGGGC TTAGGACAGG 180
TCCTGTGGCA CAGTCCGTGG CTTTGAGGGA AAAGGGCCTC GCGGTGGTCC TCCGCCTTCC 240
CCCAGGTCGT GATGCAGGOG CCATGGGCCG GTAATCGTGG CTGGGCTGGA ACGAGGGAGG 300
AAGTGAGAGA TATGAGTGAG CATGTAACAA GATCCCAATC CTCAGAAAGA GGAAATGACC 360
AAGAGTCTTC CCAGCCAGTT GGACCTGTGA TTGTCCAGCA GCCCACTGAG GAAAAACGTC 420
AAGAAGAGGA ACCACCAACT GATAATCAGG GTATTGCACC TAGTGGGGAG ATCAAAAATG 480
AAGGAGCACC TGCTGTTCAA GGGACTGATG TGGAAGCTTT TCAACAGGAA CTGGCTCTGC 540
TTAAGATAGA GGATGCACCT GGAGATGGTC CTGATGTCAG GGAGGGGACT CTGCCCACTT 600
TTGATCCCAC TAAAGTGCTG GAAGCAGGTG AAGGGCAACT ATAGGTTTAA ACCAAGACAA 660
ATGAAGACTG AAACCAAGAA TATTGTTCTT ATGCTGGAAA TTTGACTGCT AACATTCTCT 720
TAATAAAGTT TTACAGTTTT CTGCAAAAAA AAAAAAAAAA

Seq ID NO: 216 Protein sequence:
Protein Accession #: NP_569734

1          11         21         31         41         51
|          |          |          |          |          |
MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 60
AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTLPTFDPT KVLEAGEGQL

Seq ID NO: 217 DNA sequence
Nucleic Acid Accession #: NM_001476.1
Coding sequence: 82..435

1          11         21         31         41         51
|          |          |          |          |          |
GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 60
TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 120
CCAAGGCGCT ATGTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 180
GATGAAGTGG AACCAGCAAC ACCTGAAGAA GGGGAACCAG CAACTCAACG TCAGGATCCT 240
GCAGCTGCTC AGGAGGGAGA GGATGAGGGA GCATCTGCAG GTCAAGGGCC GAAGCCTGAA 300
GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 360
GGGCAGGAGG TGGACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 420
CAATCACAGT GTTAAAAGAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGTTGGAAAT 480
TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA

Seq ID NO: 218 Protein sequence:
Protein Accession #: NP_001467.1

1          11         21         31         41         51
|          |          |          |          |          |
MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QPSDEVEPAT PEEGEPATQR QDPAAAQEGE 60
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEKQSQC

Seq ID NO: 219 DNA sequence
Nucleic Acid Accession #: NM_001476
Coding sequence: 90-3671

1          11         21         31         41         51
|          |          |          |          |          |
ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60
AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120
GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180
ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240
TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300
GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360

CTCTTAGTGC 
CCAGATGCGA 
ACCAGAGACT 
ACOCQGGCCO 
CAGGTTACTA 
GGCATTCAGC 
TTCATCAAGA 
AATGGTCACA 
TTGTGGCTCC 
TTGACTACCG 
GTGCTGGTCT 
TCACCAAGAC 
TGAGTTACTT 
CATATGGAGA 
TCTCTGGAGC 
AATTCTGCCA 
GCACCTGTAT 
GTTATTCAGG 
ACGATCCGCA 
CAGTGATGCC 
CCOGCTGTGA 
TGAGGCCTTG 
GTGACCGGCT 
ACCAGTGCAA 
GAGCTTGCAA 
GTGTTTGCAA 
CTTGCTATAA 
AGGCCCTGAT 
GCAGGATGCA 
AAGGTGCTAG 
ACCAGAGCCG 
AGTACCAGAA 
CAGAAAGTGA 
CAAATGGCTT 
CAGCCAGTAA 
CACTGGTGCG 
CTGTGGTGCA 
CAAGGGAGGC 
TCCTGGATTC 
CAAAGAGGAT 
AGTTCAAGCG 
AGAATGGAAA 
AAAGCAGAGC 
TCCTTAAAAA 
AAGCCATGAA 
AGCAAGCAGA 
CCGGGGAGGC 
AAGCCAATGT 
GTGAGATGAG 
TGGATGCAGT 
CTGGGGTTAC 
AGCCTCTCAG 
AGACCCAGAT 
AGCAGAGGGG 
AGAACTTGGA 
AGCAACAGTG 
GGCTCGGGAG 
TATGCTCAGG 
TGCACCATAC 
ATGATCAAGG 
ATGTTTGCCT 
ATAGTCAACT 
ATGAAATTCT 
ACTATTGCCT 
ACCCAGGGTG 
AGGACCTGTA 
GTTCTGGACC 
ATTTTTATTA 
GTTTCAAAGT 
ATTAGTCCTA 
ATCTTATTTT 
CACACTTCAG 
TTACCTCCAT 
GTGGGACAGT 
AGCATTTTTA 
GCAATAACCG 
CATGGGGGCA 
CATTCCAGCT 
TAACACCAGT 
TGGTGCTGCC 
CAATTGTTAG 



TCGATGTGAC 
CCGATGTCTO 
GCTAGACTCC 
CTGTGTCTGC 
TAATCTGGAT 
CAGCTGCCGC 
TGTTGATGGC 
GCGCCATCAA 
TGCCAAATTT 
TGTGGACAGA 
ACGGATCACA 
TTACACATTC 
TGAGTATCGA 
ATACAGTACT 
CCCAGCACCC 
GGATTGTGCT 
TCCTTGTAAC 
GGATGAGAAT 
CGACCCCCGC 
GGAGACGGAG 
GCTCTGTGCT 
TCAGCCCTGT 
GACAGGCAGG 
AGCAGGCTAC 
CTGTAACCCC 
GCCAGGATTT 
TCAAGTGAAG 
TTCAAAGGCT 
GCAGGCTGAG 
CAGATCCCTT 
CCTGGATGAC 
CCGAGTTCGG 
AGCTTCCTTG 
TAAAAGTCTG 
CATGGAGCAA 
CAAGGCCCTG 
AGGGCTTGTG 
CACTCAAGCG 
AGTGTCTCGG 
CAAACAAAAA 
TACACAAAAG 
AAGTGGGAGA 
ACAAGAAGCA 
CCTCAGAGAG 
GAGACTCTCC 
AAGAGCCCTG 
CCTGGAAATC 
GACAGCAGAT 
GGAAGTGGAA 
ACAGATGGTG 
AATCCAAGAC 
TGTAGATGAA 
CAACAGCCAA 
CCACCTCCAT 
GAACATTAGG 
AAGCTGCCAT 
CCATGTCATG 
TCAACTGACC 
TCCTTGCTTC 
ATCTGGACCC 
CATAATAGTC 
TATTCTTTGA 
TCCTAATGTC 
CATATTGTCC 
TGAACATGTT 
AGGCAGGCCC 
TGGGCATGAC 
AAGCATTTCC 
GATAGAAAAG 
ATTCAATCCT 
CTCAATCTCC 
CTGGGTCACA 
CCATCCTTCC 
GGTGACATAG 
AAAAATAAAT 
CTTGGTTTGC 
CTTGAGTTTT 
GTCACTCTGT 
GGGAATTGC7 
TTGCTTCTGT 
ATGCC 



AACTCTGGAC 
CCAGGCTTCC 
AAGTGTGACT 
AAGCCAGCTG 
GGGGGGAACC 
AGCTCTGCAG 
TGGAAGGCTG 
GATGTGTTTA 
CTTGGGAATC 
GGAGGCAGAC 
GCTCCCTTGA 
AGGTTAAATG 
AGGTTACTGC 
GGGTACATTG 
TGGGTTGAAC 
TCTGGCTACA 
TGTCAAGGGG 
CCTGACATTG 
AGCTGCAAGC 
GAGGTGGTGT 
GATGGCTACT 
CAATGCAACA 
TGTTTGAAGT 
TTCGGGGACC 
ATGGGCTCAG 

ATTCAGATGG 
CAGGGTGGTG 
CAGGCCCTTC 
GGTCTCCAGT 
CTCAAGATGA 
GATACTCACA 
GGAAACACTA 
GCTCAGGAGG 
CTGACAAGGG 
CATGAAGGAG 
GAAAAATTGG 
GAAATTGAAG 
CTTCAGGGAG 
GCGGATTCAC 
AATCTGGGAA 
GAGAAATCAG 
CTGAGTATGG 
TTTGACCTGC 
TACATCAGCC 
GGGAGCGCTG 
TCCAGTGAGA 
GGAGCCTTGG 
GGAGAGCTGG 
ATTACAGAAG 
ACACTCAACA 
GAGGGGCTGG 
CTGCGGCCCA 
TTGCTGGAGA 
GACAACCTGC 
AAATATTTCT 
TGAGTGGGTG 
TGACCCCATT 
CTGATGCTGG 
CAAAGAATAG 
GTAAGTGGAG 
GTAATGTGAC 
AGAACAGAGT 
TCTGCAAGCT 
CTCCATTTTC 
ATTCAGAGCT 
ATCCTTTCTT 
TACCAGCAAA 
TGTGGCTTGG 
ACTTTTCGAA 
TCTCTCTTTC 
TCCATCCCTC 
AACATATATT 
TCTCTGCCCT 
TTAAACTTAC 
AACCTCTTTG 
GGCAAGGCTG 
GCCTTTCTAC 
GGAGGAACCA 
ATTTCCTTGG 



GGTGCAGCTG 
ACATGCTCAC 
GTGACCCAGC 
TTACTGGAGA 
CTGAGGGCTG 
AATACAGTGT 
TCCAACGAAA 
GCTCAGCCCA 
AACAGGTGAG 
ACCCATCTGC 
TGCCACTTGG 
AGCATCCAAG 
GGAATCTCAC 
ACAATGTGAC 
AGTGTATATG 
AGAGAGATTC 
GAGGGGCCTG 
AGTGTGCTGA 
CATGTCCCTG 
GCAATAACTG 
TTGGGGACCC 
ACAATGTGGA 
GTATCCACAA 
CATTGGCTCC 
AGCCTGTAGG 
ACTGTGAGCA 
ATCAGTTTAT 
ATGGAGTAGT 
AGGACATTCT 
TGGCCAAGGT 
CTGTGGAAAG 
GGCTCATCAC 
ACATTCCTGC 
CCACAAGATT 
AAACTGAGGA 
TCGGAAGCGG 
AGAAAACCAA 
CAGATAGGTC 
TCAGTGATCA 
TCTCAACGCT 
ACTGGAAAGA 
ATCAGCTGCT 
GCAATGCCAC 
AGGTGGACAA 
AGAAGGTTTC 
CTGCTGATGC 
TTGAACAGGA 
CCATGGAAAA 
AAAGGAAGGA 
CCCAGAAGGT 
CATTAGACGG 
TCTTACTGGA 
TGATGTCAGA 
CAAGCATAGA 
CCCCAGGCTG 
CAACTGAGGT 
GGATGGGGAC 
CCTGATCCCA 
GCAATGAGGC 
ACTGGATGGA 
TCCTGGAATT 
TAAAGGAAAA 
GCAACCCAGT 
TCTTGCTGAT 
AAGCTGGAAG 
ATGGTGCTTG 
TTAATGATGC 
GCAAATGTTG 
GCATTGAAAG 
CACCAAAAAT 
CTCCACCCAT 
CATTCATCCT 
TATTGAGTAC 
CATAGAGTTG 
AAACTTTGTT 
CTCAACAGAA 
ACAGAGCTCT 
AACTGATTGC 
GAGGCACTTC 
ATTTTCCTGA 



TAAACCAGGT 
GGATGCGGGG 
TGGCATCGCA 
ACGCTGTGAT 
TACCCAGTGT 
CCATAAGATC 
TGGGTCTCCT 
ACGACTAGAC 
CTATGGGCAA 
CCATGATGTG 
CAAGACACTG 
CAATAATTGG 
AGCCCTCCGC 
CCTGATTTCA 
TCCTGTTGGG 
AGCGAGACTG 
TGATCCAGAC 
CTGCCCAATT 
TCATAACGGG 
CCCTCCCGGG 
CTTTGGTGAA 
CCCCAGTGCC 
CACAGCCGGC 
CAACCCAGCA 
ATGTCGAAGT 
TGGAGCATTC 
GCAGCAGCTT 
ACCTGATACA 
GAGAGATGCC 
GAGGAGCCAA 
AGTTCGGGCT 
TCAGATGCAG 
CTCAGACCAC 
AGGAGAAAGC 
CTATTCCAAA 
AAGCGGTAGC 
GTCCCTGGCC 
TTATCAGCAC 
GTCCTTTCAG 
GGTAACCAGG 
AGAAGCACAG 
TTCCCGTGCC 
TTTTTATGAA 
CAGAAAAGCA 
AGATGCCAGT 
ACAGAGGGCA 
GATTGGGAGT 
GGGACTGGCC 
GCTGGAGTTT 
TGATACCAGA 
CCTCCTGCAT 
GCAGAAGCTT 
GCTGGAAGAG 
TGGGATTCTG 
CTACAATACC 
TCTTGGGATA 
ATTTGAACAT 
TGGCCAGGTG 
AGATAGCACT 
AAGACAAACT 
TGGACAAGTG 
AACTTTGACT 
CACACTGTGG 
CAGAGTTCCT 
AAGTGAGCAG 
CTGGTGCCTG 
CATGGCAACT 
GGAAAGTATT 
AGGTAAAATT 
GATGCGCATC 
AATAAGAGAA 
TCCATCCATC 
CTACTGTGT6 
ATTGTCTAGT 
TGTCACAAGT 
CATATGTTGC 
GGGTTGTGCA 
AACAGACTGT 
CACCTTGGCT 
AAGTGTTTTT 



GTGACAGGAG 
TGCACCCAAG 
GGGCCCTGTG 
AGGTGTCGAT 
TTCTGCTATG 
ACCTCTACCT 
GCAAAGCTCC 
CCTGTCTATT 
AGCCTGTCCT 
ATTCTGGAAG 
CCTTGTGGGC 
AGCCCCCAGC 
ATCCGAGCTA 
GCCCGCCCTG 
TACAAGGGGC 
GGGCCTTTTG 
ACAGGAGATT 
GGTTTCTACA 
TTCAGCTGCT 
GTCACOGGTG 
CATGGCCCAG 
TCTGGGAATT 
ATCTACTGCG 
GACAAGTGTC 
GATGGCACCT 
AGCTGTCCAG 
CAGAGAATGG 
GAGCTGGAAG 
CAGATTTCAG 
GAGAACAGCT 
CTGGGAAGTC 
CTGAGCCTGG 
TACGTGGGGC 
CACGTTGAGT 
CAAGCCCTCT 
CCGGACGGTG 
CAGCAGTTGA 
AGTCTCCGCC 
GTGGAAGAAG 
CAT ATGGATG 
CAGCTCTTAC 
AATCTTGCTA 
GTTGAGAGCA 
GAAGCTGAAG 
GACAAGACCC 
AAGAATGGGG 
CTGAACTTGG 
TCTCTGAAGA 
GACACGAATA 
GCCAAGAACG 
CTGATGGACC 
TCCCGAGCCA 
AGGGCACGTC 
GCTGATGTGA 
CAGGCTCTTG 
CAGATCTCAG 
GTTTAATGGG 
GTTGTCTTAT 
GGGTGTGAGA 
GCACAGGCAG 
CTGTTGGGAT 
TTGCCCAGGC 
CCAGTAAAAT 
CCTACTTACA 
TGTTX5GAGTG 
CCACGTTCAA 
TAGAGATTGC 
TACTTTTTCG 
CTCTAGATTT 
AATGTATTTT 
TGTTCCTACT 
TTTCCATCCA 
CCAGGGGCTG 
GAGGAAGACA 
GGTGTTTATT 
AAGACCCTCC 
CATTTCTTTG 
TGAGTTATGA 
GGGAAGACTA 
AAATAAAGAA 



420 
480 
540 
600
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 



Seq ID NO: 220 Protein sequence:
Protein Accession #: NP_005553

1          11         21         31         41         51
|          |          |          |          |          |

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSHQCIFD RELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAG^RITAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ L5YFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPttVB QCICPVGYKG QPCQDCASGY KRDSARLGPP GTCIPCNCQG 420 

GGACDPDTGD CVSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGPSC SVMPETEEW 480 

CNNCPPGVTG ARCELCADGY FGDPPGEHGP VRPOQPCQCM NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQIiQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKAliHEG VGSGSGSPDG AWQGLVEKL EKTKSLRQQL TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EPKRTQKNLG 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEOEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLIiHLMD QPLSVDEEGI* VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLEHIRDNL PPGCYNTQAL EQQ 

Seq ID NO: 221 DNA sequence 
Nucleic Acid Accession #: NM_016529
Coding sequence: 13-1854 

1 11 21 31 41 51 

1 1 1 1 I I 

GTCAAGAAAA GAATGTCTGT AATTGTTCGA ACTCCTTCAG GACGACTTCG GCTTTACTGT 60 

AAAGGGGCTG ATAATGTGAT TTTTGAGAGA CTTTCAAAAG ACTCAAAATA TATGGAGGAA 120 

ACATTATGCC ATCTGGAATA CTTTGCCACG GAAGGCTTGC GGACTCTCTG TGTGGCTTAT 180 

GCTGATCTCT CTGAGAATGA GTATGAGGAG TGGCTGAAAG TCTATCAGGA AGCCAGCACC 240 

ATATTGAAGG ACAGAGCTCA ACGGTTGGAA GAGTGTTACG AGATCATTGA GAAGAATTTG 300 

CTGCTACTTG GAGCCACAGC CATAGAAGAT CGCCTTCAAG CAGGAGTTCC AGAAACCATC 360 

GCAACACTGT TGAAGGCAGA AATTAAAATA TGGGTGTTGA CAGGAGACAA ACAAGAAACT 420 

GCGATTAATA TAGGGTATTC CTGCCGATTG GTATCGCAGA ATATGGCCCT TATCCTATTG 480 

AAGGAGGACT CTTTGGATGC CACAAGGGCA GCCATTACTC AGCACTGCAC TGACCTTGGG 540 

AATTTGCTGG GCAAGGAAAA TGACGTGGCC CTCATCATCG ATGGCCACAC CCTGAAGTAC 600 

GCGCTCTCCT TCGAAGTCCG GAGGAGTTTC CTGGATTTGG CACTCTCGTG CAAAGCGGTC 660 

ATATGCTGCA GAGTGTCTCC TCTGCAGAAG TCTGAGATAG TGGATGTGGT GAAGAAGOGG 720 

GTGAAGGCCA TCACCCTCGC CATCGGAGAC GGCGCCAACG ATGTCGGGAT GATCCAGACA 780 

GCCCACGTGG GTGTGGGAAT CAGTGGGAAT GAAGGCATGC AGGCCACCAA CAACTCGGAT 840 

TACGCCATCG CACAGTTTTC CTACTTAGAG AAGCTTCTGT TGGTTCATGG AGCCTGGAGC 900 

TACAACCGGG TGACCAAGTG CATCTTGTAC TGCTTCTATA AGAACGTGGT CCTGTATATT 960 

ATTGAGCTTT GGTTCGCCTT TGTTAATGGA TTTTCTGGGC AGATTrTATT TGAAGGTTGG 1020 

TGCATCGGCC TGTACAATGT GATTTTCACC GCTTTGCOGC CCTTCACTCT GGGAATCTTT 1080 

GAGAGGTCTT GCACTCAGGA GAGCATGCTC AGGTTTCCCC AGCTCTACAA AATCACCCAG 1140 

AATGGCGAAG GCTTCAACAC AAAGGTTTTC TGGGGTCACT GCATCAACGC CTTGGTCCAC 1200 

TCCCTCATCC TCTTCTGGTT TCCCATGAAA GCTCTGGAGC ATGATACTGT GTTTGACAGT 1260 

GGTCATGCTA CCGACTATTT ATTTGTTGGA AATATTGTTT ACACATATGT TGTTGTTACT 1320 

GTTTGTCTGA AAGCTGGTTT GGAGACCACA GCTTGGACTA AATTCAGTCA TCTGGCTGTC 1380 

TGGGGAAGCA TGCTGACCTG GCTGGTGTTT TTTGGCATCT ACTCGACCAT CTGGCCCACC 1440 

ATTCCCATTG CTCCAGATAT GAGAGGACAG GCAACTATGG TCCTGAGCTC OGCACACTTC 1500 

TGGTTGGGAT TATTTCTGGT TCCTACTGCC TGTTTGATTG AAGATGTGGC ATGGAGAGCA 1560 

GCCAAGCACA CCTGCAAAAA GACATTGCTG GAGGAGGTGC AGGAGCTGGA AACCAAGTCT 1620 

CGAGTCCTGG GAAAAGCGGT GCTGCGGGAT AGCAATGGAA AGAGGCTGAA CGAGCGCGAC 1680 

CGCCTGATCA AGAGGCTGGG CCGGAAGACG CCCCCGACGC TGTTCCGGGG CAGCTCCCTG 1740 

CAGCAGGGCG TCCCGCATGG GTATGCTTTT TCTCAAGAAG AACACGGAGC TGTTAGTCAG 1800 

GAAGAAGTCA TCCGTGCTTA TGACACCACC AAAAAGAAAT CCAGGAAGAA ATAAGACATG 1860 

AATTTTCCTG ACTGATCTTA GGAAAGAGAT TCAGTTTGTT GCACCCAGTG TTAACACATC 1920 

TTTGTCAGAG AAGACTGGCG TCCAAGGCCA AAACACCAGG AAACACATTT CTGTGGCCTT 1960 

AGTTAAGCAG TTTGTTAGTT ACATATTCCC TCGCAAACCT GGAGTGCAGA CCACAGGGGA 2040 

AGCTATCTTT GCCCTCCCAA CTCGTCTGCA GTGCTTAGCC TAACTTTTGT TTATGTCGTT 2100 

ATGAAGCATT CAACTGTGCT CTGTGAGGTC TCAAATTAAA AACATTATGT TTCACCAATA 2160 
AGAAAAAAAA AAAAAAA 

Seq ID NO: 222 Protein sequence: 
Protein Accession #: NP_057613 

1 11 21 31 41 51 

| | | | | | 

MSVIVRTPSG RLRLYCKGAD NVIPERLSKD SKYMEETLCH LEYFATEGLR TLCVAYADLS 60 

ENEYEEWLKV YQEASTILKD RAQRLEECYE IIEKNLLLLG ATAIEDRLQA GVPETIATLL 120 

KAEIKIWVLT GDKQETAINI GYSCRLVSQN MALILLKEDS LDATRAAITQ HCTDLGNLLG 180 

KENDVALIID GHTLKYALSF EVRRSFLDLA LSCKAVICCR VSPLQKSEIV DVVKKRVKAI 240 

TLAIGDGAND VGMIQTAHVG VGISGNEGMQ ATNNSDYAIA QFSYLEKLLL VHGAWSYNRV 300 

TKCILYCFYK NVVLYIIELW FAFVNGFSGQ ILFERWCIGL YNVIFTALPP FTLGIFERSC 360 

TQESMLRFPQ LYKITQNGEG FNTKVFWGHC INALVHSLIL FWFPMKALEH DTVFDSGHAT 420 

DYLFVGNIVY TYVVVTVCLK AGLETTAWTK FSHLAVWGSM LTWLVFFGIY STIWPTIPIA 480 

PDMRGQATMV LSSAHFWLGL FLVPTACLIE DVAWRAAKHT CKKTLLEEVQ ELETKSRVLG 540 

KAVLRDSNGR RLNERDRLIK RLGRKTPPTL FRGSSLQQGV PHGYAFSQEE HGAVSQEEVI 600 
RAYDTTKKKS RKK 

Seq ID NO: 223 DNA sequence 
Nucleic Acid Accession #: BC017001 
Coding sequence: 1-394 

1 11 21 31 41 51 
| | | | | | 

AACGCTGGGC AGGGCCGGCG CGGGTCGGGG GGCGCCCGAG GGGCCCGGGC CGAGCGGCGG 60 
CGCGCAGGGC GGCAGCATCC ACTCGGGCCG CATCGCCGCG GTGCACAACG TGCCGCTGAG 120 
CGTGCTCATC CGGCCGCTGC CGTCCGTGTT GGACCCOGCC AAGGTGCAGA GCCTCGTGGA 180 
CACGATCCGG GAGGACCCAG ACAGCGTGCC CCCCATCGAT GTCCTCTGGA TCAAAGGGGC 240 
CCAGGGAGGT GACTACTTCT ACTCCTTTGG GGGCTGCCAC CGCTACGCGG CCTACCAGCA 300 
ACTGCAGCGA GAGACCATCC CCGCCAAGCT TGTCCAGTCC ACTCTCTCAG ACCTAAGGGT 360 
GTACCTGGGA GCATCCACAC CAGACTTGCA GTAGCAGCCT CCTTGGCACC TGCTGCCACC 420 
TTCAAGAGCC CAGAAGACAC ACCTGGCCTC CAGCAGGCTG GGCCATGCAG AAGGGATAGC 480 
AGGGGTGCAT TCTCTTTGCA CCTGGCGAGA GGGTCTGACT CTGGGCACCC CTCTCACCGG 540 
CTACAAGGCC TTGGACTCAC TGTACAGTGT GGGAGCCCCA GTTCCCACCT CTGTGACAAT 600 
AGGATCATGG CCTTACCCTT GAAGCATTAC CGAGAAGGAG AACAGAGATG GGCTTGAAGA 660 
GCCACGTGCT GCCGGCTCCA AATTCCCAAG GACAAGGATC CCTCTGCATT TTTGTCTATG 720 
TAACCTCTTA TATGGACTAC ATTCAGCTGC AAGGAAAGGA AAACCTTGAT TGCAGTGGTT 780 
TAAACAAACA GAAGATTGTT TTTCCACATA GCATGGATTC TGGAGATGGG TGGCTAATGG 840 
TATTGGTTCA ACAACTCCAC GGAGGTAGGG GTCACGTCTT GGATCCTTTT GCCTTAATCT 900 
CAGTGCTCGT TACTTCATGG TCCCAAGATG GCTGCTGTAT CCCCAAGAAT CATGTCTGCG 960 
TTCAAGGAAG GAGGGGTGGA GGAAGAGGAA GGGCCAAACT AGCTGGACCC GTCACCTTCT 1020 
ATCAGAAAGT AAAACCTCGT CAGAAGTCTG TTTCCTGCTC TCTCCCTCTG CATATCTTCA 1080 
CTTAGATGCC CTTGGCCCGA GCCAGCTACC ATTGCACCTC TAGCTGCAAA CAAAGCTAAG 1140 
ACAGCAGGGA ACAGAATTGT CATGGCTGAA TAGACCAATC GTGTTCCATC TACTGAGACT 1200 
GGCACACTGC CTCCTGCAAT AAAACTGGGA TCCCATTACC AAGAGAGAAA TGCAGAATTG 1260 
TGTACCAGTT AGCTTTTGCT GTGTAACAAA CCATCCCCAA ACTTGGCAGC TAGAAACAAA 1320 
CCCTGTATTT TCCCACAATC CTATGGGTTG GCAATTTGGG CTGGGCTCAA CAGGGCAGTT 1380 
CTGCTGCTCA CACCTGGGAT CCCTCATGGA GCTAAGGTCA GCTGTTACCT CAGCTGGGCC 1440 
TGGATGGTCT AGGATAGCCT TACTCACTTG CCTGGCAGGT GACAGGCTGT TGGCTGGAAT 1500 
TGCTTGGTTC TCCTCCATGT GGCCTCTCCA GCAGGCTAGC TCAGGCTTAT TCACATGATG 1560 
GCTTCAGGAT TCCAAAGAGA GTGAGAGTAG AAGCTGAAAG ACTTCTTGAG TTCTTGGCCT 1620 
GGAACTGGGA CTAGGACAGT GTCACTTCTG CTAAGTTCTT TTGGTCAGAG CAAATCACAA 1680 
GGCTTTACCC AGATTCAAGG GATGAGAAAC AGACTACATG TCTTGATGAG GGGAACCACA 1740 
AAGAGCTTGT GGCCATTTTT CACCTATCAC AAATAATTTT GGATGGGTAT TTATTTGGAT 1800 
AAAGGTATTT CCCTCTTCCC CCTTTCTCTC TGTCTCATGG GGCCTCACTC TGCCAAGTTG 1860 
GAAGGCACTA AGACATTGTC CTGGCCCTCA GGGTCTAGGG GAAGAGGTGT TGGGGCAGGA 1920 
AGTGAGTCTC TCCATGGGCT GGACCCACTG TAGTAGGAGT GCCTCCTTGT CTGCACTGCT 1980 
GGTATGGGGT TAGGCCAGGT AGGACATTCC AGAGGGGCTT CTGAAAACCA AGAGTCCCTG 2040 
GGGAAAGGGA ACAGAGTAAG GCAGGCCTTG TTCTCACTGC CCTCTAAGGG AACTTGGTCA 2100 
CTCGGCACTT TTAAGCCTCA GTTTCTCCAG TTCAATAATA AGGACAAGAG CTTTTCCCAT 2160 
GCATTCTCTT TCCCCGGGAA AGTTGACTGA GGTGACCAGT AATAGAATTG AAAAGGGAGA 2220 
GTGTCTTCAG TGCAATGTGG CATCCTGGAT TGGGTGTTGG AACAAAAACA GGACATTAGT 2280 
GGGAAAATTG GAAATCTGAA AAAAGTCTGA ATTTTAGTTA ATATACCAAT TTCAGTCTCT 2340 
TGGTTTTGAC AGATGTACCA TGGTGATGTA AGATGTTGAC CTTGGGGTAG GCTGGGTGAA 2400 
GGGTATACAG GAACTCTTTG TACTATCTCT GCAAJCTTCTC TGTAAATCTA GTATCATTCC 2460 
AAAATAAAAG TTTATTTAAT TTAAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 224 Protein sequence: 
Protein Accession #: AAH17001.1 

1 11 21 31 41 51 

| | | | | | 

TLGRAGAGRG APEGPGPSGG AQGGSIHSGR IAAVHNVPLS VLIRPLPSVL DPAKVQSLVD 60 
TIREDPDSVP PIDVLWIKGA QGGDYFYSPG GCHRYAAYQQ LQRETIPAKL VQSTLSDLRV 120 
YLGASTPDLQ 

Seq ID NO: 225 DNA sequence 
Nucleic Acid Accession #: NM_021048 
Coding sequence: 1..1110 



1 11 21 31 41 51 
| | | | | | 

ATGCCTCGAG CTCCAAAGCG TCAGCGCTGC ATGCCTGAAG AAGATCTTCA ATCCCAAAGT 60 
GAGACACAGG GCCTCGAGGG TGCACAGGCT CCCCTGGCTG TGGAGGAGGA TGCTTCATCA 120 
TCCACTTCCA CCAGCTCCTC TTTTCCATCC TCTTTTCCCT CCTCCTCCTC TTCCTCCTCC 180 
TCCTCCTGCT ATCCTCTAAT ACCAAGCACC CCAGAGGAGG TTTCTGCTGA TGATGAGACA 240 
CCAAATCCTC CCCAGAGTGC TCAGATAGCC TGCTCCTCCC CCTCGGTCGT TGCTTCCCTT 300 
CCATTAGATC AATCTGATGA GGGCTCCAGC AGCCAAAAGG AGGAGAGTCC AAGCACCCTA 360 
CAGGTCCTGC CAGACAGTGA GTCTTTACCC AGAAGTGAGA TAGATGAAAA GGTGACTGAT 420 
TTGGTGCAGT TTCTGCTCTT CAAGTATCAA ATGAAGGAGC CGATCACAAA GGCAGAAATA 480 
CTGGAGAGTG TCATAAAAAA TTATGAAGAC CACTTCCCTT TGTTGTTTAG TGAAGCCTCC 540 
GAGTGCATGC TGCTGGTCTT TGGCATTGAT GTAAAGGAAG TGGATCCCAC TGGCCACTCC 600 
TTTGTCCTTG TCACCTCCCT GGGCCTCACC TATGATGGGA TGCTGAGTGA TGTCCAGAGC 660 
ATGCCCAAGA CTGGCATTCT CATACTTATC CTAAGCATAA TCTTCATAGA GGGCTACTGC 720 
ACCCCTGAGG AGGTCATCTG GGAAGCACTG AATATGATGG GGCTGTATGA TGGGATGGAG 780 
CACCTCATTT ATGGGGAGCC CAGGAAGCTG CTCACCCAAG ATTGGGTGCA GGAAAACTAC 840 
CTGGAGTACC GGCAGGTGCC TGGCAGTGAT CCTGCACGGT ATGAGTTTCT GTGGGGTCCA 900 
AGGGCTCATG CTGAAATTAG GAAGATGAGT CTCCTGAAAT TTTTGGCCAA GGTAAATGGG 960 
AGTGATCCAA GATCCTTCCC ACTGTGGTAT GAGGAGGCTT TGAAAGATGA GGAAGAGAGA 1020 
GCCCAGGACA GAATTGCCAC CACAGATGAT ACTACTGCCA TGGCCAGTGC AAGTTCTAGC 1080 
GCTACAGGTA GCTTCTCCTA CCCTGAATAA 



Seq ID NO: 226 Protein sequence: 
Protein Accession #: NP_066386 

1 11 21 31 41 51 
| | | | | | 

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PLAVEEDASS STSTSSSFPS SFPSSSSSSS 60 
SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSVVASL PLDQSDEGSS SQKEESPSTL 120 
QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS 180 
ECMLLVFGID VKEVDPTGHS FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC 240 
TPEEVIWEAL NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 300 
RAHAEIRKMS LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 360 
ATGSFSYPE 






Seq ID NO: 227 DNA sequence 

Nucleic Acid Accession #: NM_005025.1 

Coding sequence: 82-1314 



GCGGAGCACA 
GAGGCTTGAA 
AGTATGGCTA 
TATAATCGTC 
GCTCTTGCAA 
CACTCAATGG 
TCAAACATGG 
GTGCAAAATG 
GCAGCAGTAA 
TGGGTGGAGA 
GCTGCCACTT 
TTTAGGCCTG 
ATTCCAATGA 
GAAGCTGGTG 
ATGCTGGTGC 
CAGCTGGTTG 
AGGTTCACAG 
GAAATTTTCA 
TCCAAAGCAA 
GTCTCAGGAA 
CATCCATTTT 
GTCATGCATC 
TTATTTGAAT 
TAGGATTTGT 
AATATATGTA 
TGTTATGTCA 



11 
I 

GTCCGCCGAG 
ACTGTTACAA 
CAGGGGCCAC 
TTAGAGCCAC 
TGGGAATGAT 
GATATGACAG 
TAACTGCTAA 
GATTTCATGT 
ATCATGTGGA 
ATAACACAAA 
ATCTGGCCCT 
AAAATACTAG 
TGTATCAGCA 
GTATCTACCA 
TGTCCAGACA 
AAGAATGGGC 
TGGAACAGGA 
TCAAAGATGC 
TTCACAAGTC 
TGATTGCAAT 
TCTTTCTTAT 
CTGAAACAAT 
AACAAGGAAA 
GTTTTACAGT 
AATTATAAGT 
TTGTGTTTGT 



21 
I 

CACAAGCTCC 
TATGGCTTTC 
TTTCCCTGAG 
TGGTGAAGAT 
GGAACTTGGG 
CCTAAAAAAT 
AGAGAGCCAA 
CAATGAGGAG 
CTTCAGTCAA 
CAATCTGGTG 
CATTAATGCT 
AACCTTTTCT 
AGGAGAATTT 
AGTCCTAGAA 
GGAAGTTCCT 
AAACTCTGTG 
AATTGATTTA 
AAATTTGACA 
CTTCCTAGAG 
TAGTAGGATG 
CAGAAACAGG 
GAACACAAGT 
ACAGTAACTA 
ATATCTTAAG 
AACTTGTCAA 



31 
I 

AGCATCCCGT 
CTTGGACTCT 
GAAGCCATTG 
GAAAATATTC 
GCCCAAGGAT 
GGTGAAGAAT 
TATGTGATGA 
TTTTTGCAAA 
AATGTAGCCG 
AAAGATTTGG 
GTCTATTTCA 
TTCACTAAAG 
TATTATGGGG 
ATACCATATG 
CTTGCTACTC 
AAGAAGCAAA 
AAAGATGTTT 
GGCCTCTCTG 
GTTAATGAAG 
GCTGTGCTGT 
AGAACTGGTA 
GGACATGATT 
AGCACATTAT 
ATAATATTTA 
GGAATGTTAT 
TTAAAATAAA 



41 
I 

CAGGGGTTGC 
TCTCTTTGCT 
CTGACTTGTC 
TCTTCTCTCC 
CTACCCAGAA 
TTTCTTTCTT 
AAATTGCCAA 
TGATGAAAAA 
TGGCCAACTA 
TATCCCCAAG 
AGGGGAACTG 
ATGATGAAAG 
AATTTAGTGA 
AAGGAGATGA 
TGGAGCCATT 
AAGTAGAAGT 
TGAAGGCTCT 
ATAATAAGGA 
AAGGCTCAGA 
ATCCTCAAGT 
CAATTCTATT 
TCGAAGAACT 
GTTTGCAACT 
AAATAGTTCC 
CAGTATTAAG 
AGTACCTATT 



Seq ID NO: 228 Protein sequence: 
Protein Accession #: NP_005016.1 



MAFLGLFSLL 
ELGAQGSTQK 
NEEFLQMMKK 
MAVYFKGNW 
VLEIPYEGDE 
IDLKDVLKAL 
SRMAVLYPQV 



11 
I 

VLQSMATGAT 
EIRHSMGYDS 
YFNAAVNHVD 
KSQFRPENTR 
ISMMLVLSRQ 
GITEIPIKDA 
IVDHPFFFLI 



21 
I 

FPEEAIADLS 
LKNGEEFSFL 
FSQNVAVANY 
TFSFTKDDES 
EVPLATLEPL 
NLTGLSDNKE 
RNRRTGTILF 



31 
I 

VNMYNRLRAT 
KEFSNMVTAK 
INKWVENNTN 
EVQIPMMYQQ 
VKAQLVEEWA 
IFLSKAIHKS 
MGRVMHPETM 



41 
I 

GEDENILFSP 
ESQYVMKIAN 
NLVKDLVSPR 
GEFYYGEFSD 
NSVKKQKVEV 
FLEVNBEGSE 



Seq ID NO: 229 DNA sequence 
Nucleic Acid Accession #: NM_003695 
Coding sequence: 12-398 



1 
I 

OGACATCAGA 
CAGCCCTTAC 
TCTGCCCGGC 
ATCTGGTGAA 
TCAGCAGCGG 
ACAACGCTGC 
TGAGCCTCCT 
TCATGCCTTT 
GGGTGCCAGG 
CATGGAATGC 
ACAGAGGATG 
GATTTCACAC 
TAAATGATTT 



11 
I 

GATGAGGACA 
CCTGCGCTGC 
CAGCTCTCGC 
GAAGGACTGT 
CACCAGCTCC 
ACCCACCCGC 
GGCCGTCATC 
CCTTCCCTTT 
AGCCCCAGGC 
TGATGACTTG 
CAGCCCCCAG 
TCCTTCTGTT 
AAACC 



21 
I 

GCATTGCTCC 
CACGTGTGCA 
TTCTGCAAGA 
GCGGAGTCGT 
ACCCAGTGCT 
ACCGCCCTCG 
TTAGCCCCCA 
CTCTGGGGAT 
TGAGGGCTTC 
GAGCAGGCCC 
CTGCATGGAA 
TTGTTGCCGT 



31 

1 

TCCTTGCAGC 
CCAGCTCCAG 
CCACGAACAC 
GCACACCCAG 
GCCAGGAGGA 
CCCACAGTGC 
GCCTGTGACC 
TCCACACCTC 
CCCGAAAGTC 
CACAGACCCC 
GGTGGAGGAC 
TTATTTTGTA 



41 

I 

CCTGGCTGTG 
CAACTGCAAG 
AGTGGAGCCT 
CTACACCCTG 
CCTGTGCAAT 
CCTCAGCCTG 
TTCCCCCCAG 
TCTTCCCCAG 
TGGGACCAGG 
ACAGAGGATG 
AGAAGCCCTG 
CTCAAATCTC 



51 
I 

AGGTGTGTGG 
GGTTCTGCAA 
AGTGAATATG 
ATTGAGTATT 
AGAAATCCGC 
GAAGGAGTTT 
TTCCTTGTTT 
ATATTTTAAT 
CATCAATAAG 
GGATTTTGAT 
GAAGTCGCAG 
TGAAGTCCAA 
TGGCTCCAAT 
AATAAGCATG 
AGTCAAAGCA 
ATACCTGCCC 
TGGAATAACT 
GATTTTTCTT 
AGCTGCTGCT 
TATTGTCGAC 
CATGGGACGA 
TTAAGTTACT 
GGTATATATT 
AGATAAAAAC 
CTAATGGTCC 
GAACATGTG 



51 
I 

LSXALAMGMM 
SLFVQNGFHV 
DFDAATYLAL 
GSNEAGGIYQ 
YLPRFTVEQE 
AAAVSGMIAI 



51 
I 

GCTACAGGGC 
CATTCTGTGG 
CTGAGGGGGA 
CAAGGCCAGG 
GAGAAGCTGC 



GGAAGGCCCC 
CCGGCAACGG 
TCCAGGTGGG 
AAGCCACCCC 
TGGATCCCCG 
TACATGGAGA 



Seq ID NO: 230 Protein sequence: 
Protein Accession #: NP_003686 



1 11 21 31 41 51 

| | | | | | 

MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSVVCPA SSRFCKTTNT VEPLRGNLVK 
KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSLGIALSLL 
AVIIAPSL 



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 126-752 



60 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
CCGGGCAGGT 
AGGGGCGCAG 
AGAAGATGAA 
GTGTGAGGGA 
GGAGAACTCG 
TCTCTCTTGA 
GAAAGTACCA 
ACCCAGTGGA 
CCOGTGTGGC 
ACGAGTCTTC 
AAGTTGGGCC 
TCATCCTGTC 
TTCAGGATGG 
CAAGAGTTCA 
TTCACTGCAC 
TGACTCAGTT 
ACCAGCAACC 
AGGGAGGGGA 
TTATTTCCTG 
TCTGGTATTA 
GGAATTCAGC 
AGCTCTGACT 
GGCTTCACAT 
GTGGTTTTTT 
CAGAAGGATG 
GGTAATTCCT 
TGGGGAGCTT 
TCCTAGAGAA 
TGCTGTCAGC 
TTCTACTGTG 
TGTATAACTC 
GTGTTTGTGA 



GGCTCATGCT 
GAATTCTGAT 
GGATATCGAC 
GAGAACCAGC 
ACCGTTGGAA 
TGCCTCCATG 
TCATGGCTTG 
CAATGCTGGG 
CCACAAGAAG 
TGACGTGAAC 
AGACGCTGCT 
CATOGTGTGC 
CTGTATTCTG 
GCCTTCCTTT 
CGCCATCTTA 
CCACATTTTG 
CGGCTGTATA 
GAGGTGGAAT 
GGACTTGGCA 
ATTTAATCTC 
GTAGCTACCT 
TACAGCTGCA 
CAATTTTTTT 
GTTTTATTTT 
CCACCAGTTT 
CACAACTTAG 

TCTACTCOGT 
CTGATATACT 
TGTCGTTAAA 
GACTCTGTAT 
GATGCTCTTT 



21 

I 

CGGGAGCGTG 
GTGAAACTAA 
ATAGGAAAAG 
ACTTCTGGGA 
TGCCAAGATG 
CATTCTCAGC 
AGTGCTCTGA 
CTTTTTTCCT 
GGGGAGCTCT 
TGCAGAAGAC 
TCCCTGOGAA 
CTGATGATCA 
CGGTCAGAAT 
GGAGACTGCT 
CTGAGTTGCT 
GATTGCATAC 
CAGTGGTGAC 
ACAGTAGTTC 
AAAATCTGAT 
AGGAAAAACA 
CCAGACCGTG 
GTCACCTTTG 
CTTCCTTTAG 
TGTCAAGATT 
TTCCTTGAGG 
TGCACCAGTA 
TTTGTTTTTA 
TGCAGAATCA 
ACTTTGGACT 
CAACCGTCGG 
GTTTCAATGT 
GAAGATGGTA 



31 
I 

GTTGAGCGGC 
CAGTCTGTGA 
AGTATATCAT 
CGCACAGAGA 
CCTTGGAAAC 
TCAGAATCCT 
AGCCCATCCG 
GTATGACTTT 
CAATGGAAGA 
TAGAGAGACT 
GGGTTGTGTG 
CGCAGCTGGC 
GAGAGAGTCA 
CCATCAGTGC 
TCACGTGAGG 
TGGAAAAGAA 
CCAAGCAATG 
TTGGAATCTG 
TGGTGGGGAT 
AGAAATTAAC 
GTGTCTGGCC 
CTATAAGGCA 
GGTGGGGGAT 
GATTTTTAGA 
CCTAGGATTT 
GCACCAGCCA 
ATTCTCTTTC 
TTGCAACCTC 
CTGGAAACAG 
AGACCAGATG 
ATGTTACTGC 
CTTTTATATT 



41 
I 

TGGCGOGGTT 
GCCCTGGAAC 
CCCCAGTCCT 
CCGTGAAGAT 
AGCAGCCCGA 
GGATGAGGAG 
GACTACTTCC 
TTCGTGGCTT 
CGTGTGGTCT 
GTGGCAAGAA 
GATCTTCTGC 
TGGCTTCAGT 
AGCTGGGCAG 
CGAGGTGTGT 
AAAAGGGGGC 
GCCAATCTTC 
GATATAAACC 
AAGTCTCCTA 
CTCCTAGGAC 
CCAGAGAGAG 
TCCATTTTTG 
CCTGGGTAGA 
TGGTTTGGCT 
TGCAAGGACT 
TTTATTCTGT 
TTTTGAGCAG 
CTTAGCAGCA 
AGGAGCCCTC 
ATATGGGTTC 
ACCTGTTAGA 
AATGCTTCAC 
T 



51 

I 

GTCCTGGAGC 
CTCCACTCAG 
GGGTATAGAA 
TCCAAGTTCA 
GCCGAGGGCC 
CATCCCAAGG 
AAACACCAGC 
TCTTCTCTGG 
CTGTCCAAGC 
GAGCTGAATG 
CGCACCAGGC 
GGACCAAATT 
AATCTCTCGC 
GGGAACAGGC 
TTTGGCCCTG 
TTGCTAGTAA 
TAAAAATCTG 
TTTGATCAGG 
CTAGTGGACA 
TCTGGGTTTT 
TCTGTCATTC 
AGGGTGGATG 
TTCTTTTGTT 
TGAAAAGACC 
CCCGAGCAGA 
AGTACCTCTT 
AGGTCTTTTT 
ACTGATTGAG 
TATTCTCTAT 
TGGCTAGTCC 
CTGCTGTACA 






Seq ID NO: 232 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

| | | | | | 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLEGQDAL ETAARAEGLS 
LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSIAR 
VAHKKGELSM EDVWSLSKHE SSDVWCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 
LSIVCLMITQ LAGPSGPNPQ DGCILRSE 

Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TTTTAATGGT 
CACAACATGA 
TGTTTCTTAG 
TAGTAAACTT 
TTTGTTTCCA 
TGTTTGACTG 
TTGTTATGGT 
AAGCTATTTT 
TATATTATGG 
ATTAGCCTAC 
AATAGGTTAA 
AATTAAAGTA 
TATGAAAGCT 
TATATATTTT 
TATCCTGTAT 
TAGAAGACCC 
TTTTTTTTAG 
GATTCTGTTA 
TCCAGATAAA 
CTTAGGCTCT 
CCATAGTATC 
ACTATAATAG 
TATCAATGTT 
TACATAAAAA 
AAGTCATTAA 
AGAATCATCT 
GGAAGATGGA 
GAGTTGAAGA 
CTTTTAAAAC 
TATTCTTCTA 
AGCCCTTGTG 
CTGTTCAGGT 
TGAGATGGAG 
ATCATATGGA 
TTTTAATTCT 
CAATCCCCAA 
TTTCTGGAAT 



11 
I 

GCTCATATAT 
ATCACATAAT 
ATCGTAGTCA 
CTCTTTCATC 
AAGTCACAAT 



ATATCTTTTT 
AATCATCAAG 
ATTAACCAGA 
TATAGTATTA 
AAAGTAGTTA 
CATTTTAAAT 
CTGGCTATCA 
TTAGTTCCTT 
TTTTTTTAAG 
ACTCTTACTA 
GTAGTTTAAA 
AGCATCCAAA 
AGATGGAGAA 
CTCCATGTAT 
AAGTGGAGGG 
AAGTTTGAGT 
ATCCAATGAT 
GTGCTCATGT 
TAATTTAATA 
GAGGACTTTT 
GCCATGCTTG 
CCACTGCTCT 
TAGTAATGTA 
ATAACTAGCA 
TAAGATTATT 
GAGAAAACAT 
GAGGTGGGCA 
AGTTTGGAAA 
CACTGTCTCA 
GGGGCAGTG7 
TGCACCTACA 



21 
I 

ACTGTATTTT 
CATGATTTTT 
TTGAGAAGTC 
TTTGTGTTAG 
TGAATTATTC 
AACGTGACAG 
ATTAAATATT 
TATGGAAAAC 
ATTGTATCAT 
ATAAATTTTT 
CAAATTAAAC 
GAGCTTTATA 
TCCTGGGATA 
TGAGATAACT 
AATTGTTTTA 
GGTTCCCTAA 
GCAAGCACTG 
AACAATGCCT 
TACCTCATGT 
CTTTCTTAAG 
TAGTTCAGAA 
AATATTTTAA 
TTTTATTAAA 
ATTTGAATTT 
ATTGTTTTAA 
AATATGGAAT 
TTTTTCCAAA 
AAATTAGTGC 
CCCAGTTAAG 
TTTATTACAT 
ATTTCTTCTC 
AATGGATTTT 
CATTTAAGGT 
AGAGAGCTTA 
AAAGAGAATC 
TACCTTACTC 
TGTTTTCTTA 



31 
I 

TTGTTGTTTA 
TTTTTTTACT 
CCAATAACTC 
CTCTGTAGTC 
TTAGATACCT 
TTTATTTTCA 
TATTTTGACT 
AAATTACTAT 
TTTTGGCCTA 
CAGTTGGTTT 
TTACTAATTT 
ATACCTTAAA 
GTAATTTCTA 
AATTTCTAAT 
TAAATAGGTC 
GGATCTGCCA 
ATACCAGTGG 
AATTTCAGTT 
ACTGTGACTT 
GAAAAGTTTC 
AAGTTAATAG 
TAAATTTATA 
AAATTACCTT 
TAAATAATTT 
ATCAGTGGTT 
CCACCTCATA 
AGCTCTTTGA 
AGGAAAATGC 
TTTTGATGGT 
GAAATTTAAG 
TATAACTTCA 
TTTTTTTTTC 
CAGTTCACTA 
TCACAGGTTT 
AGCTCTCCAG 
CTTCACTGCT 
TTAACATTCA 



41 

I 

GTTTTACTTA 
TTTACTCCCC 
TAAACTTTTG 
TTAACCTGGA 
TAAGCCACTG 
AACACTAACT 
AAGCTTTCAT 
TGCATTTTCC 
ATGTCTGGAT 
GGGCAAATTT 
ATACCTGATT 
AAGTTGGTTC 
ATTATATAGT 
TATATATGTT 
ATAAGATACA 
TAGATTTTTT 
GAGTTGGTCT 
CTTAGGTTAT 
GAAAATGAAT 
TGAGTGTGAT 
GAAATCTTTT 
TAATTCAAAT 
ATTATTAGAA 
ATTTAAATCA 
TTCAACCCTC 
ACAATTAAGT 
GTGATTCTAA 
TTTTATTTCT 
TTAAATTCCA 
AGTTTAAGTT 
AAATAGATAT 
CTCTGGAGCT 
ACCTATGGTT 
GTATGCTGGT 
CAGTTCTAGA 
TCTTAGAAGG 
GAATTGGGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



60 
120 
180 



51 
I 

TTGAGAGTGT 
AAATTATTCA 
AGTTATAACG 
TTTTAATTTT 
AATTCAGTTC 
TCTTGATATT 
AAAATATTTG 
TATATATGCA 
ATAAAAGATA 
AAACCTGAAA 
TTTTTTCTTG 
TAATTTAAAA 
ATTTCAAAAC 
TCAAAAACCA 
AGGTCTGCAT 
TTTTTTTTTT 
TGATCTAGGA 
GGCTTGTGAC 
TCTTAAAATT 
CTCTCTTTTG 
GTGACAGCAG 
GATAAAAATG 
CTGTGCCTAT 
AGACCACCAT 
ACTTCATATT 
CTAAATTTCT 
TTTGTAGTCA 
CCCATGTTAA 
CTAAAGAACA 
CCATCAAACT 
TTCATTCAAA 
GCCTGTTCAG 
CAGAGTTCTG 
GAATGGATAG 
AAAGCTTTGA 
TAGAATTAAG 
TATTAATTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 




TCCAGTGAGT AGTTTTCTGA AATTGGTAAC TTGGAGAGTA 
CAATTTTGTG TTTGTTTACT TTTATGTAAA AATTTGATAT 
AAAACCTCAT GCCTTTTCAT TACATCTAAT TTGAACTCTC 
TTTAAAGATG CTTTAATGAA AAGTATTAAG AAAATATATA 
CTTCAGAAAT CCATATATTT GTCATATTTA TTTTTTTAGA 
AGATGGTATT TAAAATGAAT GCCCAAAAAT ATCTTGTACC 
TTGGAAGCCG CCAGCCATTC ATGTAGAGAG TTTATAAGAA 
ATTTTATATT ACTATGGTAT CTGTGTACCA TATTTCTAAG 
CTTCTTAAAA CCATAACCTG GCTTGCCTTT TAGTGTTAAA 
ATAGAGATTC TTCTTTTATG AAGAAGAGCT GACGTAATTT 
AAGACATTAA CATAAGTCTC TGAGCAGTGA TACATTTTCA 
CCACATTAAA CAACCACGGC AACACTCAGA CTTGGCACTT 
TGTGCCTGGT ATCGCCTCTG GCATAACTTA CACGAATCGT 
CCTTCATCAA GCAC7TGCCA ACACATTCAC CTCTAACTTG 
ACAACATCTG CAACTCTACC CTATCAACTG CCAACCTAAA 
CCCCAAACAC AAAACCACTA AATCATAACC ACCACACACG 
CACACAACCA ACACACCACG ACCAAACACC CCACCACAAA 
GACAACACAT CACATACACT CACTACCCCC CCATACTCCC 

Seq ID NO: 234 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 27-281 






AAATAACGTA 
GTGAATTACA 
AACTTCAGTG 
GATTTGTATG 
AACCTCCTAA 
TTTGTCCAAA 
AATAATTTAA 
TATTCATTAT 
CACAAAATCC 
ATTACCAGTG 
AACATGAAGA 
TCCTACGAAT 
CCTCCCTACT 
TACAACCTTA 
GACCCCCAAC 
CCACACACCA 
CAAGCTAACA 
ACCCACCA 



AGCAGGAGGA 
GTCTACCGCG 
TTCCTGCCCC 
GGCGCTCACT 
CAGGGTTCCG 
GAGTTTTTCT 
CAGAAAGAAT 
ATGTTGCAGG 
TTTTCTTCTC 
ATAAAAACTG 
GATGCCAGGA 
TCAAGCCAAG 
CTCAAACCCG 
GGTCAGAACA 
AGACAGCCTG 
TCTCAGTATC 
CATAAACACA 
GGAAAGGTCT 
TCTTTTGATG 
GCAAAGAAAA 
TTTATTTTTA 
TTTGACCTTG 
GATTAAAACA 
AAATCTCAAG 
AAAAAAAAGC 
GCACTTCTCT 
TCCCATCAAA 
ATAATCCC 



11 

1 

GAGCTGGCGG 
TAGCAGTTAC 
GTGCTCATTT 
GAAACAGTGT 
ACCAATCCAA 
TTGCTCTGAT 
GCAAGGAGAT 
AAGAGCTAGT 
CACTTGGCAT 
TTCAGCGGTT 
GGAAAGATGC 
CCAACAGGTG 
GGGAAGCCCA 
CAGCTAAGCA 
TGACGTTTCA 
TTACGCCCAG 
TAACAGCAGC 
CCTGTGACTG 
ACTCTATATC 
ATAAAAGACA 
AACTGAATTC 
AAATAATCTT 
TATTAGTAAT 
GCTTTTAAAG 
CCTCCATCTG 
TCTCATTTTC 
GCCAAAGAAA 



21 
1 

GAAGACATGC 
ATCAGACTGA 
GGGGCTGACG 
GTTGCTCCAC 
GAGCCTTGCA 
CTTGGAGACA 
AGACCAACGT 
CTTTCAGGCT 
ATCAAGAGCC 
CGCCAACAAG 
CAGGGGTAAA 
TTCTGTTTTT 
CTCTAGAACC 
GATGGCTTGG 
AAAGCAAAAG 
TGACACGATC 
AGCAATAATT 
TTTTATTTTT 
CAACTCTGAG 
ATTTCCAGTA 
AGCAGAGATT 
TACATTGTAA 
TAATTATTAA 
CATTTGTACA 
ATTCTCATTT 
CACTGTCTCG 
GAAAAGAAAA 



31 
I 

ACCCCTTGAA 
GACACTTCCT 
CCATTTTAGG 
ACCGCCTTGT 
GAAAGCATTA 
TCCCTCTGCC 
GAGATTCTCC 
GGGCTGGTGA 
AGGCGTGGAA 
AAGTGGTAAA 
GTGGGAAAAT 
CATCACAGAA 
CATGCTGGTC 
GTCATCAGGA 
TCCCCTACCA 
TACCCTCAAA 
AAAGATGAGA 
AGGGAAACAG 
GTTTGATTAA 
AGTATGCCAG 
TACATGCATT 
ATTCTTAATG 
AGGAGAATAA 
AATGACTGGA 
TCATTGTCAG 
CAAGCTAGAA 
TTGTTCTGTA 



41 

I 

GACCCAGAGA 
GTTTACAGGA 
CCTCAGCCCA 
TTTGCTTGTT 
ACGTGCTTTT 
TAGTGGAAAC 
TTCATGCACT 
CCTGAGAAAG 
GACTAAAACA 
GTAGCAAAAA 
GGGAACCTGA 
CTAATAAGTG 
ATCCATATCC 
CGTCCATTAC 
GCCAGTGAAG 
ACTTAAAAAA 
TGAGAACAAT 
AGAGGAAGAA 
AGAAATGACC 
TTCGAATTAA 
ACGATGATTA 
ATCAAAACAA 
TTGCAAATAC 
CATTTTTTAA 
TGCAACAACA 
ATTCTCACGA 
CAGATATATG 



TTTTGCTTTT 
CAGTTCTAAT 
CCAGAAGTGC 
TCAGTTTATA 
TTGGATAACT 
AGTTTATCTG 
AATTGTATGC 
TAAATTGGTA 
AACATTGTAT 
CATCTGCACA 
GTGACAACCA 
CCATCCTATA 
TGTCTACGCT 
CCAACTCACC 
ACAACACAAC 
CACACCCACC 
ACCACAAACA 



51 
I 

GAGGCCGTCT 
GACTATAAAA 
TCTGCACCCA 
GGCGCGCTCT 
CTCTTTGGCA 
ATAAGGAATA 
CAAGAGAAAG 
AATGTCCAGC 
GGAAATGTTT 
TGGGGATGGA 
AGCCAGGAGG 
GTGCTGAGGA 
CCAAGGCCCT 
ATCCAAAGGA 
CTACCTGATT 
AAAAGGGAAA 
TAAGAAAAAA 
GAATGATTTT 
TTGAACCAGA 
TGATTTACTT 
ACATCTGAAA 
GGTTCTCAGT 
AACATTCCTA 
ATTTGAAAAA 
AAAAAGGTAT 
CTACCTTTGA 
ACATTAAAAA 



Seq ID NO: 235 Protein sequence: 
Protein Accession #: Eos sequence 



11 



21 



31 



41 



CCACAATAGG 
ACAGGATCAG 
AGTCCTTTCT 
ACCACCCTGA 
GGCCAGGCCA 
CGTCGCAGCT 
CCAGCCAGAG 
CAACTGCGTC 
GCTGACGTTA 
CGGACGCGGC 
GATTCTAAGC 
ACCACCAACA 
GCCCCATCAG 
CTCAAATCCC 
CTCTCCTGCT 
TGTGCCTTGT 
GACTGCATGA 
GCCAGTGCCA 



11 
I 

GGCAGACCTG 
ACCCAGAGGC 
AATCTCAGCT 
GCTGAGGAGC 
GGCCAGCTCC 
GAGGGAGTAA 
CCCAAGAGCC 
AGGAAGCGGA 
CTCTGGCAGA 
GGACGTTAAG 
TGCTGGTAAG 
AGGTGCACGC 
GGAACTTTGT 
GTGAGGGCAA 
GCCGCTTCCT 
GGGACATTGA 
GCCTGGCTGT 
AGCTCTGGGA 



21 
I 

TCCATCCTTC 
AGCTGGTTGG 
CCTGCCTGTA 
ACAGTTTGAG 
TCTGGCAGCA 
GGAGGCTCCC 
AGAGTGACCC 
GCAGCTCAAG 
GCTGGTGTCT 
GGGACACCTG 
TGCCTCGCAA 
CATCCCACTG 
GGCATGTGGG 
TGTCAAGGTC 
GGATGACAAC 
GACTGGGCAG 
GTCTCCTGAC 
TGTGCGAGAG 



31 
I 

TCTGTGGGTC 
GGTTTGTCGA 
CCCTCCCATA 
GCCCCCCCAA 
GAGCCTGGGC 
AGGAACCGGA 
CTCGACCTGT 
AAGCAGATTG 
GGCCTAGAGG 
GCCAAGATTT 
GATGGGAAGC 
CGCTCCTCCT 
GGGCTGGACA 
AGCCGGGAGC 
AATATTGTGA 
CAGAAGACTG 
TTCAATCTCT 
GGGACCTGCC 



41 

I - 

CCCTGTACCT 
GAAGAAGGAT 
CTCACCAAAC 
CCCCCCGCCG 
AGGTGACGGG 
GCTGGAAACC 
CAGCCATGGG 
CAGATGCCAG 
TGGTGGGACG 
ACGCCATGCA 
TGATCGTGTG 
GGGTCATGAC 
ACATGTGTTC 
TTTCTGCTCA 
CCAGCTCGGG 
TATTTGTGGG 
TCATTTCGGG 
GTCAGACTTT 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 



51 

1 ! I I 

MHPLKTQREA VCLPRSSYIR LRHFLFTGDY KIPAPCSFGA DAILGLSPSA PRRSLKQCVA 
PHRLVLLVGA LSGFRPIQBP CRKH 

Seq ID NO: 236 DNA sequence 
Nucleic Acid Accession #: NM_002075 
Coding sequence: 406..1428 



CGGCCGAGGT 
GGAGATGGAG 
GAAAGCCTGT 
AGTCCAGATG 
CTGGGCCACT 
GGACAGCTAC 
CTGTGCCTAT 
CATCTACAAC 
CACAGGTTAT 
GGACACCACG 
ACACACGGGT 
GGCCTGTGAT 
CACTGGCCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



51 
I 

TTCTCCCCCA 
TATCCAGATC 
CCTCTTCCCC 
GTCGGGGCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



GAGTCGGACA 
GATGACGCTT 
CACGAGAGCA 
TTCGCTGGCT 
GGCATCCTCT 
GCTGTGGCCA 
AAGGGAAGTG 
GGTGTTCTCT 
GGGAGCATGG 
CCATCTCCTC 
GGACAACCTG 
GCCCTAGGAT 
TTGGCCCTGT 
TTCTCCTTTT 
GT 



TCAACGCCAT 
CCTGCCGCTT 
TCATCTGCGG 
ACGACGACTT 
CTGGCCACGA 
CAGGTTCCTG 
GAAGGCAGTG 
TCTATATTCC 
GACTGTGCCT 
CCATGGCCTT 
CCCCTCCCCA 
TCCTCCCCCA 
GACTATGGCT 
TCTACCTTTT 



CTGTTTCTTC 
GTTTGACCTG 
CATCACGTCC 
CAACTGCAAT 
TAACAGGGTG 
GGACAGCTTC 
AACACACTCA 
GGGTGCCATT 
TTGGGAGGCA 
CCCTCCCCAC 
GCCCTTTGCA 
GAGCCACTAC 
CTGGCACCAC 
TTTCTCTCCT 



CCCAATGGAG 
CGGGCAGACC 
GTGGCCTTCT 
GTCTGGGACT 
AGCTGCCTGG 
CTCAAAATCT 
GCAGCCCCCT 
CCCACTAAGC 
GCATCAGGGA 
AGTCCTCACA 
GGCCCAGCAG 
CTTTGTCCAG 
TAGGGTCCTG 
AAGACACCTG 



AGGCCATCTG 
AGGAGCTGAT 
CCCTCAGTGG 
CCATGAAGTC 
GAGTCACAGC 
GGAACTGAGG 
GCCCGACCCC 
TTTCTCCTTT 
CACAGGGGCA 
GCCTCTCCCT 
ACTTGAGTCT 
GCCTGGGTGG 
GCCCTCTTCT 
CAATAAAGTG 



Seq ID NO: 237 Protein sequence: 
Protein Accession #: NP_002066 



1 
i 

MGEMEQLRQE 
MHWATDSKLL 
CSIYNIiKSRE 
VGHTGDCMSL 
ICTGSDDASC 
KSERVGILSG 



11 
I 

AEQLKKQIAD 
VSASQDGKLI 
GNVKVSRELS 
AVSPDFNLFI 
RLFDLRADQB 
HDNHVSCLGV 



21 

I 

ARKACADVTL 
VWDSYTTNKV 
AHTGYLSCCR 
SGACDASAKL 
LICFSHESII 
TADGMAVATG 



31 



41 



AELVSGLEW GRVQMRTRRT 
HAIPIiRSSWV MTCAYAPSGN 
PLDDNNIVTS SGDTTCALWD 
WDVREGTCRQ TFTGHESDIN 
CGITSVAFSL SGRLLFAGYD 
SWDSFLKIWN 



CACGGGCTCG 
CTGCTTCTCC 
COGCCTACTA 
TGAGCGTGTG 
TGACGGGATG 
AGGCTGGAGA 
ATCTCATTCA 
GAGGGCAGTG 
AAGAACTGCC 
TAATGAGCAA 
GAGGCCCCAG 
TATAGGGCGT 
TATTCATGCT 
TAGCACCCTG 



51 
I 

LRGHLAKIYA 
FVACGGLDNM 
IETGQQKTVF 
AICFFPNGEA 
DFNCNVWDSM 



Seq ID NO: 238 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TCCCAATGTG 
TACCATTTGC 
ACTGATTCTG 
TGCATTGACC 
TAAGTGACTG 
TTTTGACGAG 
CAAAAAGGGG 
AAGAAAGAAA 
GGATCAAGAA 
CAGAAGTGAA 
TAGAAAAGTT 
CAACTACTCA 
CTTGTGTTCC 
ACTGTTGTTT 
GACATGAAAG 
TAGGCTAAGT 
AAAATCCCAA 



11 
I 

TNGAACCTAC 
TTTTAAGGCA 
TGAATTATGA 
AGTGTGAAGC 
GAAAGCTGAA 
TATCGGGTGA 
AAAAAAAAGA 
AATAAAATAC 
GTTTGTGTAC 
TCAAAATATT 
TTTCTGTAAA 
ACTTTCCTAC 
AATAAAGCTT 
GCCAAGTCCT 
TTCATTGGGT 
TATAATACAC 
ATAAAA 



21 
I 

CATAAATTCT 
GATAATCCTC 
AATCTGAAAA 
ACAGTGGAAT 
GAATCACCGG 
CTTTGAGGTG 
GCAACCAAAG 
ACAATATGGA 
ACATAATCTC 
TCAAAATGCT 
AGTCAGATAG 
TGTAGCACAA 
CATTTACAAA 
AATATAGTTG 
TGCTAAAAAG 
TGTTTTAACA 



31 
I 

TTTCTTACNG 
CAAGTTTTCT 
GGAATTGGAA 
GAGAATGCGT 
CTTCAGTGAC 
GTCAAGAAAC 
AAAAAAAATC 
CGATGGAGAA 
ATTTTGAGAT 
GTCTTATGAA 
TAAATATTTT 
GAGTAGCTGT 
AACATGCCAT 
CTTAGCAAGT 
TATGTAGAAA 
ATTGTAAAAT 



41 

I 

GACAATCTTA 
AATGATATCT 
GTTGCTAAAA 
GCCCTGACAC 
ATGGAACCCA 
CACACTTTAA 
CATAAAATTG 
AAACAGTTAC 
ATATAACTAT 
ACTACAATAT 
AGGTTTTGCA 
GGTACTGTGC 
GQGCCATATT 
ATTGTGAGCT 
TTCAAAGGAA 
GTAAGAGAAA 



Seq ID NO: 239 DNA sequence 

Nucleic Acid Accession #: NM_001786.1 

Coding sequence: 130-1023 



GGGGGGGGGG 
GTTGTTGTAG 
TGACTAACTA 
GTGTATAAGG 
GAAAGTGAAG 
CTTCGTCATC 
CTCATCTTTG 
CAGTACATGG 
TTTTGTCACT 
GACAAAGGAA 
AGAGTATATA 
TCAGCTCGTT 
GCAACTAAGA 
AGAGCTTTGG 
AAGAATACAT 
GAAAATGGCT 
GGCAAAATGG 
TAGCTTTCTG 
AACTCTTGTC 
GTCTTCTAAT 
ATTCTGTAAA 



11 
I 

GGCACTTGGC 
CTGCCGCTGC 
TGGAAGATTA 
GTAGACACAA 
AGGAAGGGGT 
CAAATATAGT 
AGTTTCTTTC 
ATTCTTCACT 
CTAGAAGAGT 
CAATTAAACT 
CACATGAGGT 
ACTCAACTCC 
AACCACTTTT 
GCACTCCCAA 
TTCCCAAATG 
TGGATTTGCT 
CACTGAATCA 
ACAAAAAGTT 
TATTTTTGTC 
TTCAAAAATA 
TGTGAAAAAA 



21 
I 

TTCAAAGCTG 
GGCCGCCGCG 
TACCAAAATA 
AACTACAGGT 
TCCTAGTACT 
CAGTCTTCAG 
CATGGATCTG 
TGTTAAGAGT 
TCTTCACAGA 
GGCTGATTTT 
AGTAACACTC 
AGTTGACATT 
CCATGGGGAT 
TAATGAAGTG 
GAAACCAGGA 
CTCGAAAATG 
TCCATATTTT 
TCCATATGTT 
TTATATATAT 
TAACTTAAAA 
AAAAAAAAAA 



31 
I 

GCTCTTGGAA 
GAATAATAAG 
GAGAAAATTG 
CAAGTGGTAG 
GCAATTCGGG 
GATGTGCTTA 
AAGAAATACT 
TATTTATACC 
GACTTAAAAC 
GGCCTTGCCA 
TGGTACAGAT 
TGGAGTATAG 
TCAGAAATTG 
TGGCCAGAAG 
AGCCTAGCAT 
TTAATCTATG 
AATGATTTGG 
ATGTCAACAG 
TTCTTTGTTA 
ATGTAAATAT 
AAAAA 



41 • 
I 

ATTGAGCGGA 
CCGGGATCTA 
GAGAAGGTAC 
CCATGAAAAA 
AAATTTCTCT 
TGCAGGATTC 
TGGATTCTAT 
AAATCCTACA 
CTCAAAATCT 
GAGCTTTTGG 
CTCCAGAAGT 
GCACCATATT 
ATCAACTCTT 
TGGAATCTTT 
CCCATGTCAA 
ATCCAGCCAA 
ACAATCAGAT 
ATAGTTGTGT 
TCAAACTTCA 
TCTATATGAA 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



51 
I 

TNCTAANCAA 
GAAACTATTA 
ATCTATCATT 
CAAAGAAAAA 
GTGATTTGAT 
GAACAATGTC 
CACAGAAGAA 
ATTTCTTTAT 
TTTTGTCTTT 
TCTCACAGAT 
GTGTCTTTTG 
AAATAAATTG 
TGGCCTGTAC 
ATTTGAGGAA 
AATTAAAATT 
TTTACAAATA 



60 
120 
180 
240 
300 



51 
I 

GAGOGACGCG 
CCATACCCAT 
CTATGGAGTT 
AATCAGACTA 
ATTAAAGGAA 
CAGGTTATAT 
CCCTCCTGGT 
GGGGATTGTG 
CTTGATTGAT 
AATACCTATC 
ATTGCTGGGG 
TGCTGAACTA 
CAGGATTTTC 
ACAGGACTAT 
AAACTTGGAT 
ACGAATTTCT 
TAAGAAGATG 
TTTTATTGTT 
GCTGTACTTC 
TTTAAATATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



Seq ID NO: 240 Protein sequence: 
Protein Accession #: NP_001777.1 

1 11 21 31 41 51 

| | | | | | 

MEDYTKIEKI GEGTYGVVYK GRHKTTGQVV AMKKIRLESE EEGVPSTAIR EISLLKELRH 
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKSYLY QILQGIVFCH 



60 
120 




SRRVLHRDLK PQNLLIDDKG TIKLADFGLA RAFGIPIRVY THEVVTLWYR SPEVLLGSAR 
YSTPVDIWSI GTIFAELATK KPLFHGDSEI DQLFRIFRAL GTPNNEVWPE VESLQDYKNT 
FPKWKPGSLA SHVKNLDENG LDLLSKMLIY DPAKRISGKM ALNHPYFNDL DNQIKKM 



Seq ID NO: 241 DNA sequence 

Nucleic Acid Accession #: NM_033379.1 

Coding sequence: 132-854 






180 
240 



CGCCCGCGCG 
GCTTTGCAGA 
ATTGACTAAC 
TTGTGTATAA 
TAGAAAGTGA 
AACTTCGTCA 
ATCTCATCTT 
GTCAGTACAT 
TATTGCTGGG 
TTGCTGAACT 
TCAGGATTTT 
TACAGGACTA 
AAAACTTGGA 
AACGAATTTC 
TTAAGAAGAT 
TTTTTATTGT 
AGCTGTACTT 
ATTTAAATAT 



11 
I 

CGGGCTCAAC 
GAGCGCCCTC 
TATGGAAGAT 
GGGTAGACAC 
AGAGGAAGGG 
TCCAAATATA 
TGAGTTTCTT 
GGATTCTTCA 
GTCAGCTCGT 
AGCAACTAAG 
CAGAGCTTTG 
TAAGAATACA 
TGAAAATGGC 
TGGCAAAATG 
GTAGCTTTCT 
TAACTCTTGT 
CGTCTTCTAA 
AATTCTGTAA 



21 
1 

TTTGTAGAGC 
CAGGGAGTAT 
TATACCAAAA 
AAAACTACAG 
GTTCCTAGTA 
GTCAGTCTTC 
TCCATGGATC 
CTTGTTAAGG 
TACTCAACTC 
AAACCACTTT 
GGCACTCCCA 
TTTCCCAAAT 
TTGGATTTGC 
GCACTGAATC 
GACAAAAAGT 
CTATTTTTGT 
TTTCAAAAAT 
ATGTGAAAAA 



31 
I 

GAGGGGCCAA 
GCGTGCGGGG 
TAGAGAAAAT 
GTCAAGTGGT 
CTGCAATTCG 
AGGATGTGCT 
TGAAGAAATA 
TAGTAACACT 
CAGTTGACAT 
TCCATGGGGA 
ATAATGAAGT 
GGAAACCAGG 
TCTCGAAAAT 
ATCCATATTT 
TTCCATATGT 
CTTATATATA 
ATAACTTAAA 
AAAAAAAAAA 



41 
I 

CTTGGCAGAG 
ACACGGGATC 
TGGAGAAGGT 
AGCCATGAAA 
GGAAATTTCT 
TATGCAGGAT 
CTTGGATTCT 
CTGGTACAGA 
TTGGAGTATA 
TTCAGAAATT 
GTGGCCAGAA 
AAGCCTAGCA 
GTTAATCTAT 
TAATGATTTG 
TATGTCAACA 
TTTCTTTGTT 
AATGTAAATA 



51 
I 

CGCGCGGCCA 
TACCCATACC 
ACCTATGGAG 
AAAATCAGAC 
CTATTAAAGG 
TCCAGGTTAT 
ATCCCTCCTG 
TCTCCAGAAG 
GGCACCATAT 
GATCAACTCT 
GTGGAATCTT 
TCCCATGTCA 
GATCCAGCCA 
GACAATCAGA 
GATAGTTGTG 
ATCAAACTTC 
TTCTATATGA 



Seq ID NO: 242 Protein sequence: 
Protein Accession #: NP_203698.1 






GAGCAACCTC 
CGACCCAGAG 
GCGGGGCCCA 
ACCTGCCACC 
GCTGTTGGGC 
GCCCCAGTGG 
CGAGGGGCTG 
TGACTCCTTG 
CATCCTCCTG 
CTTGGAAGAC 
TCTTGCAGGT 
ATTCTATGAC 

CCGAAAAACA 
GAAAGACTAC 
GGACATTGAG 
GTATGGTATT 
AAACATGGCT 
TTGTATTACT 
TATATATAGA 
CTCATTATGT 
CCATATTGAT 
CAGTCAAATA 
CTAATTTACC 
TTATTTTTTA 
TTTCATTGGT 
AGCCAAGAAG 
GTGATAAATT 
TTTGCTTTGA 
CACAACTTTA 
ACCTTTTTGT 
TATATCTTCC 
GATAATCTGG 
TCTTTTTTCT 
AATATTAATT 
TTTATTTGCT 
CTTCATGTGA 
ACACATACCT 
AAACCTACGC 
ATTCTTTCAG 
TTTCCAGTCT 



11 
I 

AGCTTCTAGT 
CTTCTCCAGC 
GCCACCTTCG 
CCTGAGCCAG 
TTCATTCTCG 
AGGATTTACT 
TGGATGTCCT 
CTGAATCTGA 
GGAGTGATAG 
GATGAGGTGC 
CTGGCTATTT 
CCTATGACCC 
GCTGCTTCTC 
ACCTCTTACC 
GTGTGACACA 
ATACTATCAT 
ACAAAACAAA 
TAATCTTATT 
GCTTCCCATT 
TATGTATATA 
TGATACTAGC 
GAAGATGTTT 
TCATTTACTC 
AAGGATGAAT 
CCATAATCTT 
CTCTATCTCC 
AATTTATTAC 
CCTGTTGACC 
AAATATTTGT 
TTGATTGAAT 
TCCCCATTCC 
TAATAAGGTG 
TGACAAATAT 
ATCTGCCAAA 
AGTTTATATT 
CAGCTGGCTG 
TTCACTGCCT 
TCATGTGGTT 
ACATACCTTC 
CTGTGTCTGA 
GTACAGAATG 



21 
I 

ATCCAGACTC 
GGCGGCGCAG 
GGAGTCCGGG 
CGCGGGCGCC 

CCTATGCCGG 
GCGTGTOGCA 
GCAGCACATT 
CAATCTTTGT 
AGAAGATGAG 
TAGTTGCCAC 
CAGTCAATGC 
TCTOCCTTCT 
CAACACCAAG 
GAGGCAAAAG 
TAACATTAGG 
CAAACAAACA 
TTATCTTCTT 
GAGTAATCAT 
TACATGTTTT 
ATACTTAAAA 
ATTGGTATAT 
TTCTTCATTA 
TCTTTCAATT 
ATAGCACTTG 
TGAATCTAAC 
AAATCAGAAC 
TTCCCACACA 
CCAATTGAGT 
TTTTAAGCTA 
TTAATTGTAT 
TGGTCTGTTT 
TCTCTCTGTA 
TTGAGATAAT 
ACTCTCATTC 
AGACACTGAA 
TCCTCTCTCT 
CAGTGCCTTC 
ATGTGGCTCA 
CATGTTTGTG 
CTATTTCACT 



31 

! 

CAGCGCCGCC 
CGAGCAGGGC 
TTGCCCACCT 
CGAGCGAGTC 
ATGGATCGGC 
CGACAACATC 
GAGCACCGGG 
GCAAGCAACC 
GGCCACCGTT 
GATGGCTGTC 
AGCATGGTAT 
CAGGTACGAA 
GGGAGGTGCC 
GCCCTATCCA 
GAGAAAATCA 
ACCTTAGAAT 
AAAAACCCAT 
TCCTCAATAT 
ACTCAAATGG 
TCTATTAAAA 
TATCTCTAAA 
TTTCTTTTTC 
GCTTTGGGTG 
CTTCATGCGT 
CATCGTTATT 
ACATTTCATA 
TTTGGAGGCA 
ATCCCTGTAC 
AGCTGCATGC 
CTTATTCATA 
TGTTTTCCCA 
GTCTGAACAA 
GCTGTAAGCA 
GATACTTAAC 
TTTGAACATG 
GAAGTCACTG 
ACCAGTCTAT 
CTCTCTCTAC 
GTGCCTTCCT 
CTCTGTTCCA 
TGAGCAAGAT 



41 
I 

CCGGGCGCGG 
TCCCCGCCTT 
GCAAACTCTC 
ATGGCCAACG 
GCCATCGTCA 
GTGACCGCCC 
CAGATCCAGT 
CGTGCCTTGA 
GGCATGAAGT 
ATTGGGGGTG 
GGCAATAGAA 
TTTGGTCAGG 
CTACTTTGCT 
AAACCTGCAC 
TGTTGAAACA 
TTTGGGTATT 
GTGTTAAAAT 
AGGAGGGAAG 
GGGAAGGGGT 
ATAGACAGTA 
ATAGGTAAAT 
GTCCTTATAT 
CCTTTGCCAC 
GCCCTTTTCA 
AAGCCCTTAT 
GCCTACATTT 
AATCTTTCTG 
TCTGACCCAT 
TGTTCCCCCA 
GTTTTATATC 
AGTGTAATTA 
AGTGCTAGAC 
AGTCACTTAA 
CAGTTAGAAG 
AACTATGCCT 
AACAAAACCT 
TTCCACTGAA 
CAGTCTATTT 
CTCTCTACCA 
TTTTAACAAC 
GATGTATGGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1 11 21 31 

I.I I I ! I 

MEDYTKIEKI GEGTYGVVYK GRHKTTGQVV AMKKIRLESE EEGVPSTAIR EISLLKELRH
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKVVTL WYRSPEVLLG
SARYSTPVDI WSIGTIFAEL ATKKPLFHGD SEIDQLFRIF RALGTPNNEV WPEVESLQDY
KNTFPKWKPG SLASHVKNLD ENGLDLLSKM LIYDPAKRIS GKMALNHPYF NDLDNQIKKM

Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession #: AF101051.1

Coding sequence: 221-856 



60 
120 
180 



51 
I 

ACCCCAACCC 
AACTTCCTCC 
CGCCTTCTGC 
CGGGGCTGCA 
GCACTGCCCT 
AGGCCATGTA 
GCAAAGTCTT 
TGGTGGTTGG 
GTATGAAGTG 
CGATATTTCT 
TCGTTCAAGA 
CTCTCTTCAC 
GTTCCTGTCC 
CTTCCAGCGG 
AACCGAAAAT 
GTAATCTGAA 
ACTCAGTGCT 
ATTTTACCAT 
GCTCCTTAAA 
AAATACTATT 
GTATTTAATT 
ACATATGTAA 
AAGACCTAGC 
TATACTTATT 
TTGTTTTGTG 
TAGTTTCTAA 
CATGACCAAA 
AGCACTCTTG 
GGTGTTGTAA 
CCCCTAAACT 
TCATGCGTTT 
TTTCTGGAGT 
TCTTTCTACC 
AGGTAGTGTG 
ATGTAGTGTC 
ACACACGTAC 
CAAAACCTAC 
CCACTGAACA 
GTCTATTTCC 
TGCTCTTACT 
AAGGGTGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 






WO 02/086443 

GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 

TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 



Seq ID NO: 244 Protein sequence:
Protein Accession #: AAD15433.1

1 11 21 31 41 51 

| | | | | |

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60

QIQCKVFDSL LNLSSTLQAT RALMVVGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

Seq ID NO: 245 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

| | | | | |

TTTTTTTTTT TTTTTTTTTT TTTTTCAAGG AGAGCACAAG GAACTTTATT AATGACTTTC 60 

TTAATGGTTA AATGCTGTTT ACCAAGTOAC CCAGAGGCAO CGTGGTTTAG TGGTTTCAAC 120 

AGCATGGTCC CGAGAGTCTG ACAAACCTCA GTTCAAATCC TTCTTTTGTC TTCACTTAGT 180 

TTTTCTTCCT GAGATTTAGT TTCTTCATCG TTAACAATGA GGATATTAAT ATGTTTCACA 240 

CAGTTGTTAT GAAGAATGCA TATATTAGAA TGCCTGTAGT CTCAGCTACT CAGGAGGCTA 300 

AGGTGGGGAG GTCGCTCAAG CCCAGGAATT CAAAGCTGCA ATGCATTATG ATTACAGCTG 360 

TTAATAGCCA CTGCACTTCA GCCTGGGCAA TGTAGTAAGA TCCCATCTCT GGCTCGGAGG 420 
GTCCTACGCC CACGGAGTCT OGCTGATTGC TAGCACAGCA GTCTGAGATC AAACTGCA 

Seq ID NO: 246 DNA sequence

Nucleic Acid Accession #: XM_058553.2 

Coding sequence: 897-1400 

1 11 21 31 41 51 

| | | | | |

AATTTTCAGA AGTTTCGTAT GGGGATGGTT TTATATAAAT TCAGGTTTTT CCCACAATAA 60 

TAAATGTATT TAGTCTCAGT GCTCAATAGA AGAGATTTCT AATAGAAAAG GATTCAAACT 120 

GTGAAACCAT TTCTCTTTTA ATGTTTCACA TTCCTGTTAC AGATTTGTTC TCTTGTGACT 180 

CTGTTATCCA TAATATGGAC AGTTCTTGAG TCCTAACATT GAGAGGTTTT CCCTTAGTGC 240 

ATAGAGGGAA TGAGTATTAA TTGGAGAAGC TTAAAGTATT GCCACTTTAG CACTGAAGAT 300 

TGGGATGAGA GGAGGTGAAA CCTCACTAGA AAAAGGGACA ATGTTAGTGT GGCCCTTCCT 360 

GATCATGTTT AAGAAAAGTC ATGAAAATGG TGAACTAGTG TTTCCAAGCA TATTGGAAGG 420 

GTTGAGTGTA TACTGTCTGT CAAAGACTTC CAGCATTTCC AGGTCCTAGA GAGGAACAAG 480 

ACTGGTAACC TGCCTATCTG TATTTTTAAG AACCCAGGAG GAAAGCTTTA TAATAGAACA 540 

TTATTTCTGT GTTTATGTAT AAGGGGTTTT TTGTTTTTTT AAAGACAGGA TCTCACTCCA 600 

TTGTCCAGGC CAAGTGCAAT GGCACGAACC TCATAGCTCC TGGACTTAAG TGATCTGCCT 660 

GCCTTTGCCT CCTGAGTAGC TGGGACTACA GGCATGAGCC CCCATGCCTG GCTAAGTTTG 720 

TTTTTTTGTT TGTTTGTTTG TTTGTTTTTG GGGGGGGTTG TTTTGTTTTT TGTAGAGACG 780 

TAGTCTTGCT TTGTTGCCAG GCTAGTCTCA AACTCCTGGC TTCAAGTGAT CCTCCTGCCT 840 

CAGCCTCCCA GAGTGCTAGG ATTACAGCAC TTGGATTCAG CTTCTTCATT TCCAACATGG 900 

AAGAAACTTA CACCGACTCC CTGGACCCTG AGAAGCTATT GCAATGCCCC TATGACAAAA 960 

ACCATCAAAT CAGGGCTTGC AGGTTTCCTT ATCATCTTAT CAAGTGCAGA AAGAATCATC 1020 

CTGATGTTGC AAGCAAATTG GCTACTTGTC CCTTCAATGC TCGCCACCAG GTTCCTCGAG 1080 

CTGAAATTAG TCATCATATC TCAAGCTGTG ATGACAGAAG TTGTATTGAG CAAGATGTTG 1140 

TCAACCAAAC CAGGAGCCTT AGACAAGAGA CTCTGGCTGA GAGCACTTGG CAGTGCCCTC 1200 

CTTGCGATGA AGACTGGGAT AAAGATTTGT GGGAGCAGAC CAGCACCCCA TTTGTCTGGG 1260 

GCACAACTCA CTACTCTGAC AACAACAGCC CTGCGAGCAA CATAGTTACA GAACATAAGA 1320 

ATAACCTGGC TTCAGGCATG CGAGTTCCCA AATCTCTGCC GTATGTTCTG CCATGGAAAA 1380 

ACAATGGAAA TGCACAGTAA CTGAATACCT ATCTCATCAA ATGCCAGACC CTAGAAGACT 1440 

GTTGCTTCTT CTTCTACCAG TGGGTTCTCA TTTTCCTCCT AATCTAATTA TAGAATGGTA 1500 

AACTCCCTGT GACTTTCCAA ACTGACAAGC ACACTTTTTT CCTCCCCCCT TGAATCCTCA 1560 
TTTAATGCAA GAACCCTCAT ACTCAGAAGC TTCCAAATAA ACCTTTGATA CAGATTG 



Seq ID NO: 247 Protein sequence: 
Protein Accession #: XP_058553.1

1 11 21 31 41 51 

| | | | | |

MEETYTDSLD PEKLLQCPYD KNHQIRACRF PYHLIKCRKN HPDVASKLAT CPFNARHQVP 60
RAEISHHISS CDDRSCIEQD VVNQTRSLRQ ETLAESTWQC PPCDEDWDKD LWEQTSTPFV 120
WGTTHYSDNN SPASNIVTEH KNNLASGMRV PKSLPYVLPW KNNGNAQ 




Seq ID NO: 248 DNA sequence
Nucleic Acid Accession #: NM_003392
Coding sequence: 758..1855



1 11 21 31 41 51

| | | | | |

TTAAGGAAAT CCGGGCTGCT CTTCCCCATC TGGAAGTGGC TTTCCCCACA TCGGCTCGTA 60 

AACTGATTAT GAAACATACG ATGTTAATTC GGAGCTGCAT TTCCCAGCTG GGCACTCTCG 120 

CGCGCTGGTC CCCGGGGCCT CGCCCCCCAC CCCCTGCCCT TCCCTCCCGC GTCCTGCCCC 180 

CATCCTCCAC CCCCCGCGCT GGCCACCCCG CCTCCTTGGC AGCCTCTGGC OGCAGCGCGC 240 

TCCACTCGCC TCCOGTGCTC CTCTCGCCCA TGGAATTAAT TCTGGCTCCA CTTGTTGCTC 300 

GGCCCAQGTT OQGGAGAGGA CGGAGGGTGQ CCGCAGCGGG TTCCTGAGTG AATTACCCAG 360 

GAGGGACTGA GCACAGCACC AACTAGAGAG GGGTCAGGGG GTGCGGGACT CGAGCGAGCA 420 

GGAAGGAGGC AGCGCCTGGC ACCAGGGCTT TGACTCAACA GAATTGAGAC ACGTTTGTAA 480 

TCGCTGGCGT GCCCCGCGCA CAGGATCCCA GCGAAAATCA GATTTCCTGG TGAGGTTGCG 540 

TGGGTGGATT AATTTGGAAA AAGAAACTGC CTATATCTTG CCATCAAAAA ACTCACGGAG 600

GAGAAGCGCA GTCAATCAAC AGTAAACTTA AGAGACCCCC GATGCTCCCC TGGTTTAACT 660 

TGTATGCTTG AAAATTATCT GAGAGGGAAT AAACATCTTT TCCTTCTTCC CTCTCCAGAA 720 

GTCCATTGGA ATATTAAGCC CAGGAGTTGC TTTGGGGATG GCTGGAAGTG CAATGTCTTC 780 

CAAGTTCTTC CTAGTGGCTT TGGCCATATT TTTCTCCTTC GCCCAGGTTG TAATTGAAGC 840 

CAATTCTTGG TGGTCGCTAG GTATGAATAA CCCTGTTCAG ATGTCAGAAG TATATATTAT 900

AGGAGCACAG CCTCTCTGCA GCCAACTGGC AGGACTTTCT CAAGGACAGA AGAAACTGTG 960 

CCACTTGTAT CAGGACCACA TGCAGTACAT CGGAGAAGGC GCGAAGACAG GCATCAAAGA 1020 

ATGCCAGTAT CAATTCCGAC ATCGACGGTG GAACTGCAGC ACTGTGGATA ACACCTCTGT 1080 

TTTTGGCAGG GTGATGCAGA TAGGCAGCCG CGAGACGGCC TTCACATACG CCGTGAGCGC 1140 

AGCAGGGGTG GTGAACGCCA TGAGCCGGGC GTGCCGCGAG GGCGAGCTGT CCACCTGCGG 1200 

CTGCAGCCGC GCCGCGCGCC CCAAGGACCT GCCGCGGGAC TGGCTCTGGG GCGGCTGCGG 1260 

OGACAACATC GACTATGGCT ACCGCTTTGC CAAGGAGTTC GTGGACGCCC GCGAGCGGGA 1320 

GCGCATCCAC GCCAAGGGCT CCTACGAGAG TGCTCGCATC CTCATGAACC TGCACAACAA 1380 

CGAGGCCGGC CGCAGGACGG TGTACAACCT GGCTGATGTG GCCTGCAAGT GCCATGGGGT 1440 

GTCCGGCTCA TGTAGCCTGA AGACATGCTG GCTGCAGCTG GCAGACTTCC GCAAGGTGGG 1500 

TGATGCCCTG AAGGAGAAGT ACGACAGCGC GGCGGCCATG CGGCTCAACA GCCGGGGCAA 1560 

GTTGGTACAG GTCAACAGCC GCTTCAACTC GCCCACCACA CAAGACCTGG TCTACATCGA 1620 

CCCCAGCCCT GACTACTGCG TGCGCAATGA GAGCACCGGC TCGCTGGGCA CGCAGGGCCG 1680 

CCTGTGCAAC AAGACGTCGG AGGGCATGGA TGGCTGCGAG CTCATGTGCT GCGGCCGTGG 1740 

GTACGACCAG TTCAAGACCG TGCAGACGGA GCGCTGCCAC TGCAAGTTCC ACTGGTGCTG 1800 

CTACGTCAAG TGCAAGAAGT GCACGGAGAT CGTGGACCAG TTTGTGTGCA AGTAGTGGGT 1860 

GCCACCCAGC ACTCAGCCCC GCTCCCAGGA CCCGCTTATT TATAGAAAGT ACAGTGATTC 1920 

TGGTTTTTGG TTTTTAGAAA TATTTTTTAT TTTTCCCCAA GAATTGCAAC CGGAACCATT 1980 

TTTTTTCCTG TTACCATCTA AGAACTCTGT GGTTTATTAT TAATATTATA ATTATTATTT 2040

GGCAATAATG GGGGTGGGAA CCACGAAAAA TATTTATTTT GTGGATCTTT GAAAAGGTAA 2100 

TACAAGACTT CTTTTGGATA GTATAGAATG AAGGGGGAAA TAACACATAC CCTAACTTAG 2160 

CTGTGTGGGA CATGGTACAC ATCCAGAAGG TAAAGAAATA CATTTTCTTT TTCTCAAATA 2220 

TGCCATCATA TGGGATGGGT AGGTTCCAGT TGAAAGAGGG TGGTAGAAAT CTATTCACAA 2280 

TTCAGCTTCT ATGACCAAAA TGAGTTGTAA ATTCTCTGGT GCAAGATAAA AGGTCTTGGG 2340

AAAACAAAAC AAAACAAAAC AAACCTCCCT TCCCCAGCAG GGCTGCTAGC TTGCTTTCTG 2400 

CATTTTCAAA ATGATAATTT ACAATGGAAG GACAAGAATG TCATATTCTC AAGGAAAAAA 2460 

GGTATATCAC ATGTCTCATT CTCCTCAAAT ATTCCATTTG CAGACAGACC GTCATATTCT 2520 

AATAGCTCAT GAAATTTGGG CAGCAGGGAG GAAAGTCCCC AGAAATTAAA AAATTTAAAA 2580 

CTCTTATGTC AAGATGTTGA TTTGAAGCTG TTATAAGAAT TGGGATTCCA GATTTGTAAA 2640

AAGACCCCCA ATGATTCTGG ACACTAGATT TTTTGTTTGG GGAGGTTGGC TTGAACATAA 2700 

ATGAAATATC CTGTATTTTC TTAGGGATAC TTGGTTAGTA AATTATAATA GTAGAAATAA 2760 

TACATGAATC CCATTCACAG GTTTCTCAGC CCAAGCAACA AGGTAATTGC GTGCCATTCA 2820 

GCACTGCACC AGAGCAGACA ACCTATTTGA GGAAAAACAG TGAAATCCAC CTTCCTCTTC 2880 

ACACTGAGCC CTCTCTGATT CCTCCGTGTT GTGATGTGAT GCTGGCCACG TTTCCAAACG 2940 

GCAGCTCCAC TGGGTCCCCT TTGGTTGTAG GACAGGAAAT GAAACATTAG GAGCTCTGCT 3000

TGGAAAACAG TTCACTACTT AGGGATTTTT GTTTCCTAAA ACTTTTATTT TGAGGAGCAG 3060 

TAGTTTTCTA TGTTTTAATG ACAGAACTTG GCTAATGGAA TTCACAGAGG TGTTGCAGCG 3120 

TATCACTGTT ATGATCCTGT GTTTAGATTA TCCACTCATG CTTCTCCTAT TGTACTGCAG 3180 

GTGTACCTTA AAACTGTTCC CAGTGTACTT GAACAGTTGC ATTTATAAGG GGGGAAATGT 3240 

GGTTTAATGG TGCCTGATAT CTCAAAGTCT TTTGTACATA ACATATATAT ATATATACAT 3300 

ATATATAAAT ATAAATATAA ATATATCTCA TTGCAGCCAG TGATTTAGAT TTACAGCTTA 3360 

CTCTGGGGTT ATCTCTCTGT CTAGAGCATT GTTGTCCTTC ACTGCAGTCC AGTTGGGATT 3420 

ATTCCAAAAG TTTTTTOAGT CTTGAGCTTG GGCTGTGGCC CCGCTGTGAT CATACCCTGA 3480 

GCACGACGAA GCAACCTCGT TTCTGAGGAA GAAGCTTGAG TTCTGACTCA CTGAAATGOG 3540 

TGTTGGGTTG AAGATATCTT TTTTTCTTTT CTGCCTCACC CCTTTGTCTC CAACCTCCAT 3600 

TTCTGTTCAC TTTGTGGAGA GGGCATTACT TGTTCGTTAT AGACATGGAC GTTAAGAGAT 3660 

ATTCAAAACT CAGAAGCATC AGCAATGTTT CTCTTTTCTT AGTTCATTCT GCAGAATGGA 3720 

AACCCATGCC TATTAGAAAT GACAGTACTT ATTAATTGAG TCCCTAAGGA ATATTCAGOC 3780

CACTACATAG ATAGCTTTTT TTTTTTTTTT TTTTTTTTAA TAAGGACACC TCTTTCCAAA 3840

CAGGCCATCA AATATGTTCT TATCTCAGAC TTACGTTGTT TTAAAAGTTT GGAAAGATAC 3900 

ACATCTTTTC ATACCCCCCC TTAGGAGGTT GGGCTTTCAT ATCACCTCAG CCAACTGTGG 3960

CTCTTAATTT ATTGCATAAT GATATCCACA TCAGCCAACT GTGGCTCTTT AATTTATTGC 4020 

ATAATGATAT TCACATCCCC TCAGTTGCAG TGAATTGTGA GCAAAAGATC TTGAAAGCAA 4080 

AAAGCACTAA TTAGTTTAAA ATGTCACTTT TTTGGTTTTT ATTATACAAA AACCATGAAG 4140 

TACTTTTTTT ATTTGCTAAA TCAGATTGTT CCTTTTTAGT GACTCATGTT TATOAAGAGA 4200 

GTTGAGTTTA ACAATCCTAG CTTTTAAAAG AAACTATTTA ATGTAAAATA TTCTACATGT 4260

CATTCAGATA TTATGTATAT CTTCTAGCCT TTATTCTGTA CTTTTAATGT ACATATTTCT 4320 

GTCTTGCGTG ATTTGTATAT TTCACTGGTT TAAAAAACAA ACATCGAAAG GCTTATTCCA 4380
AATGGAAGAT AGAATATAAA ATAAAACGTT ACTTGTAAAA AAAAAAAA 

Seq ID NO: 249 Protein sequence: 
Protein Accession #: NP_003383

1 11 21 31 41 51

| | | | | |
MAGSAMSSKF FLVALAIFFS FAQVVIEANS WWSLGMNNPV
SQGQKKLCHL YQDHMQYIGE GAKTGIKECQ YQFRHRRWNC
AFTYAVSAAG VVNAMSRACR EGELSTCGCS RAARPKDLPR
FVDARERERI HAKGSYESAR ILMNLHNNEA GRRTVYNLAD
LADFRKVGDA LKEKYDSAAA MRLNSRGKLV QVNSRFNSPT
GSLGTQGRLC NKTSEGMDGC ELMCCGRGYD QFKTVQTERC
QFVCK 

Seq ID NO: 250 DNA sequence
Nucleic Acid Accession #: NM_014058
Coding sequence: 56..1324



PCT/US02/12476 



i 
I 

TGACTTGGAT 
TCGGCCAGAT 
CGTCATCTTC 
GAGATATAAT 
ACTATATGCT 
TGAATCAATG 
TCAGGTTATC
TAGATTTCAC 
TGAAAAGCTG 
AAAAATCAAC 
TAAAACTCTA 
GCCCTGGCAG 
TGCCACATGG 
GACTGCTTCC 
AATTGTCCAT 
TTCTAGCCCT 
TGAGTTTCAA 
TTACAGTCAA 
TGAACCTCAA 
AGGAAAAACA 
AGATATCTGG 
GCCTGGTGTT 
CTAAGAGAGA 
CCATTTTTAG 
AATAAACTGT 



11 
I 

GTAGACCTCG 
GTGGTGAGGG 
ATATCCCTGA 
CAAAAGAAGA 
GAGTTTGGCA 
GTGAAAAATG 
AAGTTCAGTC 
TCTACTGAGG 
CAAGATGCTG 
AAGACAGAAA 
GGTCAGAGTC 
GCTAGCCTGC 
CTTGTGAGTG 
TTTGGAGTAA 
GAAAAATACA 
GTTCCCTACA 
CCAGGTGATG 
AATCATCTTC 
GCTTACAATG 
GATGCATGCC 
TACCTTGCTG 
TATACTAGAG 
AAAGCCTCAT 
AGATACAGAA 
TTGCTTGATG 



21 
I 

ACCTTCACAG 
CTAGGAAAAG 
TTGTCCTGGC 
CCTACAATTA 
GAGAGGCTTC 
CATTTTATAA 
AACAGAAGCA 
ATCCTGAAAC 
TAGGACCCCC 
CAGACAGCTA 
TCAGGATCGT
AGTGGGATGG 
CTGCTCACTG 
CAATAAAACC 
AACACCCATC 
CAAATGCAGT 
TGATGTTTGT 
GACAAGCACA 
ACGCCATAAC 
AGGGTGACTC 
GAATAGTGAG 
TTACGGCCTT 
GGAACAGATA 
TTGGAGAAGA 
CAAAAAAAAA 



31 
I 

GACTCTTCAT 
AGTTTGTTGG 
AGTGTGCATT 
CTATAGCACA 
TAACAATTTT 
ATCTCCATTA 
TGGAGTGTTG 
TGTAGATAAA 
TAAAGTAGAT 
TCTAAACCAT 
TGGTGGGACA 
GAGTCATCGC 
TTTTACAACA 
TTCGAAAATG 
ACATGACTAT 
ACATAGAGTT 
GACAGGATTT 
GGTGACTCTC 
TCCTAGAATG 
TGGAGGACCA 
CTGGGGAGAT 
GCGGGACTGG 
ACATTTTTTT 
CTTGCAAAAC 
A 



QMSEVYIIGA 
STVDNTSVFG 
DWLWGGCGDN 
VACKCHGVSG 
TQDLVYIDPS 
HCKFHWCCYV 



41 
I 

TGCTGGTTGG 
GAACCCTGGG 
GGACTCACTG 
TTGTCATTTA 
ACAGAAATGA 
AGGGAAGAAT 
GCTCATATGC 
ATTGTTCAAC 
CCTCACTCAG 
TGCTGCGGAA 
GAAGTAGAAG 
TGTGGAGCAA 
TATAAGAACC 
AAACGGGGTC 
GATATTTCTC 
TGTCTCCCTG 
GGAGCACTGA 
ATAGACGCTA 
TTATGTGCTG 
CTGGTTAGTT 
GAATGTGCGA 
ATTACTTCAA 
TTGTTTTTTG 
AGCTAGATTT 



QPLCSQLAGL 
RVMQIGSRET
IDYGYRFAKE 
SCSLKTCWLQ 
PDYCVRNEST 
KCKKCTEIVD 



51 
I 

CAATGATGTA 
TTATCGGCCT
TTCATTATGT 
CAACTGACAA 
GCCAGAGACT 
TTGTCAAGTC 
TGTTGATTTG 
TTGTTTTACA 
TTAAAATTAA 
CACGAAGAAG 
AGGGTGAATG 
CCTTAATTAA 
CTGCCAGATG 
TCCGGAGAAT 
TTGCAGAGCT 
ATGCATCCTA 
AAAATGATGG 
CAACTTGCAA 
GCTCCTTAGA 
CAGATGCTAG 
AACCCAACAA 
AAACTGGTAT 
GGTGTGGAGG 
GACTGATCTC 



Seq ID NO: 251 Protein sequence: 
Protein Accession #: NP_054777



1 
I 

MYRPDVVRAR
DKLYAEFGRE
ICRFHSTEDP
RSKTLGQSLR
RWTASFGVTI
SYEFQPGDVM 
LEGKTDACQG 
GI 



11
I 

KRVCWEPWVI 
ASNNFTEMSQ 
ETVDKIVQLV
IVGGTEVEEG
KPSKMKRGLR
FVTGFGALKN 
DSGGPLVSSD 



21 
I 

GLVIFISLIV 
RLESMVKNAF 
LHEKLQDAVG 
EWPWQASLQW 
RIIVHEKYKH 
DGYSQNHLRQ 
ARDIWYLAGI



31 
I 

LAVCIGLTVH
YKSPLREEFV 
PPKVDPHSVK 
DGSHRCGATL 
PSHDYDISLA 
AQVTLIDATT 
VSWGDECAKP 



41 

I 

YVRYNQKKTY 
KSQVIKFSQQ 
IKKINKTETD 
INATWLVSAA
ELSSPVPYTN
CNEPQAYNDA 
NKPGVYTRVT 



51 
I 

NYYSTLSFTT 
KHGVLAHMLL 
SYLNHCCGTR 
HCFTTYKNPA 
AVHRVCLPDA 
ITPRMLCAGS 
ALRDWITSKT 



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession #: NM_003504

Coding sequence: 71-1771 



GGCACGAGGC 
CGCCGTGGCT 
GAGGGTCCTT 
GGCCTTGTTC
ACTTGAAACT 
TGGAGCTAAT 
GTGTGACACC 
ACTCATTAAA 
AGAGGAGGAT 
CACACGGTTA 
GGAGGCCCGG 
GTCAGCCATG 
GTGGTGGGCC 
ATACGTGACT 
GGATGAGGAG 
CCTGGTGCTC 
AGCCAGGTTC 
CATGGGTCTT 
GGAGAATTTG 
CGTGCAGACT 
CTTTGCCACC 
CATCCAGGCT 
ACTCGCCAAG 
CCTCGTCATC 
CATGCTGTTC 
TGTGTGTTCG 



11 
I 

CTCGTGCCGC 
ATGTTCGTGT 
CTCTTCGTGG 
CAGTGTGACC 
GCATTTCTTG 
GTAGACCTAT 
CATAGGCCAG 
CAAGATGATG 
GAAGAGCATT 
GAAGAGGAGA 
AGAAGAGACA 
GTGATGTTTG 
ATCGTTGGAC 
GATGTTGGTG 
AACACACTCT 
TACCAGCACT 
AAGCTGTGGT 
CCCCTGAAGC 
CGGGAAATGA 
TTCAGCATTC 
ATGTCTTTGA 
CTGGACAGCC 
AAGCAGCTGC 
TCCCAGGGGC 
TCTAGGCCGG 
ACAAAGAACC 



21 

I 

CGGGCTCTTG 
CCGATTTCCG 
CCTCGGACGT
ACGTGCAATA 
AGCATAAAGA 
TGGATATTCT 
TCAATGTCGT 
ACCTTGAAGT 
CAGGAAATGA 
TAGTGGAGCA 
TCCTCTTTGA 
AGCTGGCTTG 
TAACAGACCA 
TCCTGCAGCG 
CCGTGGACTG 
GGTCCCTCCA 
CTGTGCATGG 
AGGTGAAGCA 
TTGAAGAGTC 
ATTTTGGGTT 
TGGAGAGCCC 
TCTCCAGGAG 
GAGCCACCCA 
CTTTCCTGTA 
CATCCCTAAG 
GGCGCTGCAA 



31 
I 

GTACCTCAGC 
CAAAGAGTTC 
GGATGCTCTG 
TACGCTGGTT
ACAGTTTCAT 
TCAACCTGAT 
CAATGTATAC 
TCCCGCCTAT 
CAGTGATGGG 
AACCATGCGG 
CTACGAGCAG 
GATGCTGTCC 
GTGGGTGCAA 
CCACGTTTCC
CACACGGATC
TGACAGCCTG
ACAGAAGCGG
GAAGTTCCAG 
TGCAAATAAA 
CAAGCACAAG 
OGAGAAGGAT 
TAACCTGGAC 
GCAGACCATT 
CTGCTCTCTC 
CCTGCTCAGC 
ACTGCTGCCC 



41 

I 

GCGAGCGCCA 
TACGAGGTGG 
TGTGCGTGCA 
CCAGTTTCTG 
TATTTTATTC 
GAAGACACTA 
AACGATACCC 
GAAGACATCT 
TCAGAGCCTT 
AGGAGGCAGC 
TATGAATATC 
AAGGACCTGA 
GACAAGATCA 
CGCCACAACC 
TCCTTTGAGT 
TGCAACACCA 
CTCCAGGAGT 
GCCATGGACA 
TTTGGGATGA 
TTTCTGGCCA 
GGCTCAGGGA 
AAGCTGTACC 
GCCAGCTGCC 
ATGGAGGGCA 
AAACACCTGC 
CTGGTGATGG 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380
1440 



60 
120 
180
240 
300 
360 
420 



51 

! 

GGCGTCCGGC 
TCCAGAGCCA 
AGATCCTTCA 
GGTGGCAAGA 
TCATAAACTG 
TATTCTTTGT 
AGATCAAATT 
TCAGGGATGA 
CTGAGAAGCG 
GGCGAGAGTG 
ATGGGACATC 
ATGACATGCT 
CTCAAATGAA 
ACCGGAACGA
ATGACCTCCG 
GCTATACCGC 
TCCTTGCAGA 
TCTCCTTGAA 
AGGACATGCG 
GCGACGTGGT 
CAGATCACTT 
ATGGCCTGGA 
TTTGCACCAA 
CTCCAGATGT 
TCAAGTCCTT 
CTGCCCCCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 




GAGCATGGAG CATGGCACAG TGACCGTGGT OGGCATCCCC CCAGAGACOG ACAGCTCGGA 
CAGGAAGAAC TTTTTTGGGA GOOCQTTTGA 6AAG6CA60G GAAAGCACCA GCTCCCGGAT 
GCTGCACAAC CATTTTGACC TCTCAGTAAT TGAGCTGAAA GCTGAGGATC GGAGCAAGTT 
TCTGGACGCA CTTATTTCCC TCCTGTCCTA GGAATTTGAT TCTTCCAOAA TGACCTTCTT 
ATTTATGTAA CTGGCTTTCA TTTAGATTGT AAGTTATGQA CATOATTTGA GATGTAGAAQ 
CCATTTTTTA TTAAATAAAA TGCTTATTTT AGGCTCCGTC CCCAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AA 






Seq ID NO: 253 Protein sequence: 
Protein Accession #: NP_003495.1



I 

MFVSDFRKEF 
AFLEHKSQFH 
QDDDLEVFAY 
RRDILFDYEQ 
DVGVLQRHVS 
KLWSVHGQKR 
PSIHPGPKHK 
KQLRATQQTI 
TKNRRCKLLP 
HFDLSVIELK



11 
I 

YEVVQSQRVL
YFILINCGAN
EDIFRDEEED
YEYHGTSSAH
RHNHRNEDEE
LQEFLADMGL
FLASDVVFAT
ASCLCTNLVI 
LVMAAPLSME 
AEDRSKFLDA 



LFVASDVDAL 
VDLLDILQPD 



VMFEIAWMLS 
NTLSVDCTRI 
PLKQVKQKFQ 
MSLMESPEKD 
SQGPFLYCSL 
HGTVTVVGIP
LISLLS 



31 
I 

CACKILQALF 
EDTIFFVCDT 
SEPSEKRTRL 
KDLNDMLWWA 
SFEYDLRLVL 
AMDISLKENL 
GSGTDHFIQA 
MEGTPDVMLF 
PETDSSDRKN 



41 
I 

QCDHVQYTLV 
HRPVNVVNVY
EEEIVEQTKR 
IVGLTDQWVQ 
YQHWSLKDSL 
REMIEESANK 
LDSLSRSNLD 
SRPASLSLLS 
FFGRAFEKAA 



51 
I 

PVSGWQELET 
NDTQIKLLIK 
RRQRREWEAR 
DKITQMKYVT 
CNTSYTAARF 
FGMKDMRVQT 
KLYHGLELAK 
KHLLKSPVCS 
ESTSSRMLHN 



Seq ID NO: 254 DNA sequence
Nucleic Acid Accession #: NM_022337
Coding sequence: 48..683



GGCTGCGCTT 
ACAAGGAGCA 
TCATCAAGCG 
ACTTCGCGCT 
ATATCGCAGG 
GTGCATTTAT 
AAAATGATTT 
TGGCCAACAA 
AGTTCTGCAA 
ACATTGATGA 
TGGAGTCTAT 
GCTCTGGCTG 
TTGTTCCACA 
CACATGTGGC 
GTTCTTTCTA 
TCTGTTACAA 
TTATTTGCTT 
AACTAGCTGT 
AATATATTCT 
GACCTCCATT 
ACAGGTGTGC 
AACTGAATAT 
CTCAAGCTGT 
GCAAGTGAAC 



11 

I 

CCCTGGTCAG 
CCTGTACAAG 
CTACGTGCAC 
CAAGGTGCTC 
TCAAGAAAGA 
TGTCTTCGAT 
GGACTCCAAG 
ATGTGACCAG 
GGAGCACGGT 
AGCCTCCAGA 
TGAGCCGGAC 
TGCCAAATCC 
AATTGTGCCT 
AAGCCAAAGA 
TGCTTTCCTC 
ACTTCTGTCA 
CTTTTAATCA 
CAAGTCAAGG 
CTGATGGCCT 
CTCGGCAGAC 
TATATTGTCC 
TGTATGAAAA 
GGGGCTCCTC 
AATAAAACAT 



21 
I 

GCACGGCACG 
TTGCTGGTGA 
CAGAACTTCT 
CACTGGGACC 
TTTGGAAACA 
GTCACCAGGC 
TTAAGTCTCC 
GGGAAGGATG 
TTCGTAGGAT 
TGCCTGGTGA 
GTGGTGAAGC 
TAGTAGGCAC 
CTATTTTTAC 
TCTATGCCTC 
ACCATCATCA 
TGTAGCTGAC 
GCAAAGGCCT 
ACTGGCTTTC 
GACAGGCCTA 
CTAAGAGTTG 
TTGTCCTAAC 
GACATGCCTC 
TATACATGCT 
TAAAAGATAA 



31 

I 

TCTGGCCGGC 
TTGGCGACCT 
CCTCGCACTA 
CGGAGACTGT 
TGACGAGGGT 
CAGCCACATT 
CTAATGGCAA 
TGCTCATGAA 
GGTTTGAAAC 
AACACATACT 
CCCATCTCAC 
CTTTGCTGGT 
CATTTTGGGT 
TGTTTTTTCA 
CAGTGTTTAC 
CAAAATCCTG 
CAAGTCTTAA 
ACCTTGCCCT 
TTAAGTAGAT 
CCTCTGAGTT 
TGTCACTTGC 
CATATGTGCC 
ATACATGTAA 
AA 



41 
I 

CGCCAGGATG 
GGGCGTGGGG 
CCGGGCCACA 
GGTGCGCCTG 
CTATTACCGA 
TGAAGCAGTG 
ACCGGTTTCA 
CAATGGCCTC 
ATCAGCAAAG 
TGCAAATGAG 
ATCAACCAAG 
GTCTGGTAGG 
AAACGTCAGG 
ATGAGAGAGA 
AAACTTTTGA 
CAGGGCCACA 
AATAAAAGGG 
GGTGTCTTTT 
GTGATATTTT 
AGCTCTTTGG 
CATGGCCTGA 
TTTCTGTTAG 
TATATATTAT 



Seq ID NO: 255 Protein sequence: 
Protein Accession #: NP_071732



11 



GGAACCGCCC 

GGTGTTCGTG 
GAACTTGAGC 
CGTGATGGTC 
CGCCGTGGGC 
GGGCTGGTTC 
CGCTGACTTC 
CCCCATTCAG 
CATTCTCCTT 
TGGCGCCTAC 
CAAGGAGAGA 
CCTGTGGATG
CCGAGCCGCC 



11 

I 

GCTGCCAGCC 
CTGCCGCTCA 
CGCTACGACT 
GACATGGAGA 
TTCGTGGGCT 
TTCAACTTCC 
CACTTCTTAC 
TGCGTGGCCT 
CTGCTCATCA 
AACCTGCTAA 
TTTGGGCTCA 
CAGAATTCTG 
TACTGGCCCA 
ATCAACACCT 



21 
I 

CGGCCAGGCA 
CCTGCCTGCT 
TCGAGGCCGA 
ACGAATTCTA 
TCGGCTTCCT 
TGTTGGCAGC 
AAGACCGCTA 
CTGTCTGCGT 
TGACTTTCTT 
AGGTGAAGGA 
CAGTGACCCG 
TGTACCAGTC 
GCTTCAACTC 
ACTGCTCCTT 



31 
I 

CCCCTGCAGC 
CCTGCAGGTG 
CGCCCACTGG 
CTATCGCTAC 
CATGACTTTC 
CTTCGGCATC 
CATCGTCGTG 
GGCCTTTGGG 
CCAAGTGACC 
TGCAGGAGGC 
GATCCTCTAC 
GGACCTCTTT 
AGCCATATCC 
GGCAGCCTGC 



41 

I 

ATGGCCTGGA 
ATTATGGTGA 
TGGTCAGAGA 
CCAAGCTTCC 
CTGCAGCGCT 
CAGTGGGCGC 
GGCGTGGAGA 
GCAGTTCTGG 
CTCTTCGCTG 
TCCATGACCA 
CGACGCAACC 
GCCATGATTG 
TACCATGGGG 
GTQCTTACCT 



1620 
1680 
1740 
1800 
I860 
1920 



51 
I 

CAGGCCCCGC 
AAGACCAGTA 
ATCGGGGTGG 
CAGCTCTGGG 
GAAGCTATGG 
GCAAAGTGGA 
GTGGTTTTGT 
AAGATGGACC 
GAAAATATAA 
TGTGACCTAA 
GTTGCCAGCT 
AATGACCTCA 
ATAGATATAC 
AATAGCAAAT 
AAATATTTAG 
GTCGGCACTG 
GAGAAGAACA 
TCCAGATTTC 
CTTCCAAGAT 
AATCGTGAAC 
ATGTTGGCTT 
CTCTCTTTGA 
ATATATTTTT 



21 31 41 51 

I 111 I I 

MQAPHKEHLY KLLVIGDLGV GKTSIIKRYV HQNFSSHYRA TIGVDFALKV LHWDPETVVR
LQLWDIAGQE RFGNMTRVYY REAMGAFIVF DVTRPATFEA VAKWKNDLDS KLSLPNGKPV
SVVLLANKCD QGKDVLMNNG LKMDQFCKEH GFVGWFETSA KENINIDEAS RCLVKHILAN
ECDLMESIEP DVVKPHLTST KVASCSGCAK S

Seq ID NO: 256 DNA sequence 
Nucleic Acid Accession #: NM_016321
Coding sequence: 25..1464



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 



51 

1 

ACACCAACCT 
TTCTCTTCGG 
GGACGCACAA 
AGGACGTGCA 
ACGGCTTCAG 
TGCTCATGCA 
ACCTCATCAA 
GTAAAGTCAG 
TGAATGAGTT 
TCCACACATT 
TAGAGCAGAG 
GCACCCTCTT 
ACAGCCAGCA 
CGGTGGCAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 




ATCCAGTGCC CTGCACAAGA AGGGCAAGCT GGACATGGTG CACATCCAGA ATGCCACGCT 
CGCAGGAGGG GTGGCCGTGG GTACCGCTGC TGAGATGATG CTCATGCCTT ACGGTGCCCT 
CATCATCGGC TTCGTCTGCG GCATCATCTC CACCCTGGGT TTTGTATACC TGACCCCATT
CCTGGAGTCC CGGCTGCACA TCCAGGACAC ATGTGGCATT AACAATCTGC ATGGCATTCC 
TGGCATCATA GGCGGCATCG TGGGTGCTGT GACAGCGGCC TCOGCCAGCC TTGAAGTCTA 
TGGAAAAGAA GGGCTTGTCC ATTCCTTTGA CTTTCAAGGT TTCAACGGGG ACTGGACCGC 
AAGAACACAG GGAAAGTTCC AGATTTATGG TCTCTTGGTG ACCCTGGCCA TGGCCCTGAT 
GGGTGGCATC ATTGTGGGGC TCATTTTGAG ATTACCATTC TGGGGACAAC CTTCAGATGA
GAACTGCTTT GAGGATGCGG TCTACTGGGA GATGCCTGAA GGGAACAGCA CTGTCTACAT 
CCCTGAGGAC CCCACCTTCA AGCCCTCAGG ACCCTCAGTA CCCTCAGTAC CCATGGTGTC
CCCACTACCC ATGGCTTCCT CGGTACCCTT GGTACCCTAG GCTCCCAGGG CAGGTGAGGA 
GCAGGCTCCA CAGACTSTCC TGGGGCCCAG AGGAGCTGGT GCTGACCTAG CTAGGGATGC 
AAGAGTGAGC AAGCAGCACC CCCACCTGCT GGCTTGGCCT CAAGGTGCCT CCACCCCTGC 
CCTCCCCTTC ATCCCAGGGG GTCTGMCTGA GAATGGAGAA GGAGAAGCTA CAAAGTGGGC 
ATCCAAGCCG GGTTCTGGCT GCAGAAGTTC TGCCTCTGCC TGGGGTCTTG GCCACATTGG 
AGAAAAACAG GCTCAAAGTG GGGCTGGGAC CTGGTGGGTG AACCTGAGCT CTCCCAGGAG 
ACAACTTAGC TGCCAGTCAC CACCTATGAG GCTCTTCTAC CCCGTGCCTG CACCTCGGCC 
AGCATCTCCT ATGCTCCCTG GGTCCCCCAG ACCTCTCTGT GTTGTGTGCG TGGCAGCCTC 
CAGGAATAAA CATTCTTGTT GTCCTTTGTA AAAAAAAAAA AAAAAAAA 

Seq ID NO: 257 Protein sequence:
Protein Accession #: NP_057405



MAWNTNLRWR 
PSFQDVHVMV 
GVENLINADF 
SMTIHTFGAY 
YHGDSQHRAA 
LMPYGALIIG
SASLEVYGKE 
WGQPSDENCF 



11 
I 

LPLTCLLLQV 
FVGFGFLMTF
CVASVCVAFG
FGLTVTRILY
INTYCSLAAC 
FVCGIISTLG 
GLVHSFDFQG 
EDAVYWEMPE 



21 
I 

IMVILFGVFV 
LQRYGFSAVG 
AVLGKVSPIQ
RRNLEQSKER 
VLTSVAISSA 
FVYLTPFLES
FNGDWTARTQ
GNSTVYIPED 



31 
I 

RYDFEADAHW 
FNFLLAAFGI 
LLIMTFFQVT
QNSVYQSDLF
LHKKGKLDMV 
RLHIQDTCGI 
GKFQIYGLLV 
PTFKPSGPSV 



41 

i 

WSERTHKNLS 
QWALLMQGWF
LFAVNEFILL 
AMIGTLFLWM 
HIQNATLAGG 
NNLHGIPGII
TLAMALMGGI
PSVPMVSPLP 



51 
I 

DMENEFYYRY
HFLQDRYIVV
NLLKVKDAGG
YWPSFNSAIS 
VAVGTAAEMM 
GGIVGAVTAA 
IVGLILRLPF
MASSVPLVP 



Seq ID NO: 258 DNA sequence

Nucleic Acid Accession #: NM_002358.2 

Coding sequence: 75..692



GGGAAGTGCT 
TTGTGTCCCT 
GCGCCGAAAT
GCATATATCC 
CTACTGATCT 
TATACAAGTG 
TCCTGGAAAG 
CCAGAGAAAA 
CAGCTACGGT 
ATACAGACAA 
CCAATTCTGA 
TGGTGGCCTA 
TAATTTTGAA 
TTGGTTAATT 
CCTTTATTTT 
CATTGTTCAA 
GATAGTAACT 
GTTTTGGTCA 
AAAGGAAGTC 
AACAATGAAA 
TTGAATCAGT 
ATATTTGTAC 
TTATAAAATC 
AAAAAAAAAA 



11 
I 

GTTGGAGCCG 
GGCCATGGCG 
CGTGGCCGAG 
ATCTGAAACC 
TGAGCTCATA 
TTCAGTTCAG 
ATGGCAGTTT 
GTCTCAGAAA 
GACATTTCTG 
AGATTTGGTT 
GGAAGTCCGC 
CAAAATTCCT 
ATGTGGTTTT 
TTTACATGGA 
TTTGGTACCT 
AAGGAACCAG 
GTAGATGGAA 
AGTAGTTTGA 
TAAATATTCA 
TATTGCTGTA 
TTCCAATTAT 
TGTTTAATGT 
AAGTTTTAAG 



21 
I 

CTGTGGTTGC 
CTGCAGCTCT 
TTCTTCTCAT 
TTTACTCGAG 
AAATACCTAA 
AAACTGGTTG 
GATATTGAGT 
GCTATCCAGG 
CCACTGTTGG 
GTACCTGAAA 
CTTCGTTCAT 
GTCAATGACT 
CCTGAAATCA 
GAAAACCAAA 
ATTTGACTTA 
GAGGTTTTTT 
AAACTTGTGC 
CTCAGTATAG 
GAATCTTTGT 
TAGCTCCTTT 
TTGACTTTAA 
TCTGTGATAC 
TGAAAGTGAG 



31 

I 

TGTCCGCGGA 
CCCGGGAGCA 
TCGGCATCAA 
TGCAGAAATA 
ATAATGTGGT 
TAGTTATCTC 
GTGACAAGAC 
ATGAAATCCG
AAGTTTCTTG 
AATGGGAAGA 
TTACTACTAC 
GAGGATGACA 
GGTCATCTAT 
ATGATACTTA 
CCATGGAGTT 
TGTCAACATT 
TATAAAGCTA 
GTAGGGAGAT 
TAAGGTCCTG 
TGACCTTCAT 
TTTATGTAAC 
AGAACTCTTA 
GAAATAAAGT 



41 

I 

GTGGAAGCGC 
GGGAATCACC 
CAGCATTTTA 
CGGACTCACC 
GGAACAACTG 
AAATATTGAA 
TGCAAAAGAT 
TTCAGTGATC 
TTCATTTGAT 
GTCGGGACCA 
AATCCACAAA 
TGAGGAAAAT 
AGTTGATATG 
CTGAACTGTG 
AACATCATGA 
GTGATGTATA 
GATGCTTTCC 
ATTTAAGTAT 
AAAGTAACTC 
TTCATGTATA 
TTGAACCTAT 
AAAATGTTTT 
TAAGTTTGTT 



Seq ID NO: 259 Protein sequence: 
Protein Accession #: NP_002349.1



ll 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380
1440 
1500 
1560 
1620 
1680
1740 
1800 
1860 
1920 






51 

I. 

GTGCTTTTGT 

CTGCGCGGGA 

TATCAGCGTG 

TTGCTTGTAA 

AAAGATTGGT 

AGTGGTGAGG 

GACAGTGCAC 

AGACAGATCA 

CTGCTGATTT 

CAGTTTATTA 

GTAAATAGCA 

AATGTAATTG 

TTTTATTTCA 

TGTAATTGTT

ATTTATTGCA 

TTCCTTTGAA 

TAAATCAGAT 

AAAATACAAC 

ATAATCTATA 

GTTTTCCCTA 

GAAGCAATGG 

TTCATGTGTT 

TTAAAAAAAA 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



21 31 41 51 

I I I ! I 

MALQLSREQG ITLRGSAEIV AEFFSFGINS ILYQRGIYPS ETFTRVQKYG LTLLVTTDLE
LIKYLNNVVE QLKDWLYKCS VQKLVVVISN IESGEVLERW QFDIECDKTA KDDSAPREKS
QKAIQDEIRS VIRQITATVT FLPLLEVSCS FDLLIYTDKD LVVPEKWEES GPQFITNSEE
VRLRSFTTTI HKVNSMVAYK IPVND 

Seq ID NO: 260 DNA sequence 
Nucleic Acid Accession #: NM_001211
Coding sequence: 43..3195

1 11 21 31 41 51

| | | | | |
AAAGGCCTGC AGCAGGACGA GGACCTGAGC CAGGAATGCA GGATGGCGGC GGTGAAGAAG 
GAAGGGGGTG CTCTGAGTGA AGCCATGTCC CTGGAGGGAG ATGAATGGGA ACTGAGTAAA 
GAAAATGTAC AACCTTTAAG GCAAGGGCGG ATCATGTCCA CGCTTCAGGG AGCACTGGCA 
CAAGAATCTG CCTGTAACAA TACTCTTCAG CAGCAGAAAC GGGCATTTGA ATATGAAATT 



60 
120 
180



60 
120 
180 
240 




CGATTTTACA CTGGAAATGA CCCTCTGGAT 
CAGAACTATC CTCAAGGTGG GAAAGAGAGT 
GAAGCACTAC AAGGAGAAAA ACGATATTAT 
AAATTAGGGC GTTTATGCAA TGAGCCTTTG 
ATTGGTGTTT CACTTGCTCA GTTCTATATC 
AACTTTAGGA AAGCAGATGC GATATTTCAG 
GAAAGACTAC AGTCCCAGCA CCGACAATTC 
GCACTTGAGA AAGAAGAAGA GGAGGAAGTT 
CTAGCTGAAC TAAAGAGCAA AGGGAAAAAG 
GGTGCTCTCA AGGCTCCAAG CCAGAACAGA 
CAAAATAATA GTAGAATTAC TGTTTTTGAT 
TTGTCTAAGC CTACAGTCCA GCCATGGATA 
GAGCTGCAAG CAGGCCCTTG GAACACAGGC 
ACAGCTTCAC TGATAGCTGT ACCCGCTGTG 
ACTGCACAAC AGCCAGTTAT GACACCATGT 
AGCACCAGAA AGCCTGGAAA GGAAGAAGGA 
CAAGCGTCTG AGGAGAAGAA AGAGAAGATG 
GTAGGGGAAT TCTCCTTTGA AGAAATTCGG 
CAAAGGGAAG CCGAGCTATT GACCAGTGCA 
GAAGAGATGG AGAAGAAGCT AAAAGAAATC 
CAGCAAGAAG AGACGATGCC TACAAAGGAG 
CAGAAAATAC CAGGAATGAC TCTATCCAGT 
GAAACTTCAC TTGCGGAGAA CATTTGGCAG 
CCTTTCTCCA TTTTTGATGA GTTTCTTCTT 
GATCCCCCAC GAGTTTTAGC TCAACGAAGA 
ATCACCTCAA ATGAAGATGT GTCTCCAGAT 
TTGAGCGAGG ATGCCATTAT CACAGGCTTC 
GACACTTGTG ACTTTGCCAG AGCAGCTCGT 
TCCTTGAAGG ATCTCCCTTC TGATCCTGAG 
AAGACCTCTG AGGACCAGCA GACAGCTTGT 
AAGAAGCTGA GCCCAATTAT TGAAGACAGT 
GGTTCTTCTG CCTCGGTTGC AAGCACCTCC 
CTAGAACTTA CTAATGAGAC TTCAGAAAAC 
CGCAGACAGC TACTGAAGTC CCTACCAGAG 
GACAGACCAA TGCCTAAGTT GGAAATTGAG 
TGCATTAAAC GAGAATACCT AATATGTGAA 
AACTCTGCAG AATTAACAGT AATAAAGGTA 
ATCAACCTCA AGTTAAAGGA ACGTTTAAAT 
CAATATCAAG ATGGCTGTAT TGTTTGGCAC 
CTTCTCCAAC ACAGTGAATA TATTACCCAT 
TTGACAATAG TGGAGATGCT ACACAAAGCA 
TGTCTGATTC TCAGAAACAG AATCCACGAT 
TTGAAGATAG TGGACTTTTC CTACAGTGTT 
CTCAGCGGCT TTCGGACTGT ACAGATCCTG 
TCTCCCTACC AGGTAGACCT GTTTGGTATA 
GAACACCTAC AGGTCTTCTG GGATGGGTCC 
CTAAAAGATG GTGAATTGTG GAATAAATTC 
GCCACAGTGT CTGTTCTTGG GGAGCTTGCA 
TTCCAAAGTC ACCTGAACAA AGCCTTATGG 
TTGCTCTTTC AGTGAGCTAG GCAATCAAGT 
GTATTGTGGA ACACTGAAAC TGTATGTGCT 
CTACCATTGC TGTTCTACTT TTTGGTACAG 
ACAGTGATAT ACTTACTCAT GGCCTTGTCT 
CAGACTCATT ACAAATGGTT ACCTTGTTAT 
CTTTTCCCAT TTGTAATTTG TAAAATGTTC 
TAAAATAGTA TCTGTTAAAA AAAAAAAAAA 

Seq ID NO: 261 Protein sequence: 
Protein Accession #: NP_001202






MAAVKKEGGA 
AFEYEIRFYT
FLNLWLKLGR 
QKAEPLERLQ 
PIIRVGGALK 
PRAKENELQA 
SINHILSTRK 
RKKLKEQREA 
QIASESQKIP
NKSPPADPPR 
ICPNPEDTCD 
SQTLSIKKLS 
PWCSQYRRQL 
FWVAPRNSAE
CPTLQDLLQH 
NKNNQALKIV 
HLLLPKEHLQ 
GVPDTTPQSH 



11 
I 



GNDPLDVWDR 
LCNEPLDMYS 
SQHRQFQARV 
APSQNRGLQN 
GPWNTGRSLE 
PGKEEGDPLQ 
ELLTSAEKRA 
GMTLSSSVCQ 
VLAQRRPLAV 
FARAARFVST 
PIIEDSREAT 
LKSLPELSAS 
LTVIKVSSQP 
SEYITHEITV 
DFSYSVDLRV
VFWDGSFWKL 
LNKALKKVGK 



21 

I 

EWELSKENVQ 
YISWTEQNYP 
YLHNQGIGVS 
SRQTLLALEK 
PPPQQMQNNS 
HRPRGNTASL 



GTTTGGGATA 
AATATGTCAA 
AGTGATCCTC 
GATATGTACA 
TCATGGGCAG 
GAAGGGATTC 
CAAGCTCGAG 
TTTGAGTCTT 
ACAGCAAGAG 
GGACTCCAAA 
GAAAATGCTG 
GCACCCCCCA 
AGGTCCTTGG 
CTTCCCAGTT 
AAAATTGAAC 
GATCCTCTAC 
ATGTATTGTA 
GCTGAAGTTT 
GAGAAGAGAG 
CAAACTACTC 
ACAACTAAAC 
TCTGTTTGTC 
GAACAACCTC 
TCAGAAAAGA 
CCCCTTGCAG 
GTTTGTGATG 
AGAAATGTAA 
TTTGTATCCA 
AGACTGTTAC 
GGCACTATCT 
CGTGAAGCCA 
TCCATCAAAT 
CCTACTCAGT 
TTAAGTGCCT 
AAGGAAATTG 
GATTACAAGT 
TCTTCTCAAC 
GAAGATTTTG 
CAATATATAA 
GAAATAACAG 
GAAATAGTCC 
CCCTATGATT 
GACCTTAGGG 
GAAGGACAAA 
GCAGATTTAG 
TTCTGGAAAC 
TTTGTGCGGA 
GCAGAAATGA 
AAGGTAGGGA 
CTCACAGATT 
GTAATTTAAT 
GTATATTTTG 
AACTTTTOTG 
TTAACCCATT 
TCTTATGATC 
AAAAAAAAAA 



PLRQGRIMST 



EMQKQIEEME 
VNCCARETSL 
LKTSESITSN 
PFHEIMSLKD 



AELCIEDRPM 
VPWDFYINLK
LIIYNLLTIV
QLDVFTLSGF 
SQMISELKDG 
LTSPGALLFQ 



LAQFYISWAE 
EEEEEVFESS 
RITVFDENAD 
IAVPAVLPSF 
EKKEKMMYCK 
KKLKEIQTTQ 
AENIWQEQPH 
EDVSPDVCDE 
LPSDPERLLP
SVASTSSIKC 
PKLEIEKEIE 
LKERLNEDFD 
EMLHKAEIVH 
RTVQILEGQK 
ELWNKFFVRI 



GGTATATCAG 
CGTTATTAGA 
GATTTCTCAA 
GTTACTTGCA 
AAGAATATGA 
AACAGAAGGC 
TGTCTCGGCA 
CTGTACCACA 
CTCCAATCAT 
ATCCATTTCC 
ATGAGGCTTC 
TGCCCAGGGC 
AACACAGGCC 
TCACTCCATA 
CTAGTATAAA 
AAAGGGTTCA 
AGGAGAAGAT 
TCCGGAAGAA 
CAGAAATGCA 
AGCAAGAAAG 
TGCAAATTGC 
AAGTAAACTG 
ATTCTAAAGG 
AGAATAAAAG 
TTCTCAAAAC 
AATTTACAGG 
CAATTTGTCC 
CTCCTTTTCA 
CGGAAGAAGA 
ACAGTCAGAC 
CACACTCCTC 
GTCTTCAAAT 
CACCATGGTG 
CTGCAGAGTT 
AATTAGGTAA 
TATTCTGGGT 
CTGTCCCATG 
ATCATTTTTG 
ACTGCTTCAC 
TGTTGATTAT 
ATGGTGACTT 
GTAACAAGAA 
TGCAGCTGGA 
AGATCCTGGC 
CACATTTACT 
TTAGCCAAAA 
TTCTGAATGC 
ATGGGGTTTT 
AGTTAACTAG 
GCTGCCTCAG 
TTAGGACACA 
ACGTCACTGA 
AAGAACTATT 
TGTCTCTACT 
ACCATGTATT 



41 

I 

LQGALAQESA 
LLERAVEALQ 
EYEARENFRK 
VPQRSTLAEL 
EASTAELSKP 
TPYVEETAQQ 
EKIYAGVGEF 
QERTGDQQEE 
SKGPSVPFSI 
FTGIEPLSED
EEDLDVKTSE 
LQIPEKLELT 
LGNEDYCIKR 
HFCSCYQYQD 
GDLSPRCLIL 
ILANCSSPYQ 
LNANDEATVS 



CTGGACAGAG 
AAGAGCTGTA 
TCTCTGGCTT 
CAACCAAGGG 
AGCTAGAGAA 
TGAACCACTA 
AACTCTGTTG 
ACGAAGCACA 
CCGTGTAGGA 
TCAACAGATG 
TACAGCAGAG 
CAAAGAGAAT 
TCGTGGCAAT 
TGTGGAAGAG 
CCACATCCTA 
GAGCCATCAG 
TTATGCAGGA 
ATTAAAAGAG 
GAAACAGATT 
AACAGGTGAT 
TTCCGAGTCT 
TTGTGCCAGA 
TCCCAGTGTA 
TCCTCCTGCA 
CTCAGAAAGC 
AATTGAACCC 
TAACCCAGAA 
TGAGATAATG 
TCTAGATGTA 
TCTCAGCATC 
TGGCTTCTCT 
TCCTGAGAAA 
TTCACAGTAT 
GTGTATAGAA 
TGAGGATTAC 
GGCGCCAAGA 
GGACTTTTAT 
CAGCTGTTAT 
CCTTCAGGAT 
TTATAACCTT 
GAGTCCAAGG 
CAATCAAGCT 
TGTTTTTACC 
TAACTGTTCT 
ATTGTTCAAG 
TATTTCTGAG 
CAATGATGAG 
TGACACTACA 
TCCTGGGGCT 
AGCAATGGTT 
TTTAGATGCA 
TATTTTTTAT 
TTATTCTAAA 
TTTCCCTGTA 
TTGTAAATAA 



51 

1 

CNNTLQOQKR 
GEKRYYSDPR 
ADAIFQEGIQ 
KSKGKKTARA 
TVQPWIAPPM 
PVMTPCKIEP 
SFEEIRAEVF 
TMPTKETTKL 
FDEFLLSEKK 
AIITGFRNVT 
DQQTACGTIY 
NETSENPTQS 
EYLICEDYKL 
GCXVWHQYIN 
RNRIHDPYDC 
VDLFGIADLA 
VLGEIAAEMN 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



WO 02/086443 PCT/US02/12476

Seq ID NO: 262 DNA sequence
Nucleic Acid Accession #: NM_003784
Coding sequence: 365..1507

1 11 21 31 41 51

I I I I I I

GTCTACTTAT CAATAAGCAG CTGCCTGTGC AGAGTGCAGG CTGCACCTTT GGACAGCCTT 60
TAAAACTGAA TTCTCAGAAT TTTAGAACAA ATTTTTGTCT AGAAATGCTG ACTTTGGTTC 120
ATTAGGTAGT GGTAAAACAG GCTCCCTTCG AAGCTCTCCT TCATCACCTT CCTAAGTGCA 180
TGTACAGGGA AGCTCTCCTT CATCACCTTC CTAAGTGCAT GGGGGAAAAT ACCTAGGGCT 240
CAACAGTCTT GAGAAGTGTG GAAACATTTT CTTTGTGAGT GAGAACAGAT CACCTAGAGA 300
AAGGAAACCA GATTCCCATC ACTGCTTCTG GGTATCAGAT GCTAGCGCTG CACTCCATTT 360
TGCAATGGCC TCCCTTGCTG CAGCAAATGC AGAGTTTTGC TTCAACCTGT TCAGAGAGAT 420
GGATGACAAT CAAGGAAATG GAAATGTGTT CTTTTCCTCT CTGAGCCTCT TCGCTGCCCT 480
GGCCCTGGTC CGCTTGGGCG CTCAAGATGA CTCCCTCTCT CAGATTGATA AGTTGCTTCA 540
TGTTAACACT GCCTCAGGAT ATGGAAACTC TTCTAATAGT CAGTCAGGGC TCCAGTCTCA 600
ACTGAAAAGA GTTTTTTCTG ATATAAATGC ATCCCACAAG GATTATGATC TCAGCATTGT 660
GAATGGGCTT TTTGCTGAAA AAGTGTATGG CTTTCATAAG GACTACATTG AGTGTGCCGA 720
AAAATTATAC GATGCCAAAG TGGAGCGAGT TGACTTTACG AATCATTTAG AAGACACTAG 780
ACGTAATATT AATAAGTGGG TTGAAAATGA AACACATGGC AAAATCAAGA ACGTGATTGG 840
TGAAGGTGGC ATAAGCTCAT CTGCTGTAAT GGTGCTGGTG AATGCTGTGT ACTTCAAAGG 900
CAAGTGGCAA TCAGCCTTCA CCAAGAGCGA AACCATAAAT TGCCATTTCA AATCTCCCAA 960
GTGCTCTGGG AAGGCAGTCG CCATGATGCA TCAGGAACGG AAGTTCAATT TGTCTGTTAT 1020
TGAGGACCCA TCAATGAAGA TTCTTGAGCT CAGATACAAT GGTGGCATAA ACATGTACGT 1080
TCTGCTGCCT GAGAATGACC TCTCTGAAAT TGAAAACAAA CTGACCTTTC AGAATCTAAT 1140
GGAATGGACC AATCCAAGGC GAATGACCTC TAAGTATGTT GAGGTATTTT TTCCTCAGTT 1200
CAAGATAGAG AAGAATTATG AAATGAAACA ATATTTGAGA GCCCTAGGGC TGAAAGATAT 1260
CTTTGATGAA TCCAAAGCAG ATCTCTCTGG GATTGCTTCG GGGGGTCGTC TGTATATATC 1320
AAGGATGATG CACAAATCTT ACATAGAGGT CACTGAGGAG GGCACCGAGG CTACTGCTGC 1380
CACAGGAAGT AATATTGTAG AAAAGCAACT CCCTCAGTCC ACGCTGTTTA GAGCTGACCA 1440
CCCATTCCTA TTTGTTATCA GGAAGGATGA CATCATCTTA TTCAGTGGCA AAGTTTCTTG 1500
CCCTTGAAAA TCCAATTGGT TTCTGTTATA GCAGTCCCCA CAACATCAAA GRACCACCAC 1560
AAGTCAATAG ATYTGRGTTT AATTGGAAAA ATGTGGTGTT TCCTTTGAGT TTATTTCTTC 1620
CTAACATTGG TCAGCAGATG ACACTGOTGA CTTGACCCTT CCTAGACACC TGGTTGATTG 1680
TCCTGATCCC TGCTCTTAGC ATTCTACCAC CATGTGTCTC ACCCATTTCT AATTTCATTG 1740
TCTTTCTTCC CACGCTCATT TCTATCATTC TCCCCCATGA CCCGTCTGGA AATTATGGAG 1800
RGTGCTCAAC TGGTAAGGAG AACGTAGAAG TAGCCCTAGG GATCCTTTTT GAAACTCTAC 1860
AGTTATCGCA GATATTCTAG CTTCATTGTA AGCAATCTAG GAAATAAGCC CTGCTGCTTT 1920
CTAGAAATAA GTGTGAAGGA TAAATTTTCT TTGTTGACCT ATGAAGATTT TAGAGTTTAC 1980
CTTCATATGT TTGATTTTAA ATCAGTGTAT AATCTAGATG GTAAAAAATG TGAAATTGGG 2040
ATTAGGGACC TACCAAAATA TTTCATTAAT GCTTTCAATT GACAAATTTT GGCCTTTCTT 2100
TGATAAGACA ATATGTACAT GTTTTTTCAA ATATTAAAGA TCTTTTAACT GTTGGCAGTT 2160
GTTATCTACA GAATCATATT TCATATGCTG TGTAGTTTAT AAGTTTTTCC TCTATTTATC 2220
AGAATAAAGA AATACAACAT ACCTGTAAA

Seq ID NO: 263 Protein sequence:
Protein Accession #: NP_003775

1 11 21 31 41 51

I I I I I I

MASLAAANAE FCFNLFREMD DNQGNGNVFF SSLSLFAALA LVRLGAQDDS LSQIDKLLHV 60
NTASGYGNSS NSQSGLQSQL KRVFSDINAS HKDYDLSIVN GLFAEKVYGF HKDYIECAEK 120
LYDAKVERVD FTNHLEDTRR NINKWVENET HGXIKNVIGE GGISSSAVMV LVNAVYFKGK 180
WQSAFTKSET INCHFKSPKC SGKAVAMMHQ ERKFNLSVIE DPSMKILELR YNGGINMYVL 240
LPENDLSEIE NKLTFQNLME WTNPRRMTSK YVEVFFPQFK IEKNYEMKQY LHALGLKDIF 300
DESKADLSGI ASGGRLYISR MMHKSYIEVT EEGTEATAAT GSNIVEKQLP QSTLFRADHP 360
FLPVIRKDDI ILPSGKVSCP

Seq ID NO: 264 DNA sequence
Nucleic Acid Accession #: AB052906
Coding sequence: 74-814

1 11 21 31 41 51

I I I I I I

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60
CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120
GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180
CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240
GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300
CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360
GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420
GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480
TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540
AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600
GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660
CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TCGCCATGTC 720
CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780
CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840
AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900
CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGA7CAT GATGACATCA 960
TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020
TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080
TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140
GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200
TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260
ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA A

Seq ID NO: 265 Protein sequence:
Protein Accession #: BAB61048.1



1 11 21 31 41 51

I I I I I I

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 60
FLHYDCGNKT VTPVSPLGKX LNVTTAWKAQ NPVLREWDI LTEQLRDIQL ENYTPKEPLT 120
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL PDSBKRMWTT VHPGARKMKE KWENDKWAM 180
SPHYFSMGDC IGWLEDFLMG MDSTLBPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240
FILPGI

Seq ID NO: 266 DNA sequence
Nucleic Acid Accession #: XM_084853.1
Coding sequence: 127-444

1 11 21 31 41 51

I I I I I I

ATTGATGATA TATTTAACGA AATCAAATTT GGTGAATATG TGGACACTGG AAAGCTAATC 60
GACAAGATCA ACTTACCAGA TTTCCTAAAA GTGTACCTTA ACCACAAGCC ACCTTTTGGT 120
AACACCATGA GTGGCATCCA CAAGAGCTTT GAGGTGCTCG GTTATACCAA CTCCAAAGGG 180
AAAAAGGCCA TTCGAAGAGA GGACTTCCTG AGACTGCTCG TTACTAAAGG TGAGCATATG 240
ACGGAGGAGG AGATGTTGGA TTGCTTTGCT TCACTGTTTG GCCTGAATCC CGAGGGATGG 300
AAATCOGAGC CTGCAACCTG CTCCGTCAAA GGTTCAGAAA TTTGCCTTGA AGAAGAACTT 360
CCAGACGAAA TCACTGCAGA AATATTCGCG ACTGAAATTC TTGGCTTAAC CATTTCAGAA 420
GATTCCGGCC AGGATGGTCA GTGAAGTTAC CAGGAATGTT TAAAGCACAA AGGACTTTGG 480
GTGTGTGTGC ATGCACATGT GTGTGTTTTC CATGAGGCAC TGCTTTTTAT GCATTTCCCT 540
CCCCCCTCTC ATCTTTAGAA CATTTAGACA TTAAAGCAAG TTTCTGGTGA GCAATG



Seq ID NO: 267 Protein sequence:
Protein Accession #: XP_084853.1

1 11 21 31 41 51

I I I I I I

MSGIHKSFEV LGYTNSKGKK AIRREDFLRL LVTKGEHMTE EEMLDCPASL PGLNPEGWKS 60
EPATCSVKGS EICLEEELPD EITAEIPATE ILGLTISEDS GQDGQ

Seq ID NO: 268 DNA sequence
Nucleic Acid Accession #: NM_001898
Coding sequence: 57-482

1 11 21 31 41 51

| | | | | |

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60
CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120
GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180
AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240
ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300
ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360
ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420
TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480
AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540
CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600
GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720
AAACAGTAGC ATCGCC

Seq ID NO: 269 Protein sequence:
Protein Accession #: NP_001889.1

1 11 21 31 41 51

I I I I I I

MAQYLSTLLL LIATLAVAIA WSPKEEDRII PGGIYNADLN DEWVQRALHP AISEYNKATK 60
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKXQLCSP 120
EIYEVPWENR RSLVKSRCQE S

Seq ID NO: 270 DNA sequence
Nucleic Acid Accession #: XM_093210
Coding sequence: 13-1854

1 11 21 31 41 51

I I I I I I

ATGGCAAGCG CCGGAATCTC CTCAGCTGCC GTTTCACAAA AGAGGTACCA GGTCCGCACC 60
AAACGAGCAC ACAAGCAGCA CCAGGAGCTG CAGAAGAAGG AGGOGGCAGC GATGGACCAG 120
GGCAGAGGGA ATGGGGAGGG GGCATCCTAC CCCATATCTG AGGTGCGACT GCGGGACGTA 180
GAGCGGACTG GGCCTTTCCC GTTGGCGCGT GGCCTCAATC AGGACTTCTT GCCCACGTGC 240
GCCTTCAAAA CGGTAAGAGC TGCAACTGAA CGTGTGAGAC ATGGTGCAGA TAGGCTGAGA 300
GGCGGCGGGA GAGATGCCCA TGAACTCAAG TACCCGGACA CGCCCTCCAC TTCTACCACC 360
ACGAGTAACA CCGCCCCCAC GGGACCGCTC TCGAGGTCCC CCAAGCCAAG GACGCAAGGA 420
GGAACGCCCC GGCGCGCGGC CAGCAGCGGC GGGCACCGGC CCAATGGCCA CGGAACTCAG 480
CACTGGCAGT CGGCCCTCCT CACACCGCAG GCGTGCAGTG TGGCCGACGG AGCCTCCCGG 540
GCCGAGGACC CAGCTAGGCC GTCACCCCGG TTGCTCCCAC GGGAAGGGGC ACCAGGCAAA 600
CTGCCCAAGG CCCCGAGCCC AGGCTCCCTG GCGGAGGCCT CCGCTGGTCC CGCCCAGATC 660
ATGGCCGCCA CCAGGCTCCC GAGCCATGGC TTCCTGTCCG GGAACGGCCC GGCGTCCTGG 720
CTGTCCAGCT AG

Seq ID NO: 271 Protein sequence:
Protein Accession #: XP_093210

1 11 21 31 41 51



I I I I I I

MLRHGEQKRK RARKKWDFLP TCAFKTVRAA TERVRHGADR LRGGGRDAHE LKYPDTPSTS 60
TTTSNTAPTG PLSRSPKPRT QGGTPRRRPA AAGTRANGHG TQHWQSALLT PQACSVADGA 120
SRAEDPARPS PRLLPREGAP GKLPKAPSPG SLAEASAGLL AHVRLQNADA QRVSISQALP 180
PKSSVGRKEE RPGAGQQRRA PAPMATELST GSRPSSHRRR AVWPTEPPGP RTQLEPSPRL 240
LPREGAPGKL PKAPSPGSLA EASAGPAQIM AATRLPSRGF LSGNGPASWL SS

Seq ID NO: 272 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..732

1 11 21 31 41 51

I I I I I I

GGATACTGTG TCACTCAAAG TAATGGGAGG GAGAGAGAAC AGGGAGGGTA GGGATGCTTT 60
TGAAAAAGCT TTTTTTCCCA CTTTTAACTT GCTTTAGCGT TAAGAGTACT TACCAGCTAA 120
TAATGTGGAG GAAATTATTC TTTCTCATTG GAGATTACAG AATATATCTA TTCATCTTGA 180
ATACCCACTT GAAGCCTCTG TAGAAATGTC TCGTCCTCCG GTTGTATTTC TAAAACCTAC 240
ATGATTTTGT CTTGTTTCTG CAGTGAGAAA TTACATCCAT AGCAAAGACA AAAGTCTTTT 300
TAAATTATTT TTATTTATCT TTCATATAGT TCTTACAATT TCTAAAAAAT TAACACTCAT 360
TTAGTATCAC AATTTATGGG AGAGGGTTTT TTGTATTTTT AAGCATATGT GGCTTATATA 420
AAAATTGCAG AAGTCATAGG ACTGTCATGT ATTGCAGCTC TGAGAACCAA TGCCTGAAAC 480
TTAAGCC

Seq ID NO: 273 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MGGRENREGR DAFEKAFFPT FNLL

Seq ID NO: 274 DNA sequence
Nucleic Acid Accession #: NM_003976.2
Coding sequence: 299-961

1 11 21 31 41 51

I I I I I I

CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60
CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120
CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180
GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240
CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300
GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360
TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420
GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480
CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540
GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600
CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660
GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720
CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780
CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840
GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900
CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960
AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020
CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080
AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140
CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200
GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260
CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC CCCGCCCAGG CCCTGTAGGG 1320
ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 1380
CCTGTACTCA CTCATGGGAG CTGGCCCC

Seq ID NO: 275 Protein sequence:
Protein Accession #: NP_003967.1

1 11 21 31 41 51

I I I I I I

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSPMD VNSTWRTVDR LSATACGCLG

Seq ID NO: 276 DNA sequence
Nucleic Acid Accession #: NM_057091.1
Coding sequence: 783-1445

1 11 21 31 41 51

I I I I I I

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60
GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA OGCAGGGACC GGCTTACCCC 120
TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180
CGCGTGTCTA CAAACTCAAC TCCCGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240
CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTOGG GGGACCTAGC 300
CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTGAGCCCCG AAAGGTGGGG 360
CGGGGCAGGG GCGCTCCCAG CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420
CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480
CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540



CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600
TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TC6GGGQAGA 660
GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGGAGGACC CCGTGCTGCC 720
TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780
AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840
AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900
CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960
CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020
GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080
CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140
CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200
GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260
CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320
CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 1380
ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440
GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500
GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGAGGAAG GCCTCAAAGC 1560
TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620
CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680
CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740
GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800
AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC

Seq ID NO: 277 Protein sequence:
Protein Accession #: NP_003967.1

1 11 21 31 41 51

I I I I I I

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG

Seq ID NO: 278 DNA sequence
Nucleic Acid Accession #: NM_057160.1
Coding sequence: 1-714

1 11 21 31 41 51

I I I I I I

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60
CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120
TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180
GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGCC 240
CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300
CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360
CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420
CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480
CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540
AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600
AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660
ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720
GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780
CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840
TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900
AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960
TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020
CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080
TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140
TCACTCATGG GAGCTGGCCC C

Seq ID NO: 279 Protein sequence:
Protein Accession #: NP_476501.1

1 11 21 31 41 51

I I I I I I

MPGLISARGQ PLLEVLPPQA HLGALPLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120
RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS THRTVDRLSA TACGCLG

Seq ID NO: 280 DNA sequence
Nucleic Acid Accession #: NM_057090.1
Coding sequence: 29-715

1 11 21 31 41 51

I I I I I I

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60
GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120
GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180
CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240
CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300
GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360
CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420
CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480
GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540
CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600
CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660



CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720
CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780
GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840
CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900
CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960
CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020
TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080
TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140
CTCACTCATG GGAGCTGGCC CC

Seq ID NO: 281 Protein sequence:
Protein Accession #: NP_476431.1

1 11 21 31 41 51

I I I I I I

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 60
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPKD LSLASLLGAG 180
ALRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG

Seq ID NO: 282 DNA sequence
Nucleic Acid Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

CTACTGCACC TGCCCTCTGT TTCCTTTGGA AATCTCTTAC CTTTCATTAG GGTTTCTTTC 60
ATAGCAATTT CCTTTGGTTT TTAAGACTTC TACATTGCTT TTTCTTTTAT TATCTGTGCT 120
CCGTGAACCT TATGAATGCT GCTTAAAAAT AATGTCAAAA TATGTTTTAG CTGCCTACTC 180
AGGTAACGTT TTCTTTTGCT CTCATCTTGG TTTCCATATA CTATTTTTGG TTTTTTGTGA 240
GATCTAATCA ATGATCTAGT CAGAAGCTAC TTCACTGGCT AACAGTGATC ATGTTCATGT 300
GCTAAAAATG AACTTGAAAC ACGGAAGTAG TGGTTGGTCC AGTTTGAAAG CTCTTATTAG 360
TATTCTTCAT CCTGGCTGTA ATAATAGCCA TTATTTGTTA TGCCTTTGTT ATGTAGCAGA 420
CACTCTTAAG GATTTTATGT GTATTATTCA AATTGCTATT ACTGTTCTTT TTATAGTTGA 480
GAATCTCAGG ATACCTACAT TTATCACTTT TTCAATATAT ATGTATTTCT TATT

Seq ID NO: 283 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 564-1481

1 11 21 31 41 51

I I I I I I

GAGACTTTTA ATCATCTATC CCTTGTGCTT TACGCAGACC CTACAATACA CTAGAGGCTT 60
CAAAGAGGTC AAAAATTCAC ATGTGTAGAC AAATTAGGTC CCTTAAGATG CCAGGCAAAC 120
GAAGTGCTAC CAAAACACGC AATGACTGTC CTAAAAGTGC GTTCTGGGAT ACACCTGTAA 180
ACTTGGATCA AGTTCCCTCC CCTCTCCTCA AAATATATCG ACTTGTGCTG AAAGAAATCA 240
CGACCGATGC TCACAATTCT GACCTCGTAA TTATATAGGG GGTGGTTTTG GTTTCTGCGT 300
CTTTCCCTGA TTCAGTGGCA GGTAACATAT TTCATGTACA AAATGAACTG CAACACCACG 360
GCAAACAAGG GACAGGCCCT CAAAGTTGTC GGTAGGGAGC CAGGACCCCG CCAGTGGCGT 420
GGGGAGACAC CGTACTAAAC AAGCTTGCAA ACAGCAGGCA CCTTCCTGCC ACTGAGGAGG 480
AAGGGCTGGC TAAGGGAGGC CGGGGCGGAG GAAGCCAAGC TCTGCAGGCC CTGACAAAGT 540
CCTCCCGGCC TCCACGCGTC GCCATGGCAA CGCGGGGTCT GTGCTGGCCG GGATTGGCCG 600
GCCTGGCGCG CGCAGGGCCC GCTGGGAAAG CGCGTCCCCG CCGCGGCTCC GCCAG1TTGA 660
ACTTGGCGGG CCAGATGTGG GCGGCGGGGC GCTGGGGGCC TACTTTTCCC TCTTCCTACG 720
CCGGTTTCTC TGCTGACTGC AGACCCAGGT CTCGGCCCTC CTCGGACTCC TGCTCAGTCC 780
CTATGACGGG CGCACGTGGG CAGGGGCTGG AGGTGGTGCG CTCGCCGTCG CCGCCGCTGC 840
CGCTGAGCTG CAGCAATTCC ACCAGGTCGC TGTTGTCTCC CCTTGGCCAC CAGAGCTTCC 900
AGTTTGACGA GGACGACGGT GACGGGGAGG ATGAGGAAGA CGTGGATGAT GAGGAAGACG 960
TGGATGAAGA TGCCCATGAT TCAGAGGCCA AAGTGGCGAG CCTGAGAGGA ATGGAGTTAC 1020
AGGGGTGCGC CAGCACTCAG GTTGAATCAG AAAATAACCA AGAAGAACAG AAACAGGTGC 1080
GCTTACCAGA AAGCCGCCTG ACACCATGGG AGGTGTGGTT TATTGGCAAA GAAAAAGAAG 1140
AACGTGACCG GCTGCAACTG AAAGCTCTAG AGGAATTAAA TCAACAACTA GAAAAAAGAA 1200
AAGAAATGGA AGAACGTGAA AAAAGAAAGA TAATTGCTGA AGAAAAGCAC AAGGAATGGG 1260
TTCAGAAAAA GAATGAGCAA AAAAGAAAAG AAAGAGAACA AAAAATTAAT AAAGAAATGG 1320
AGGAAAAAGC AGCAAAGGAA CTGGAGAAAG AATACTTGCA AGAAAAAGCA AAAGAAAAAT 1380
ATCAAGAATG GTTAAAGAAA AAAAATGCTG AAGAATGTGA GAGGAAGAAG AAAGAAAAGA 1440
AAAACAACAG CAAGCTGAAA TACAGGAGAA AAAGGAAATA GCAGAAAAAA AGTTTCAAGA 1500
ATGGTTGGAA AATGCGAAAC ATAAACCTCG TCCAGCTGCA AAGAGCTATG GTTATX3CCAA 1560
TGGAAAACTT ACAGGTTTTT ACAGTGGAAA TTCCTATCCA GAACCAGCCT TTTATAATCC 1620
AATTCCGTGG AAACCAATTC ATATGCCACC TCCCAAAGAA GCTAAGGATC TATCAGGAAG 1680
GAAGAGTAAA AGACCTGTGA TAAGTCAGCC ACACAAGTCA TCATCTCTGG TAATTCATAA 1740
AGCCAGGAGC AATCTTTGCC TTGGAACTCT GTGCAGAATA CAAAGATAGC GTATGTGGAA 1800
AATAACATGC TTTTATCTGG AGCTATTTAA TTTAAAAATC AGAAATTGTT TTTTACTGCT 1860
CAGTCAATAA CTCAACACTT AATGTGATTA TTGACAAATA GCAATTTTTG CATTTGTATA 1920
TGGAGTCCTT AGAGTTGAGG AAGATATTTT CTGGATTTTG GTTTTTATAA ACTTTTTAAG 1980
GTTGATCTTG GCATGTTGTT TTGCAGAATA AGTGGCTGAA TATGTAAGAA TTGTGTTTGT 2040
ATTTAGCTTG TATTAAAAGT ACACTGTAAT ACCAATAAAA CTAACAATTT TTCTTG

Seq ID NO: 284 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MATRGLCWPG LAGLARAGPA GKARPRRGSA SLNLAGQMWA AGRWGPTFPS SYAGFSADCR 60
PRSRPSSDSC SVPMTGARGQ GLEWRSPSP PLPLSCSNST RSLLSPLGHQ SFQFDEDDGD 120
GEDEEDVDDE EDVDBDAHDS EAKVASLRGM ELQGCASTQV ESENNQEEQK QVRLPESRLT 180
PWEVWPIGKE KEERDRLQLK ALEELNQQLE KRKEMEEREK RKIIAEEKHK EWVQKKNEQK 240
RKEREQKINK EMEEKAAKEL EKEYLQEKAK EKYQEWLKKK NAEECERKKK EKKNNSKLKY 300




RRKRX 

Seq ID NO: 285 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1-1746 






ATGCCACTGA 
GCCTACCATG 
GGGGCACGCA 
CTCAACACGC 
GCCCTGAGGA 
GGCTCGCTGC 
TTCCAGGGCC 
CAOCCGQCCC 
CTGGAATACA 
GGCAAGAATA 
GTCCTCCGGC 
GTTAACCTGC 
TTCCACAACA 
CCACCCAGCA 
CTGAAGGAGC 
TATGACAACC 
GTCCTGATTC 
ACGGAGCTTC 
TTCCGCATGT 
CCAGGGAATA 
CTGGAGAACT 
TATGACAATC 
AACCAGCCTA 
GGCCAQTCCC 
GTGCCTAQTT 
TCCGTCTCTT 
ATTCAGGTCA 
ATTGCCGCCA 
TGTTGCTGCT 
TGTTAAAGAG 
TCATCTTTCT 
CTAGATAAAO 
CGTGCCGGAC 
GGATTTCCGA 
ACCTGTCCTC 
A6TTAGTCCA 
CCTAAGTATT 
ACCCAGCATG 
TGAGTTCTCT 
CAGCCTGGTT 
AGACAGAAGA 
CCCCAGCACA 
TGAAAAGTTT 
AAAATCAGCT 
CACCCCTAGA 
GTGGGAACAT 
CAGCATCTAG 
GGAGATGGGG 
TCCCCCCACA 
TGTGGACAGG 
AGAGACCCTG 
GTCOGTGCAG 
TCCGCCTGGA 
ACTTAGGGGA 
GAATCTAGTG 
TGAACTTCAG 
GTCTGGGGGC 
TCCAGGGTTA 
CTGCTATACA 
CTCTGGACAA 
AGGAAAGAAC 
TCTTATTAGC 
TCGGCTCTTA 
CCCGATCGGC 
CAGGAGCACG 
TGTTTGCAAA 
AGATGAGGCC 
ACTATTGGTG 
CCAGAGCATG 
GGGGCATCCC 
GGTGCTCCTG 
GAGGGAGGTG 
GGTATTCCTG 
GAGGGCCACT 
CCTTATGTGA 
GCCTCCTTTT 
TACTAGAAAA 
GGGCTGGAAT 
CCTCTCTGTT 



11 
I 

AGCATTATCT 
GCTGCCCTAG 
TTGTGGCGGT 
ACATCACTGA 
TTGAGAAGAA 
GCTATCTCAG 
TGGACAGCCT 
ACTTCTCCCA 
TCCCTGACGG 
GCCTCACCCA 
TGTATGAGAA 
AGGAACTGGC 
ACCACAACCT 
TCTTCATGCA 
TCTCTCTGGG 
ACATCTCTTC 
TTAGCCGCAA 
GGGAGCTGTC 
TGGCCAACCT 
TCTTCGCCAA 
TGCCCCTCGG 
CCTGGAGGTG 
GGTTAGGGAC 
TCATTATCAT 
ACCCAGAAAC 
CTACCACTGA 
CTGATGACCG 
TTGTAATTGG 
GCAAGAAGAG 
GCAGGCTGGA 
GCCTCCACCC 
GTGTGCCTAC 
CTTCCTACAA 
TTCATACCCC 
CAAGAACAGC 
CAGCCTGCTC 
ATGTAAGTTG 
TCCCCTCAAA 
CCTCAAAGAA 
TTGGGGATGC 
GCCGTCATCA 
GCAAGCTCAG 
AGCCCTTTAA 
TATTAATACG 
GTTTGTTTTA 
GATAGTGTAT 
ACCCAGACCC 
GCTTCTGAAG 
GTCAGCCTGT 
ATGGGAGACT 
AGACCTGGGG 
CCACACCCTC 
GCCTTCTATG 
AGTGAAATCG 
TCTTTCTAAT 
AATCTCACTT 
TCCCTGGAGC 
TTCTCCTCCT 
CATATTCACA 
CTGGCCCAGT 
TTCAGCTGAC 
TCCCCGCTCC 
TTAGCTCCCC 
TCTTATTAGC 
TGCTGACCAG 
CACTAGTGCA 
CGTCAGAGTC 
GCACCTGGAG 
GCACATGAGC 
GGCCCGTACC 
TGAGTGGCCT 
GGAAACCTCA 
GCAGTAGCCA 
GTCCTCAGAT 
ACCTCTTGCC 
CTTCAGCGGG 
GCTGAGTGGA 
GAGCCGGCTG 
TACAGCTCCT 



21 
I 

CCTTTTGCTG 
CGAGTGTACC 
GCCCACCCCT 
ACTCAATGAG 
TGAGCTGTCG 
CCTCGCCAAC 
TGAGTCTCTC 
GTGCAGCAAC 
AGCCTTCGAC 
CATCTCACCC 
CAGGCTCACG 
TCTACAGCAG 
CCAGAGACTC 
GCTGCCCCAG 
GATCTTCGGG 
TCTACCCGAC 
TCAGATCAGC 
CCTCCACACC 
GCAGAACATC 
CGTCAATGGC 
CATCTTOGAT 
TGACTCAGAC 
GGACACTGTA 
CAATGTCAAC 
ACCATGGTAC 
GCTAACCAGC 
CAGCGTTTGG 
CATTGTCGCC 
GAGCCAAGCT 
GCAGGGCTGG 
CTGGGTCCAT 
CTCTTCCTGA 
TCAGGAAGAT 
TGGGCTTCCT 
CTTCCCTGCG 
ACTTCGTGGG 
ATTTCCCTTC 
TGAAAGTTCT 
GACTTCAAAC 
TATGAAAGAG 
GTGTCTCACT 
CCTTTTAGAG 
GGAATGAAAT 
GGATAGAGAA 
AAATTTTTAA 
GGCTTGGTGG 
AGAGCATCAC 
ATGGACTTAC 
GCAAAGGCCC 
GTGGCCTGAA 
CACCATGGCT 
TTCCCTGCCA 
GACGTGATAT 
CTCAGAGATG 
GTGGTAAAAT 
ACAGCAGGCG 
TCCTCCTGCG 
CGAGTCACAG 
TGGCGCTCAA 
TTACAGTGAA 
TCCACGGGGA 
ACAAGACACC 
GCTCCACAAG 

TTTTCCCTTC 
CTTTGTAGCT 
AAGAGATGTC 
GACATGCACC 
ATCACCCGCT 
CCTCCAGACA 
CCAGATGTCT 
TCATCCGGTG 
TGACATTGGA 
GACACCACCC 
TCTTCCTTTC 
CCCTTCAACC 
GTCTCCTTTC 
GTCCCCCAGA 
TGACAGTCCC 



31 
I 

GTGGGCTGCC 
TGCTCCAGGG 
CTGCCCTGGA 
TCCCCGTTCC 
CGCATCACGC 
AACAAGCTGC 
CTTCTGTCCA 
CTCAAGGAGC 
CACCTGGTAG 
AGGGTCTTCC 
GATATCCCCA 
AACCAGATTG 
TACCTGTCCA 
CTCAACCGTC 
CCCATGCCCA 
AATGTCTTCA 
TTCATCTCCC 
AACGCACTGC 
TCCCTGCAGA 
CTCATGGCCA 
CACCTGGGGA 
ATCCTTCCGC 
CCTGTGTGTT 
GTTGCTGTTC 
CCAGACACAC 
CCTGTGGAAG 
GGCATGACCC 
CTGGCCTGCT 
GTCCTGATGC 
GGAATGATGG 
GGAGCTTTCC 
CTTGCCTGAT 
AGATCCAACT 
TCGAGAGGGC 
CCCAGGCCCC 
AATAGTTCTC 
TTTTGTTTCT 
CCCCTTGATT 
CATTTAACTG 
AGAAGGAAAA 
TGTGATTTTT 
AAGGATATTT 
CATGTAGAAT 
AGAAATCTGG 
TTGAA GCATG 
ATTTTCACAA 
AAATATCCCC 
CTGGGACCTG 
CGTGGCCAGG 
CAGGAGATTT 
GGCCAGGTCA 
GCAAGTTGTC 
GCCTGTATCT 
AGATCCTTTA 
TCTCCATCAA 
ACACGGGGGT 
TGTGGTCTGG 
TCACACGAAT 
GAAGTTAGGC 
ATGGAGAATT 
TCTGGAAATC 
TOTGCTTTQG 
ACACCTGTGA 
ACAAGACACC 
CAGTTCCTGC 
TTTCACCCTC 
ATCCCCCCAG 
AAGGCTTGCC 
GATGGTGGCC 
GGAAGCATGG 
TTGTGCATAG 
GGCCCTGCCA 
GCACCTTCCT 
AGGAGCACCC 
TCCCATCAGA 
TCTCTGCACC 
CAACAGGATG 
AAGCTGGAGT 
ACGCCCATCT 



41 
I 

AAGCCTGGGG 
CCTCCCAGGT 
ACGCCATGAG 
TCAATATCTC 
CTGGGGCCTT 
AGGTTCTGCC 
GTAACCAGCT 
TGCAGTTGCA 
GACTCACGAA 
AGCACCTGGG 
TGGGCACTTT 
GACTGCTCTC 
ACAACCACAT 
TTACTCTCTT 
ACCTGCGGGA 
GCAACCTCCG 
CGGGTGCCTT 
AGGACCTGGA 
ACAATCGCCT 
TCCAGCTGCA 
AACTGTGTGA 
TCCGCAACTG 
TCAGCCCAGC 
CAAGCGTCCA 
CCAGTTACCC 
ACTACACTGA 
AGGCCCAGAG 
CCCTGGCTGC 
AGATGAAGGC 
GACTGGAGGA 
CGTGATTGCT 
TCTCCOGTAG 
GGCCATGGCA 
TCTTCCTCCA 
CTCCGGGCCT 
CGCTGAGATA 
CTTGTTTGTG 
TTCTGCTCCT 
GTTTCTTAAG 
TCATGCCGCT 
ATCTGGAAAA 
CCAAACTGCA 
TTTGGACTTC 
TGCCTGGGGG 
TGAAGTGTAC 
ACTGAACATA 
CATCCTGGGC 
CCCCCCATGA 
GGTGGAGGAG 
TATTATATCT 
GAAGCATCCT 
TGCGGCTCAT 
GTTTTTAATT 
ATTGAAAACG 
CATCACAGTC 
ACACCGATGG 
TTAGGAGTTG 
ACCTGCCTTC 
TCATGGCAAC 
TCAGGTCTCC 
CACGACCAAT 
AAATCCACCA 
TCTGGAAATC 
TGTGACATCC 
ACAAAAAGTG 
TGTCCCAGGG 
GGTCTCCAAG 
AGAGCCAACA 
TGCTGTGCCT 
GTTTGCCCAC 
GCACAAGTGG 
ATCTTAACCC 
CTCCAGCCAG 
TAGGTGAGGG 
GTGGTTGGAT 
ATGTTGTCTG 
ATGCATTTGC 
GGGGTACAGA 
GGAGTGGGAG 



51 
I 

TGCAGGGTTG 
GGAGTGCACC 
CCTGCAGATC 
AGCCCTCATC 
CCGAAACCTG 
CATCGGCCTC 
GTTGCAGATC 
CGGCAACCAC 
GCTCAATCTG 
CAATCTCCAG 
TGATGGGCTT 
CCCTGGTCTC 
CTCCCAGCTG 
TGGGAATTCC 
GCTTTGGCTC 
CCAGTTGCAG 
CAACGGGCTA 
CGGGAATGTC 
CAGACAGCTC 
GAACAACCAG 
GCTGCGGCTG 
GCTCCTGCTC 
CAATGTCCGA 
TGTCCCTGAG 
TGACACCACA 
TCTGACTACC 
CGGGCTGGCC 
CTGCGTCGGC 
ACCCAATGAG 
CCTGGGAATT 
CTTTCTGGCC 
AGAAGCAGGT 
AAAGCCCTGG 
AATCCTCCCC 
CTGTAGACTC 
GCCCCTCTCG 
CTATGGCTTG 
GAAGGCAGGG 
AGCCGTCAAT 
CAGTTCCTGG 
GGAAGAAACA 
AACTTTGCTT 
TAAAAACATT 
TCCCTGTGTT 
STGCAGAAAA 
CCTGTGTAAT 
TTTTCCCAGA 
GCCAGGACGG 
AATATGTGGG 
GGAGACCCTG 
GACTGCAGAG 
CGGAGGCCCC 
TTCATTCTTC 
AAGTGTAACG 
AGCTGGCAGC 
GTCACACTGG 
AGTTGTTTGC 
TCTGGCTTTC 
GTGTGTCTT T 
ACGTCTGCCC 
CCCGATCGGC 
CCAATCCCGA 
TACCACCAAT 
TCCAGGGCCA 
TCCAGAGGGC 
AATCTAGGAG 
GCATTTCCAC 
GGAAGTGAGC 
GGTGCCAACA 
AGACCTGTCG 
GCCAGGGCTG 
AGAACCCTTA 
AGGCTGACCT 
GTGAGGGCCC 
GGAGCCATTG 
GCTGAGGAGC 
TCAATT CTCA 
GTTCAGTTTT 
CTGGGAGTTA 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 

GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA 
TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT 
GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG 
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC 
TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG 
TTAGTCTTGG TCATCAGAAC CTCACTT3GT ACCATATAGA 
GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG 
TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA 
AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT 
GAACTTCCAA ACTCAGGAAG TTTGCAGAGA GCAGACAGCT 
AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC 
GCCCCCAGAT CCCACAGTCA GAACTGAATC TG OSTTSTT G 
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG 
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT 
CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT 
GCCCCAGTGC TTGGCGATGC ATTTACAGAT TTCTAGGCCC 
AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT CTTCTGCTGG 
GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT 

Seq ID NO: 286 Protein sequence: 
Protein Accession #: NP_570843.1






TTTTTAAAAA 
CTAGCCCTCA 
AGGCACTTGC 
TTTGTACAAG 
CCATTCCCCT 
TCAAAAGCTT 
TTTGTCCCTC 
GATACAAGAG 
GATGTAGAGC 
AGAGATAACT 
ATAAACCACT 
GGAAGCCAGC 
CAAGCCACTT 
TGTTCTCACC 
TAGAGAATTA 
TCAGGGTTTT 
ATGCTGCTTG 



GTGCTTACTG 
GCACCCCTGC 
TCTTCTGCAT 
AGCTCATGGC 
GTTGCTCTCC 
TGTAACCACA 
TCATGGGAAT 
TTCTACTTAG 
TATTGGGAAA 
CGGGACCCAG 
CAAAGATTCA 
AGTGGCCTTG 
CCGGGGAAAA 
AACCCGCTGC 
CTGCAAATCA 
GTAGAGTGTG 
TAATCCATTT 



4800
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 



1 11 21 31 41 51
| | | | | |
MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60
LNTHITELNE SPFLNISALI ALRIEKNELS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120
FQGLDSLESL LLSSNQltLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180
GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGb 240
FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300
YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360
FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420
YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480
VPSYPETPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540
IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C



Seq ID NO: 287 DNA sequence 
Nucleic Acid Accession #: NM_002362
Coding sequence: 1..954 



1 11 21 31 41 51
| | | | | |
ATGTCTTCTG AGCAGAAGAG TCAGCACTGC AAGCCTGAGG AAGGCGTTGA GGCCCAAGAA 60
GAGGCCCTGG GCCTGGTGGG TGCACAGGCT CCTACTACTG AGGAGCAGGA GGCTGCTGTC 120
TCCTCCTCCT CTCCTCTGGT CCCTGGCACC CTGGAGGAAG TGCCTGCTGC TGAGTCAGCA 180
GGTCCTCCCC AGAGTCCTCA GGGAGCCTCT GCCTTACCCA CTACCATCAG CTTCACTTGC 240
TGGAGGCAAC CCAATGAGGG TTCCAGCAGC CAAGAAGAGG AGGGGCCAAG CACCTCGCCT 300
GACGCAGAGT CCTTGTTCCG AGAAGCACTC AGTAACAAGG TGGATGAGTT GGCTCATTTT 360
CTGCTCCGCA AGTATCGAGC CAAGGAGCTG GTCACAAAGG CAGAAATGCT GGAGAGAGTC 420
ATCAAAAATT ACAAGCGCTG CTTTCCTGTG ATCTTCGGCA AAGCCTCCGA GTCCCTGAAG 480
ATGATCTTTG GCATTGACGT GAAGGAAGTG GACCCCGCCA GCAACACCTA CACCCTTGTC 540
ACCTGCCTGG GCCTTTCCTA TGATGGCCTG CTGGGTAATA ATCAGATCTT TCCCAAGACA 600
GGCCTTCTGA TAATCGTCCT GGGCACAATT GCAATGGAGG GOGACAGCGC CTCTGAGGAG 660
GAAATCTGGG AGGAGCTGGG TGTGATGGGG GTGTATGATG GGAGGGAGCA CACTGTCTAT 720
GGGGAGCCCA GGAAACTGCT CACCCAAGAT TGGGTGCAGG AAAACTACCT GGAGTACCGG 780
CAGGTACCCG GCAGTAATCC TGCGCGCTAT GAGTTCCTGT GGGGTCCAAG GGCTCTGGCT 840
GAAACCAGCT ATGTGAAAGT CCTGGAGCAT GTGGTCAGGG TCAATGCAAG AGTTCGCATT 900
GCCTACCCAT CCCTGCGTGA AGCAGCTTTG TTAGAGGAGG AAGAGGGAGT CTGA



Seq ID NO: 288 Protein sequence: 
Protein Accession #: NP_002353.1



1 11 21 31 41 51
| | | | | |
MSSEQKSQHC KPEEGVEAQE EALGLVGAQA PTTEEQEAAV SSSSPLVPGT LEEVPAAESA 60
GPPQSPQGAS ALPTTISFTC WRQPNEGSSS QEEEGPSTSP DAESLFREAL SNKVDELAHF 120
LLRKYRAKEL VTKAEMLERV IKNYKRCFPV IFGKASESLK MIFGIDVKEV DPASNTYTLV 180
TCLGLSYDGL LGNNQIFPKT GLLIIVLGTI AMEGDSASEE EIWEELGVMG VYDGREHTVY 240
GEPRKLLTQD WVQENYLEYR QVPGSNPARY EFLWGPRALA ETSYVKVLEH VVRVNARVRI 300
AYPSLREAAL LEEEEGV



Seq ID NO: 289 DNA sequence 
Nucleic Acid Accession #: NM_002362
Coding sequence: 46..1344



CGGCGGCCGC 
GGCGACCTGA 
CATCAGCGCG 
CTCAACAGAC 
TTGACCAGAA 
CAGCCCATCG 
GGCCCCAGCA 
GTTCTACCTG 
AAATCCCATC 



11 
I 

GCCCTGGTTG 
AGCAGGCGCT 
GCAGCAGCAC 
ATAATATTGT 
ATGTGCAGTC 
ATTTGAGTGC 
GTGAAAATCT 
CAGCTGAATT 
TCCTCGATTA 




TGCAAAGAAA 
GTTTGGTGAT 
TGTGTCTATT 
ATGCACTGTT 
GGAGGAAGAG 
CCATGGGCTT 
TGTGATGACA 



31 
I 

GCTCTCGGGG 
GCCGAGTCGC 
GAAGACATAA 
TACACATGGA 
ATTGACACAG 
GCACTTCACA 
ACAGAAAACA 
TGGGACAGCT 
ACTTTACTGT 



41 

I 

GCGCCATGGA 
CAACGGTCCA 
ACCTGAGTGT 
CTGAGTTTGA 
AATTAAAGGT 
TTTTCCAGCT 
TAATTGCAGC 
TGGTATACGA 
TTTCAGACAA 



51 

I • 

CGAGGCCGTG 
CGTGGAGGTG 
TAGAAAGCTA 
TGAACCTTTT 
TAAAGACTCA 
GAATGAAGAT 
AAATCACTGG 
TGTGGAAGTC 
GAACGTCAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 

AGCAACCTCA TCACCTGGAA CCQGGTGGTG 
ACATCCCTGT GTAAAGCGTT AGCCCAGAAA 
TATGGCCAAT TAATTGAAAT AAACAGCCAC 
GGCAAGCTGQ TAACCAAGAT OTTTCAGAAG 
CTGGTGTTCG TGCTGATTGA TGAGGTGGAG 
GCGGGCACCG AGCCATCAGA TGCCATCOGC 
CAGATTAAAA GGCATTCCAA TGTTGTGATT 
GACGTGGCCT TCGTGGACAG GGCTGACATC 
GCCATCTTCA AAATCTACCT CTCTTGTTTG 
CCTCGCCAGC AGCTGCTGAC CCTCCGAGAG 
GTGTCAAAAT TGAGCCTTCT TTTGAATGAC 
CGGGTCCTGA GAAAACTCCC CTTTCTGGCT 
ACCATAGAGG GGTTCCTCCA GGCCCTGTCT 
AAGAAGCTTG CAGCTTACAT CTGATCCTGG 
AACACACAAC CAGTAAGTGA GGTTGCCCCA 
AAACCAAACG TTACTTAGAC TGCAAGCTAG 
AAGTGTATTC TATTTATGTT GTTTTAAAAT 
TCACTGTTTG TAAAAGATAA TTCAGATTGT 
GTTTGTTCCC AGCCCACCCC CAGTGGATGG 
AGCAAAAAAG GAAGATTAAT GCAGGTGTTA 
AAAGAAGCAT ATAATCATAG CATTAAAAAT 
TTGCTTTCTG ATATCAGCTC GTTTGATTTA 
GATGTAAAAA AGCCTATTTC TACATTATAC 
TCTTTCATAA TAAATAATCA AGACATGGTC 
GTTCCAGGGA AACACATGCT GGACATCCCT 
TGGGATGTTT CTGCCCACGG TTTTGTTTGT 
TTCACATTAA TATAATATAA AATAAATAGG 
ATGTTTTGCT TTTATCTCAC AGTAAAATAA 

Seq ID NO: 290 Protein sequence:
Protein Accession #: NP_004228






MDEAVGDLKQ 
FDEPFLTRNV 
AANHWVLPAA 
PGTGKTSIiCK 
DDKDALVPVL 
ITEKIDVAFV 
FIENNVSKLS 
QFEERKKLAA 



11 
! 

ALPCVAESPT 
QSVSIIDTEL 
EFHGLWDSLV 
ALAQKLTIRL 
IDEVESLTAA 
DRADIKQYIG 
LLLNDISRKS 
YI 



21 
I 

VHVEVHQRGS 
KVKDSQPIDL 
YDVEVKSHLL 
SSRYRYGQLI 
RNACRAGTEP 
PPSAAAIFKI 
EGLSGRVLRK 



CTGCTCCAGG 
TTGACAATTA 
AGCCTCTTTT 
ATTCAGGATT 
AGTCTCACAG 
GTGGTCAATG 
CTGACCACTT 
AAGCAGTACA 
GAAGAACTGA 
CTAGAGATGA 
ATTTCAAGGA 
CATGCGCTGT 
CTGGCAGTGG 
GCTTCCCCAT 
CACAGCCGTC 
AAAGCCACCA 
GCATACTGAG 
TTGTCTCCTT 
GATGCATAAT 
TAGAAGCCAG 
GCACACATTA 
GTGCAAAAAT 
CAACTGAGAA 
CCATTTGCAG 
TGTAACCCGG 
GCAATAACGT 
TCAGTTACTG 
ATATAATTAA 



31 

I 

STAKKEDINL 
SACTVALHIF 
DYVMTTLLFS 
EINSHSLFSK 
SDAIRWKAV 
YL5CLKETJ4K 
LPPLAHAIiYV 



Seq ID NO: 291 DNA sequence 

Nucleic Acid Accession #: NM_002658.1

Coding sequence: 77-1372 



1 
I 

GTCCCCGCAG 
CCCCGACCTC 
GAGCGACTCC 
TGGAGGAACA 
GAAATTCGGA 
TCACTTTTAC 
CTCTGCCACT 
CCTGGGGAAA 
GCAGGTGGGC 
AAAGCCCTCC 
CCGCTTTAAG 
CATCTACAGG 
CCCTTGCTGG 
CATCGTCTAC 
GGTGGAAAAC 
CATTGCCTTG 
ACAGACCATC 
CACTGGCTTT 
TGTTGTGAAG 
CACCACCAAA 
CTCAGGGGGA 
CTGGGGCCGT 
CTTACCCTGG 
AGGGAGGAAA 
TCCATCAGCT 
CACCACCAGG 
CAGACCCTCT 
TGTCTTTTTC 
GGCTCGAAGG 
AATGAATAAT 
AATGTGGGAG 
ATTCCATGAA 
GCTGTGAGTG 
AAACTGTGTG 
CTGGGGCCTC 
ACCTGTGACC 
ATCCCTTCCT 
ACACTGAATA 



11 

I 

CGCCGTCGCG 
GCCACCATGA 
AAAGGCAGCA 
TGTGTGTCCA 
GGGCAGCACT 
CGAGGAAAGG 
GTCCTTCAGC 
CATAATTACT 
CTAAAGCCGC 
TCTCCTCCAG 
ATTATTGGGG 
AGGCACCGGG 
GTGATCAGCG 
CTGGGTCGCT 
CTCATCCTAC 
CTGAAGATCC 
TGCCTGCCCT 
GGAAAAGAGA 
CTGATTTCCC 
ATGCTATGTG 
CCCCTCGTCT 
GGATGTGCCC 
ATCCGCAGTC 
CGGGCACCAC 
GTAAGAAGAG 
GTGAACGACA 
GGCCAGGATG 
TGGACTGAAG 
GAGAGCCAGC 
TTCCCAATTA 
CAGCGGTTTG 
TGTATCAGGA 
TAAGTGTGAG 
GACTGTGATG 

AGCACTGTCT 
TTTAGCCTAG 
TTTATATTTC 



21 
I 

CCCTCCTGCC 
GAGCCCTGCT 
ATGAACTTCA 
ACAAGTACTT 
GTGAAATAGA 
CCAGCACTGA 
AAACGTACCA 
GCAGGAACCC 
TTGTCCAAGA 
AAGAATTAAA 
GAGAATTCAC 
GGGGCTCTGT 
CCACACACTG 
CAAGGCTTAA 
ACAAGGACTA 
GTTCCAAGGA 
CGATGTATAA 
ATTCTACCGA 
ACCGGGAGTG 
CTGCTGACCC 
GTTCCCTCCA 
TGAAGGACAA 
ACACCAAGGA 
CCGCTTTCTT 
ACTGGGAAGA 
ATAGCTTTAC 
GAGGGGTGGT 
CCTGCAGGAG 
TCCCCCGACC 
GGAAGTGTAA 
GGGAGCAGAG 
AATATATATG 
TAAGAGCTGG 
CCACACAGAG 
CACGTGACAG 
CAGTTTCACT 
TTCATCCAAT 
ACTATTTTTA 



31 
I 

GCAGGCCACC 
GGCGCGCCTG 
TCAAGTTCCA 
CTCCAACATT 
TAAGTCAAAA 
CACCATGGGC 
TGCCCACAGA 
AGACAACCGG 
GTGCATGGTG 
ATTTCAGTGT 
CACCATCGAG 
CACCTACGTG 
CTTCATTGAT 
CTCCAACAOG 
CAGCGCTGAC 
GGGCAGGTGT 
CGATCCCCAG 
CTATCTCTAT 
TCAGCAGCCC 
CCAATGGAAA 
AGGCCGCATG 
GCCAGGCGTC 
AGAGAATGGC 
GCTGGTTGTC 
TAGGCTCTGC 
CCTCACGGAT 
CCTGACTCAA 
TTAAAAAGGG 
GGTGGGCATT 
GCAGCTGAGG 
ACACTAACGA 
TGTGTGTATG 
TGTCTGATTG 
TGGTCTTTCT 
TGCCTGGGAA 
TTCACATAGA 
CCTCACTGGG 
TTTATATTTT 



GT( 

GACTTTCAAG 
CTAAGTGGTT 
TGATTGATGA 
CCGCCCGAAA 
CTGTCTTGAC 
CTAACATCAC 
TTGGGCCACC 
TGAAGTGTCA 
TTGGCTTCAT 
AGAGCGAGGG 
ATGTCCAGGC 
ACAAGCAGTT 
CTGGTGCTTT 
TCCCAGGGAA 
AGGCCAGGCT 
AGACAAACAT 
GTGAAGAACC 
GCCAGCAAGT 
AAGAGAAACT 
CTCCAGGTGG 
GTTTTCAAGA 
AAAAATGGTC 
GAAAAGTGCA 
TATGGGCGCC 
TATCACATTT 
GTCTCTTTCT 
AAA 



41 

I 

SVRKLLNRHN 
QLNEDGPSSE 
DKNVNSNLIT 
WFSESGKLVT 
LTQIDQIKRH 
OQIIYPRQQL 
QAPTVTIEGP 



41 
I 

GAGGCGGCCG 
CTTCTCTGCG 
TCGAACTGTG 
CACTGGTGCA 
ACCTGCTATG 
CGGCCCTGCC 
TCTGATGCTC 
AGGCGACCCT 
CATGACTGCG 
GGCCAAAAGA 
AACCAGCCCT 
TGTGGAGGCA 
TACCCAAAGA 
CAAGGGGAGA 
ACGCTTGCTC 
GCGCAGCCAT 
TTTGGCACAA 
CCGGAGCAGC 
CACTACTACG 
ACAGATTCCT 
ACTTTGACTG 
TACACGAGAG 
CTGGCCCTCT 
ATTTTTGCAG 
ACAGATGGAT 
AGGCCTGGGT 
CATGTTACTG 
CAGGGCATCT 
TGTGAGGCCC 
TCTCTTGAGG 
CTTCAGGGCA 
TTTGCACACT 
TTAAGTCTAA 
GGAGAGGTTA 
TGTACTTATT 
TGTCCCTTTC 
TGGGGTGAGG 
TGTAATTTTA 



CACTGGAAAA 
CAGGTACCGA 
TTCGGAAAGT 
TAAAGACGCC 
TGCCTGCAGG 
CCAAATTGAT 
CGAGAAGATC 
CTCTGCAGCA 
GATCATATAC 
TGAAAACAAC 
CCTCAGCGGC 
CCCCACCGTC 
TGAAGAGAGA 
TCCCATGGAG 
TCCCTTCTGC 
TTGTTAAAAG 
CTTGTCATTT 
ATCGAAACCT 
TTTGTTTAAC 
GTGTCACCCT 
AAGGTGGCAA 
CTATTTAATG 
GGTAAAGTGT 
GACTCTGAGT 
CCTGCATTGC 
CTAATGAGGA 
GCCGAATGTT 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980
2040 
2100 
2160 



51 

I 

IVFGDYTWTE 
NLEEETENII 
WNRWLLHGP 
KMFQKIQDLI 
SNWILTTSN 
LTLRELEMIG 
LQAXiSLAVDK . 



51 
I 

CGGTCTAGCG 
TCCTGGTCGT 
ACTGTCTAAA 
ACTGCCCAAA 
AGGGGAATGG 
TGCCCTGGAA 
TTCAGCTGGG 
GGTGCTATGT 
CAGATGGAAA 
CTCTGAGGCC 
GGTTTGCGGC 
GCCTCATCAG 
AGGAGGACTA 
TGAAGTTTGA 
ACCACAACGA 
CCCGGACTAT 
GCTGTGAGAT 
TGAAAATGAC 
GCTCTGAAGT 
GCCAGGGAGA 
GAATTGTGAG 
TCTCACACTT 
GAGGGTCCCC 
TAGAGTCATC 
TTGCCTGTGG 
GCTGGCTGCC 
ACCAGCAACT 
CCTGTGCATG 
ATGGTTGAGA 
GAGCTTAGCC 
GGGCTCTGAT 
TGTTGTGTGG 
ATATTTCCTT 
TAGGTCACTC 
CTGCAGCATG 
TTGGCCAGTT 
ACCACTCCTT 
AATAAAAGTG 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 




ATCAATAAAA TGTGATTTTT CTGA 

Seq ID NO: 292 Protein sequence:
Protein Accession #: NP_002649.1

1 11 21 31 41 51
| | | | | |
MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60
HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120
YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180
GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240
RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300
PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360
CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420
SHTKEENGLA L



Seq ID NO: 293 DNA sequence
Nucleic Acid Accession #: NM_001498
Coding sequence: 93..2006



1 
I 

GGCACGAGGC 
AGGAGGAGGA 
TGAGCTGGGA 
TCCTGCACAT 
AGGTGGAATA 
CTGGGGAGAA 
CTACCCTTTG 
AOGGAGGAAC 
AGGCTACTTC 
TAGGCTGTCC 
CCAAGTCCCT 
CAAGAAATAT 
ACAAGAATAC 
CTTCTAAGCC 
TCCAGGTGAC 
CTACTATCTG 
TGTCAGACAT 
AGGAGCGAGG 
ACTCAATAGA 
TAGATAAAGA 
ATGTTGCTCA 
ATGATGCTAA 
GATTTAAGCC 
AGGTGCAATT 
GAGTGATCCT 
TGAAGGTAGC 
TTTGCAAAGG 
TCGCTGCAGA 
TGTTTCCTGG 
ACACCAGATG 
TAATGACAGT 
ACAGTGTCAT 
ATGAATTATG 
GAAGTAAAAC 
ACTGGCTACA 
CTGTACTGTT 
TCTAGAGTTT 
AATAACATAT 
CAACCTACTG 
GAGTTAAAAT 
TGAAATTTTT 
TGTCTACATT 
GCACCTTATC 
GCTTTCCCCT 



11 

I 

TGAGTGTCCG 
GGAGGAGGAG 
GGAAACCAAG 
CTACCACGCC 
CATGTTGGTA 
AGTTCTTGAA 
GAGACCAGAG 
AATGTCCGAG 
TATATTAGAA 
TGGGTTCACA 
CTTCTTTCCA 
CCGACATAGG 
ACCATCTCCA 
GGATCATATT 
ATTCCAAGCC 
TCCAATTGTT 
TGATTGTCGC 
ACTGGAGCCA 
CAGCTATTTA 
GATCTACGAA 
TCTCTTTATT 
TGAGTCTGAC 
CCCTCCTCCA 
AACAGACTTT 
TTCCTACAAA 
ACAGAAAAGA 
TGGCAATGCA 
GGAGTACACC 
ACTGATCCCA 
TAGTATTCTG 
TGCGAGATGG 
AACTGATGAA 
TGAATGCCCA 
TGACTCATCC 
GTACCATGCC 
TTCTGGGCCA 
ATACAGTGTA 
CTAAAGTCAT 
TCTAAGCAGT 
GTTTACTGTA 
CTCTTTAAAA 
AAATCACTTG 
TATGATGTTT 
CTGAATAAAT 



21 
I 

TCTCGCGCCC 
GAGGGGGCGG 
CGCCATGCCG 
GTCAAGGACC 
TCTTTTGATC 
ACTCTGCAAG 
TATGGGAGTT 
TTCAATACAG 
GAAAATCAGG 
CTGCCCGAGG 
GATGAAGCAA 
AGAGGAGAAA 
TTTATAGAAA 
TACATGGATG 
TGCAGTATAT 
ATGGCTTTGA 
TGGGGAGTGA 
TTGAAGAACA 
TCTAAGTGTG 
CAGCTGTTGC 
AGAGACCCAC 
CATTTTGAGA 
AACTCAGACA 
GAGAACTCTG 
TTGGATTTTC 
GATGCTGTCT 
GTGGTGGATG 
CTCATGAGCA 
ATTCTGAACT 
AACTACCTAA 
ATGAGGGAGT 
ATGAATTATA 
GAGTTACTTG 
AACTAGACAT 
TCTCAGCCCG 
GTGAGCCAGA 
CATGTACATA 
CATGAACTGG 
TTTGTAAATG 
AATTTTTGTT 
ACATTTTCTC 
AATOCATTGA 
CTTTTGCAAT 
ACCCATTGAA 



31 

I 

GGAAGCGGGC 
CCATGGGGCT 
ACCACGTGCG 
GGCACAAGGA 
ATGAAAATAA 
AGAAGGGGGA 
ACATGATTGA 
TTGAGGCCAA 
CTCTTTGCAC 
TCAAACCCAA 
TAAACAAGCA 
AGGTTGTCAT 
CATTTACTGA 
CCATGGGATT 
CTGAGGCCAG 
GTGCTGCATC 
TTTCTGCATC 
ATAACTATAG 
GTGAGAAATA 
AGGAAGGCAT 
TGACACTGTT 
ATATTCAGTC 
TTGGATGGAG 
CCTATGTGGT 
TCATTCCACT 
TGCAGGGAAT 
GTTGTGGCAA 
TAGACACCAT 
CTTACCTTGA 
AGCTAATTAA 
TTATCGCAAA 
GCCTTATTTT 
GATCAGCATT 
TCTACAGAAA 
TGTGTATAAT 
AATTGATTAA 
GTAAAGTATT 
CTTGTACATT 
TACTGGTAAT 
CTTTTAAAGA 
TCGTTAATTT 
AAGTGCTTCA 
TGGAATAATC 
CTCTGAAAAA 



41 
I 

GACOGCCGTC 
GCTGTCCCAG 
GCGGCACGGG 
CGTTCTCAAG 
AAAAGTCCGG 
AAGGACAAAC 
AGGGACACCA 
CATGCGAAAA 
AATAACTTCA 
CCCAGTGGAA 
CCCTCGCTTC 
CAATGTACCA 
GGATGATGAA 
TGGAATGGGC 
ATACCTTTAT 
TCCCTTTTAC 
TGTAGATGAT 
GATCAGTAAA 
TAATGACATC 
TGATCATCTC 
TGAAGAGAAA 
CACAAATTGG 
AGTAGAATTT 
GTTTGTGGTA 
GTCAAAGGTT 
GTTTTATTTC 
GGCCCAGAAC 
CATCAATGGG 
AAACATGGAA 
GAAGAGAGCA 
CCATCCTGAC 
GAAGTGTAAC 
TAGGAAAGTA 
GAAAAATGCA 
ATGAAGACCA 
GGCTTTCTTT 
TTTGATTAAC 
TTTAAATTCT 
TGTACAATAC 
CTACCTGGGA 
TCCTTTGTCA 
AGGGTAATCT 
ACTTGGTCAC 
AAAAAAAAAA 



51 
I 

AGCCCGGAGG 
GGCTCGCCGC 
ATCCTCCAGT 
TGGGGCGATG 
TTGGTCCTGT 
CCAAACCATC 
GGACAGCCCT 
CGCCGGAAGG 
TTTCCCAGAT 
GGAGGAGCTT 
AGTACCTTAA 
ATATTTAAGG 
GCTTCAAGGG 
AATTGCTGTC 
GATCAGTTGG 
CGAGGCTATG 
AGAACTCGGG 
TCCCGATATG 
GACTTGACGA 
CTGGCCCAGC 
ATACACCTGG 
CAGACAATGA 
CGACCCATGG 
CTGCTCACCA 
GATGAGAACA 
AGGAAAGATA 
AGCACGGAGC 
AAGGAAGGTG 
GTGGATGTGG 
TCTGGAGAAC 
TACAAGCAAG 
CAAATTGCAA 
AAATATAGTG 
TTATTGACGA 
AATGATAGAA 
GGTAGGTAAA 
AATGTATTTT 
TACTCTGGAG 
TTGCATTCCA 
CCTGATTTAT 
TTTCCTTTGT 
TGGGTTTCTA 
CTTGCCCCAA 
AAAA 



Seq ID NO: 294 Protein sequence:
Protein Accession #: NP_001489 



1 

I 

MGLLSQGSPL 
ENKKVRLVLS 



NKHPRPSTIjT 
MGFGMGNCCIi 
SASVDDRTRE 
EGIDHLLAQH 
GWRVEFRPME 
QGMFYFRKDI 
YLENMEVDVD 
LILKCNQIAN 



11 
I 

SWEETKRHAD
GEKVLETLQE 
ATSILEENQA 
RNIRHRRGEK 
QVTPQACSIS 
ERGLEPLKNN 
VAHLFIRDPL 
VQLTDFENSA 
CKGGNAWDG 
TRCSILNYLK 
ELCECPELLG 



HVRRHGILQF 



LCTITSFPRL 
WINVPIFKD 
EARYLYDQliA 
NYRISKSRYD 
TLFEEKIHLD 
YWFWLLTR 
CGKAQNSTEL 
LIKKRASGEL 
SAFRKVKYSG 



31 
I 

LHIYHAVKDR 
TLWRPEYGSY 
GCPGFTLPEV 
KNTPSPFIET 
TICPIVMALS 
SIDSYLSKCG 



VILSYKLDFL 
AAEEYTLMSI 
MTVARWMREF 



41 
I 

HKDVLKWGDE 
MIEGTPGQPY 
KPNPVEGGAS 
FTEDDEASRA 
AASPFYRGYV 
EKYNDIDLTI 
IQSTNWQTMR 
IPLSKVDENM 
DTIINGKEGV 
IANHPDYKQD 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 



51 

I 

VEYMLVSFDH 
GGTMSEFNTV 
KSLFFPDEAI 
SKPDHIYMDA 
SDIDCRWGVI 
DKEIYEQLLQ 
FKPPPPNSDI 
KVAQKRDAVL 
FPGLIPILNS 
SVITDEMNYS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 




Seq ID NO: 295 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-916 

1 11 21 31 41 51
| | | | | |
AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAAGCTT 420
ATGACAGGAC ATGCTATTCC ACCCAGCCAA TTGGATTCTC AGATTGATGA CTTCACTGGT 480
TTCAGCAAAG ATAGGATGAT GCAGAAACCT GGTAGCAATG CACCTGTGGG AGGAAACGTT 540
ACCAGCAGTT TCTCTGGAGA TGACCTAGAA TGCAGAGAAA CAGCCTCCTC TCCCAAAAGC 600
CAACGAGAAA TTAATGCTGA TATAAAACGT AAATTAGTGA AGGAACTCCG ATGCGTTGGA 660
CAAAAATATG AAAAAATCTT CGAAATGCTT GAAGGAGTGC AAGGACCTAC TGCAGTCAGG 720
AAGCGATTTT TTGAATCCAT CATCAAGGAA GCAGCAAGAT GTATGAGACG AGACTTTGTT 780
AAGCACCTTA AGAAGAAACT GAAACGTATG ATTTGAGAAT ACTTGTCCCT GGAGGATTAT 840
CACACCCCAA ATGCATAATC TCGTTAATGA TTGAGGAGAG AAAAGGATCA GATTGCTGTT 900
TTCTACAATG GAGCAGGATA TTGCTGAAGT CTCCTGGCAT ATGTTACCGA ATCAAATAGC 960
CTTCCAGAGG CTAAGAAATT TCTGTTAGTA AAAGATGTTC TTTTTCCCAA AGCATTTTAT 1020
TTGAAAGGAT AACTTGTGTT TTGGTTATTT TGTATTCCCA CCTGTGCTGG TAGATATTAT 1080
TAACCCATTA GGTAAATACT ATTACAGTCG TGGTTTCTGC A

Seq ID NO: 296 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQR 120
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI

Seq ID NO: 297 DNA sequence 
Nucleic Acid Accession #: Eos sequence
Coding sequence: 247-815 

1 11 21 31 41 51 

| | | | | |

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60 
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120 
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180 
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240 
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300 
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360 
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAAGCTT 420 
ATGACAGGAC ATGCTATTCC ACCCAGCCAA TTGGATTCTC AGATTGATGA CTTCACTGGT 480 
TTCAGCAAAG ATAGGATGAT GCAGAAACCT GGTAGCAATG CACCTGTGGG AGGAAACGTT 540 
ACCAGCAGTT TCTCTGGAGA TGACCTAGAA TGCAGAGAAA CAGCCTCCTC TCCCAAAAGC 600 
CAACAAGAAA TTAATGCTGA TATAAAACGT AAATTAGTGA AGGAACTCCG ATGCGTTGGA 660 
CAAAAATATG AAAAAATCTT CGAAATGCTT GAAGGAGTGC AAGGACCTAC TGCAGTCAGG 720 
AAACGATTTT TTGAATCCAT CATCAAGGAA GCAGCAAGAT GTATGAGACG AGACTTTGTT 780 
AAGCACCTTA AGAAGAAACT GAAACGTATG ATTTGAGAAT ACTTGTCCCT GGAGGATTAT 840 
CACACCCCAA ATGCATAATC TCATTAATGA TTGAGGAGAG AAAAGGATCA GATTGCTGTT 900
TTCTACAATG GAGCAGGATA TTGCTGAAGT CTCCTGGCAT ATGTTACCGA ATCAACTGGC 960 
CTTCCAGAGG CTAAGAAATT TCTGTTAGTA AAAGATGTTC TTTTTCCCAA AGCGTTTTAT 1020 
TTGAAAGGAT AACTTGTGTT TTGGTTATTT TGTATTCCCA CCTGTGCTGG TAGATATTAT 1080 
TAACCCATTA GGTAAATACT ATTACAGTCG TGGTTTCTGC A 

Seq ID NO: 298 Protein sequence: 
Protein Accession #: Eos sequence






1 11 21 31 41 51

| | | | | |

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 120
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI
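
The "Coding sequence" coordinates in this listing are 1-based positions in the DNA rows, so a protein entry can be checked against its DNA entry by translating from the first coordinate. The Python sketch below is illustrative only and is not part of the listing. It assumes the standard genetic code, and the names `translate`, `CODON_TABLE`, and `row_241` are hypothetical. It translates the row of Seq ID NO: 297 that begins at position 241, starting at the coding start 247.

```python
# Illustrative sketch (assumption: standard genetic code, NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, cds_start, offset=1):
    """Translate dna from the 1-based listing position cds_start.

    offset is the 1-based listing position of the first base in dna.
    Translation stops at the first stop codon or when fewer than
    three bases remain. Unrecognized codons are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    begin = cds_start - offset
    protein = []
    for pos in range(begin, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Row of Seq ID NO: 297 covering positions 241 to 300.
row_241 = "GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT"
prefix = translate(row_241, cds_start=247, offset=241)
print(prefix)  # MTDKTEKVAVDPETVFKR
```

The 18 residues produced agree with the first 18 residues of Seq ID NO: 298, which is the kind of agreement expected between a DNA entry and its paired protein entry.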






Seq ID NO: 299 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-815 

1 11 21 31 41 51 

| | | | | |

AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120 
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180 
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240 
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300 
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360 
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGC AAAGAGCTTA 420 
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480 




TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAACGTTA 540
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT CCCAAAAGCC 600
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA TGCGTTGGAC 660
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT GCAGTCAGGA 720
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA GACTTTGTTA 780
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG GAGGATTATC 840
ACACCCCAAA TGCATAATCT CATTAATGAT TGAGGAGAGA AAAGGATCAG ATTGCTGTTT 900
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA TCAACTGGCC 960
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT TTTTCCCAAA GCGTTTTATT 1020
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT AGATATTATT 1080
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA

Seq ID NO: 300 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 60
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 120
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180
LKKKLKRMI

Seq ID NO: 301 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-812






1 11 21 31 41 51
| | | | | |
AGTGTTCGGC TGGGGCAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTCCC ATCCCCCTTG 60
GGCCAAACGG GATCGGTGCT TCTGGTGAGA CGCCTCCCCA TGCACATCAC TCCCAGGTGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGA AAAGAGCTTA 420
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480
TCAGCAAAGA TGGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAATGTTA 540
CCAGCAATTT CTCTGGAGAT GACCTAGAAT GCAGAGGAAT AGCCTCCTCT CCCAAAAGCC 600
AACAAGAAAT TAATGCTGAT ATAAAATGTC AAGTAGTGAA GGAAATCCGA TGCCTTGGAC 660
AATATGAAAA AATCTTCGAA ATGCTTGAAG GAGTGCAAGG ACCTACTGCA GTCAGGAAAC 720
GATTTTTTGA ATCCATCATC AAGGAAGCAG CAAGATGTAT GAGACGAGAC TTTGTTAAGC 780
ACCTTAAGAA GAAACTGAAA CGTATGATTT GAGAATACTT GTCCCTGGAG GATTATCACA 840
CCCCAAATGC ATAATCTCAT TAATGATTGA GGAGAGAAAA GGATCAGATT GCTGTTTTCT 900
ACAATGGAGC AGGATATTGC TGAAGTCTCC TGGCATATGT TACCGAATCA ACTGGCCTTC 960
CAGAGGCTAA GAAATTTCTG TTAGTAAAAG ATGTTCTTTT TCCCAAAGCG TTTTATTTGA 1020
AAGGATAACT TGTGTTTTGG TTATTTTGTA TTCCCACCTG TGCTGGTAGA TATTATTAAC 1080
CCATTAGGTA AATACTATTA CAGTCGTGGT TTCTGCA




Seq ID NO: 302 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT
GHAIPPSQLD SQIDDFTGFS KDGMMQKPGS NAPVGGNVTS NFSGDDLECR GIASSPKSQQ
EINADIKCQV VKEIRCLGQY EKIFEMLEGV QGPTAVRKRF FESIIKEAAR CMRRDFVKHL
KKKLKRMI

Seq ID NO: 303 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 247-815 



1 11 21 31 41 51
| | | | | |
AGTGTTCGGC TGGGACAGGC ACGCTGTGGC TGGCTACTTC CCTTCCTTCC ATCCCCCTTG 60
GGCCAAACAG GATCGGTGCT TCTGGTGAGA CGTCTCCCCA TGCACATCAC TCCCAGATGC 120
CCTAGGGGGC ACATTTCCCA CAACTCCCAG AGGGCAGGTT TCTAGAAAGT GCCACCAGTG 180
GGGAGGCGCC ACAACTTCAC TGCCATTTTG TGAGGTGCCG CCGTCTCTCC TCCAGCAAGG 240
GAAACAATGA CCGATAAAAC AGAGAAGGTG GCTGTAGATC CTGAAACTGT GTTTAAACGT 300
CCCAGGGAAT GTGACAGTCC TTCGTATCAG AAAAGGCAGA GGATGGCCCT GTTGGCAAGG 360
AAACAAGGAG CAGGAGACAG CCTTATTGCA GGCTCTGCCA TGTCCAAAGC AAAGAGCTTA 420
TGACAGGACA TGCTATTCCA CCCAGCCAAT TGGATTCTCA GATTGATGAC TTCACTGGTT 480
TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA GGAAACGTTA 540
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT CCCAAAAGCC 600
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA TGCGTTGGAC 660
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT GCAGTCAGGA 720
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA GACTTTGTTA 780
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG GAGGATTATC 840
ACACCCCAAA TGCATAATCT CGTTAATGAT TGAGGAGAGA AAAGGATCAG ATTGCTGTTT 900
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA TCAACTGGCC 960
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT TTTTCCCAAA GCGTTTTATT 1020
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT AGATATTATT 1080
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA



Seq ID NO: 304 Protein sequence: 
Protein Accession #: Eos sequence




1 11 21 31 41 51 

I I I I I I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH
LKKKLKRMI

Seq ID NO: 305 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 87-689 






60 
120 
180 



CGTGGAGGCA 
CCAGACTAGC 
AGATGTCCGC 
CAGAGGTCCC 
TGTCCGGGAA 
ATCGGGAAAT 
CTCCCAAAAG
AATCCACAAA 
ATAATTTAAA 
AGTATGAGAA 
CTGCTAAAGT 
AGGAGGAGGA 
TTAGAGTAGG 
ATTAGGTTTA 
AATTGTCAGT 
AAGTTGTACA 
CTGTGCACTT 
ATTTGTAAGG 
TATCTATAGT 
GCGTTGAGGC 
GAGGCTGGAC 
GTATATAGTG 
CATGAGAATA 
TATAGAACTC 
CTCCTGTACT 
GAAATGTTTT 
AGTCAATTTC 
CTCCCTATAA 
TGAAGGAGAG 
ATGAAGTCTG 
AGGAAGGTGG 
CCTATTTTGT 
AAATTAAGGC 
ACATTATTTG 
AGTGAGGGTA 
TTGGAAACAC 
AGGGCAGGCT 
GCCTGCTCAT 
GAAGGAGCTT 
GTTGGGGTGA 
CTGATGTGTA 
GTGAGTGTTG 
CGCAGGAGTG 
GTTGAGAAAC 
TACGAGTTAT 
CATCAGAACT 
AAAACTCGGT 
GCCATGTCCT 
CTAGGCCAAG 
GGTCAAAAGG 
CAGGGAAGGG 
GTGGGGGAGC 
TGTCAAGTTG 
CACACTGTGG 
AAGTGGCATG 
TTCTTAAGAT 
CTACTCCCTC 
AGGCTGTGAT 
GTATTTGGGG 



11 
I 

GCTAGCGCGA 
GAACAATACA 
TTATGCCTTC 
TGTCAATTTT 
AGAGAAATCT 
GAAGGATTAT 
GCCACCGTCT 
CCCCGGCATC 
TGACAGTGAA 
GGATGTTGCT 
TGCCCGGAAA 
GGAGGAGGAG 
GGAGCGCCGT 
ATTACAAAAT 
GGTTTACATG 
TATTTCCAAA 
TGCTGTTGGT 
TGGTGGTAAC 
TTGTAAAAAG 
TGTGGGGAAG 
CTGTTGACTC 
ACATAGCATT 
TTTTTTTTTT 
TTCATTGTCA 
TAAACACGAT 
TGAAGTTAAA 
TGACTCACAG 
ATGTGGTAGC 
GGCTACTTGA 
GAGGAGTTAG 
GTGATTAGGA 
GGGGCCAAAT 
CTTATTGTTT 
TGGTGCCCAA 
TGTGGGATGG 
CAAACACCCC 
AATGGAATCA 
AAGTTTAGCT 
GGTTTGTGTG 
GGGGAGATGG 
TATACATCAT 
CTATTGCCCA 
TTTTTGTGCT 
TTGCATGTCT 
GGTCACGGTC 
GTTTGTCCTG 
TGTGAGGTTT 
TGTCACTTGG 
ATTCGGGAGC 
GAGTGATTTG 
CCAAGGATGG 
AGTTTAGCCA 
AGGCCACTTG 
CAAGATTGCT 
ATGTTACCTA 
GCCAACCTGT 
TAACCACCTC 
GTAGTGACTA 
ACGTTGGATG 



21 
I 

GGCTGGGGAG 
GTCAGGATGG 
TTTGTGCAGA 
GCGGAATTTT 
AAATTTGATG 
GGACCAGCTA 
GGATTCTTCC 
TCTATTGGAG 
AAGCAGCCTT 
GACTATAAGT 
AAGGTGGAAG 
GATGAATAAA 
AATTGACACA 
TTGATCACGA 
AAGTGGCCAT 
CATTTTTAAA 
GTGACAAGGC 
TATGGTTATT 
AACAAAACAA 
ATGCCTTTTG 
TGCAGGGGGC 
CTGCTGCCAT 
TAAGTGCGGT 
GCAAAGCAAA 
TCGCAACGTT 
TAAACAGTAT 
CAGTGAACAA 
TTCTTTTATT 
AGCTACTGTG 
GAGAACGACA 
CTGAGGCTAT 
GCATTGCTAA 
TTCTCTTTCA 
CATTTGGGGT 
GGTGGTGGGG 

aaggaagatg 
accatttctg 
cattcactgg 
tcagtggtta 
ccacagtagc 

TACTGTCCGT 
GCATTAATAT 
ATTAATTTTA 
GGAGGCGGTG 
ACAGCCTGAT 
AATGTGTTCC 
GCCCAGAGGC 
CATTCTAAGC 
TGTTGCCAGC 
TTAAGTGGTG 
AAAGGGGTAA 
GATGATCTTT 
GTCCATTAGC 
CTTCTAGTGG 
AGGCTTAGGC 
TGCTTTTTTT 
ACCCCATTCT 
TTGTCTGTGT 
CATTCATTTT 



31 
I 

CGCTGAGCCG 
CTAAAGGTGA 
CATGCAGAGA 
CCAAGAAGTG 
AAATGGCAAA 
AGGGAGGCAA 
TGTTCTGTTC 
ACGTGGCAAA 
ACATCACTAA 
CGAAAGGAAA 
AGGAAGATGA 
GAAACTGTTT 
TCTCTTATTT 
TCATATTGTA 
GGGTGTCTGG 
ATGAAAAGGC 
ATTTAAAGAT 
GGCTAGAAAT 
CCGAGACAAA 
GGAGAGGCTO 
ATCCATTTAG 
CTTAGCTGTG 
AGTTTTTAAA 
GAGTCACTGC 
CTGTTATTTT 
TACATTTTTA 
ACCCCCACTC 
ACTCAGTGGC 
TGATTTTGTT 
TAGGCAAGGT 
CTAGGTTTAA 
ACAGCAATTT 
CCCCTACCCC 
CTTGAGCCTG 
TAGGGGACGG 
ATAGGCTCCA 
AGCACTAAAT 
AAATGTAGAT 
TATTAGTGGG 
AAGTGGTGAC 
AGCAATGAAG 
TTGGGTGTGT 
AGAGAAAGCA 
TCCTCTCCGC 
CTCTTATGTG 
TCTAGTTCTA 
ACTTGTTCCA 
TAAAGCTTTA 
CTCGTCAAAT 
CGCGTCTATC 
CTTTTGTGCT 
GATTAGGCAA 
TGGGGCAGCA 
AATAATGCCC 
TTAGCTTGAT 
TTTTTTTTCC 
TGAATGACAT 
CTCCTGTGTG 
CTGTAATAAA 



41 
I 

CGCGTOGTGC 
CCCCAAGAAA 
AGAACATAAG 
CTCTGAGAGG 
GGCAGATAAA 
GAAGAAGAAG 
AGAATTCCGC 
AAAGCTGGGT 
GGCGGCAAAG 
GTTTGATGGT 
AGAAGAGGAG 
ATCTGTCTCC 
GAGAAGTGTC 
GTCTCTCAAA 
AGCACCCTGA 
ACTCTCGTGT 
GTTTCTGGCA 
CCTGAGTTTT 
CCCTTGATGC 
TAGCTCAGGG 
CTTCAGGTTG 
GACAAAGGGG 
CTGTTTGTTT 
ATCAATGAAA 
TTTTGTATGT 
AAACTCTTCT 
CATTGTATTT 
CAGCTCACTT 
TGTGTCTGAG 
TCAGCAGCCT 
CTTTTGTCCC 
CAGAGTGTAT 
CCGTGCTCCT 
CTGCTGGTCT 
TATCCTTTTT 
TCTTGGGCCA 
GTATCATGAA 
TGATGTTCAA 
TAGTGTAACA 
ACTAAATACC 
GATACAGTAC 
ATGTTTGAGG 
GCTTTTTCTT 
CCTGTCGGGT 
TTCATAGCCA 
GAAAATGACC 
GAATTTCCCC 
GCTTCCCAAT 
ATGGAAGAGA 
TCATAACTAG 
TCCAAAGTAG 
ACATTGAGTT 
AGATCACTAC 
TAGTTTCTCT 
TTCTGGGCCC 
CCCATTTAAA 
TTTATCCTTC 
TGTCTGTTCT 
G 



51 
I 

CCTGCGCTGC 
CCAAAGGGCA 
AAGAAAAACC 
TGGAAGACGA 
GTGCGCTATG 
GATCCTAATG 
CCCAAGATCA 
GAGATGTGGA 
CTGAAGGAGA 
GCAAAGGGTC 
GAGGAAGAAG 
TTGTGAATAC 
TGTTGCCCTC 
GTGCTCTAGA 
AACTGTATCA 
TCTCCTCACT 
TTTTCTTTTT 
CAACTGTATA 
TCCTTGCTCG 
CGTGCACTGT 
TCTTGTTTCT 
GGTCAGCTGG 
TTAAACAAAC 
GTTCAAGAAC 
TTAGAATGCT 
CTATTATAAC 
GGAGACTGGC 
AGGGCTGAGA 
TGGCATTCAG 
TCCAAGGTAT 
ACCTCCACCC 
GGTGTGTCAA 
GGCACATATC 
CCTGGATGCC 
TTGCTCCTAC 
CCTGAGCTAT 
AAGTTGAATG 
TGTTAAACTG 
TTTTATCCAG 
ATTTTGAAGG 
TGTGTTGTGG 
CTATGAAACA 
AAAATTCACT 
CCTGGATGAG 
TTCGCTCTCC 
ACTAATTTAA 
TCCTGCTTCA 
TCGTGATGTG 
AACAACCTGC 
ATGTACCAAC 
CTAAGCAGAA 
TTAAAGAGGC 
TCAACGTTTT 
GAGATGATGT 
ACTGTCTGTG 
AGGATAGTAC 
GGAAAGAACA 
TGTCACAAAT 



Seq ID NO: 306 Protein sequence: 
Protein Accession #: NP_005333.1



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



1 11 21 31 41 51

| | | | | |

MAKGDPKKPK GKMSAYAFFV QTCREEHKKK NPEVPVNFAE FSKKCSERWK TMSGKEKSKF 60
DEMAKADKVR YDREMKDYGP AKGGKKKKDP NAPKRPPSGF FLFCSEFRPK IKSTNPGISI 120
GDVAKKLGEM WNNLNDSEKQ PYITKAAKLK EKYEKDVADY KSKGKFDGAK GPAKVARKKV 180



Seq ID NO: 307 DNA sequence
Nucleic Acid Accession #: NM_022342
Coding sequence: 1..2178 






1 11 21 31 41 51 

i I I I I I 

ATGGGTACTA GGAAAAAAGT TCATGCATTT GTCCGTGTCA AACCCACCGA TGACTTTGCT 60 

5 CATGAAATGA TCAGATAOQG AGATGACAAA AGAAGCATTG ATATTCACTT AAAAAAAGAC 120 

ATTOGGAGAO GAGTTGTCAA TAACCAACAG ACAGACTGGT CGTTTAAGTT GGATGGAGTT 180 

TTCAOGATG CCTCCCAGGA CTTGGTTTAT GAGACAGTTQ CAAAGGATGT GGTTTCTCAG 240 

CCCTCGATG GCTATAATGG CACCATCATG TGTTATGGGC AGAOGGGAGC TGGCAAGACA 300 

ACACCATGA TGGGGGCAAC TGAQAATTAC AAOCACCGGG GGATCCTCCC TCGTGCCCTG 360 

10 AGCAGGTTT TTAGGATGAT CGAAGAACGC CCCACACATQ CCATCACTGT GCGTGTTTCC 420 

ACTTGGAAA TCTATAATGA GAGCCTGTTT GATCTCCTGT CCACTCTGCC CTATGTTGGA 4 BO 

CCTCAGTCA CACCAATGAC CATCGTGGAA AACCCTCAAG GAGTCTTCAT TAAGGGCTTG 540 

CAGTTCACC TCACAAGTCA GGAGGAGGAT GCATTCAGCC TCCTTTTTGA GGGTGAGACC 600 

ACAGGATTA TAGCCTCCCA CACTATGAAC AAAAACTCTT CCAGATCACA CTGCATTTTC 660 

CCATCTACT TAGAGGCCCA TTCCCGGACC TTATCAGAGG AAAAGTACAT CACTTCCAAA 720

TTAACTTGG TGGATCTGGC AGGCTCAGAG AGGCTGGGGA AGTCTGGGTC TGAGGGCCAA 780 

TCCTGAAGG AAGCCACCTA CATCAACAAA TOGCTCTCAT TCCTGGAGCA GGCCATCATT 840

CCCTTGGGG ACCAGAAGCG GGACCACATC CCCTTTCGGC AGTGCAAGCT CACCCACGCT 900 

TGAAGGACT CGTTAGGGGG AAACTGCAAT ATGGTCCTCG TGACAAACAT CTATGGAGAA 960 

CTGCCCAGT TAGAAGAAAC GCTATCTTGA CTGAGATTTG CCAGCAGGAT GAAGCTAGTC 1020

CCACTGAGC CTGCCATCAA TGAAAAGTAT GATGCTGAGA GAATGGTCAA GAACCTGGAG 1080 

AGGAACTAG CACTACTCAA GCAGGAGCTG GCTATCCATG ACAGCCTGAC CAACCGCACC 1140 

TTGTGACCT ATGACCCCAT GGATGAAATC CAGATTGCTG AGATCAACTC CCAGGTGCGG 1200 

GGTACCTGG AGGGGACACT GGACGAGATC GACATAATCA GCCTTAGACA GATCAAGGAG 1260 

TGTTCAACC AGTTCCGGGT GGTTCTGAGC CAACAGGAAC AGGAAGTGGA GTCCACTTTG 1320

GCAGGAAGT ACACCCTCAT TGACAGGAAT GACTTTGCAG CCATTTCTGC TATCCAGAAG 1380 

CGGGGCTTG TGGATGTTGA TGGCCACCTA GTGGGTGAGC CTGAAGGACA AAACTTTGGA 1440 

TOGGAGTCG CCCCTTTCTC TACCAAACCT GGGAAGAAAG CCAAGTCCAA GAAGACATTC 1500 

AAGAGCCAC TCAGGCCCGA CACCCCACCC TCCAAACCAG TGGCCTTTGA GGAGTTTAAG 1560 

ATGAGCAAG GTAGTGAGAT CAACCGAATT TTCAAAGAAA ACAAATCCAT CTTGAATGAA 1620

GGAGGAAAA GGGCCAGCGA GACCACACAG CACATCAATG CCATCAAGCG GGAGATTGAT 1680 

TGACCAAGG AGGCCCTGAA TTTCCAGAAG TCACTACGGG AGAAGCAAGG CAAGTACGAA 1740 

ACAAGGGGC TGATGATCAT CGATGAGGAA GAATTCCTGC TGATCCTCAA GCTCAAAGAC 1800 

TCAAGAAGC AGTACCGCAG CGAGTACCAG GACCTGCGTG ACCTCAGGGC TGAGATCCAG 1860 

ATTGCCAGC ACCTAGTGGA TCAGTGTCGC CACCGCCTGC TCATGGAATT TGACATCTGG 1920

ACAATGAGT CCTTTGTCAT CCCTGAGGAC ATGCAGATGG CACTGAAGCC AGGCGGCAGC 1980 

TCCGGCCAG GCATGGTCCC TGTGAACAGG ATTGTGTCTC TGGGAGAAGA TGACCAGGAC 2040 

AATTCAGCC AGCTGCAGCA GAGGGTGCTT CCTGAGGGCC CTGATTCCAT CTCCTTCTAC 2100 

ATGCCAAAG TCAAGATAGA GCAGAAGCAT AATTACTTGA AAACCATGAT GGGCCTCCAG 2160 
AGGCACATA GAAAATAG 






Seq ID NO: 308 Protein sequence: 
Protein Accession #: NP_071737

1 11 21 31 41 51
| | | | | |

MGTRKKVBAF VRVKPTDDFA HEMIRYGDDK RSIDIBiKKD IRRGWNNQQ TDWSPKLDGV 60 

LHDASQDLVY BTVAKDWSQ ALDGYNGTIM CYGQTGAGKT YTMMGATENY KHRGILPRAL 120 

QQVFRMIBER PTKAITVRVS YliEIYNESLF DLLSTLPYVG PSVTPMTIVE NPQGVPIKGL 180

SVHLTSQEED APSLLFEGET NRIIASHTMN KNSSRSHCIF TIYLEAHSRT LSEEKYITSK 240

INLVDIiAGSE RLGKSGSEGQ VLKEATYINK SLSPLEQAII AIX3DQKRDHI PFRQCKLTHA 300 

LKDSLGGNCN MVLVTNIYGE AAQLEETLSS LRFASPJMKLV TTEPAINEKY DAERMVKNLE 360 

XELALLXQEL AIHDSLTNRT FVTYDPMDEI QIAEINSQVR RYLEGTLDEI DIISLRQIKE 420 

VPNQFRWLS QQEQEVBSTL RRKYTLIDRN DPAAISAIQK AGLVDVDGHL VGEPEGQNPG 480 

LGVAPPSTKP GKKAXSKKTP KEPLRPDTPP SKPVAFEEFK NEQGSSINRI FKENKSILNE 540

RRKRASETTQ HINAIKREID VTKEALNFQK SLREKOGKYE NKGLMIIDEE EFLLILKLKD 600 

LKKQYRSEYQ DLRDLRAEIQ YCQHLVDQCR HRLLMEPDIW YNESFVIPED MQMALKPGGS 660 

IRPGMVPVNR IVSLGEDDQD KFSQLQQRVL PEGPDSISPY NAKVKIEQKH NYLKTMMGLQ 720 
QAHRK 



Seq ID NO: 309 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51 

| | | | | |

ri-m-n'fTT ttttttttaa TGCCTGCTGT CATGCTCTGT CTACCAGGGT GAATTTCCAA 60 

AAATTTCTGC ATAGCAATTT TAGCCAAAAC TATATATGTT CTGGGGAGGA TAGGCATAGG 120 

CACATTGAAG ACCAAAGGAA AGAGTGAAGA AGTGTAGTTG GGTCATTGTG AATGGATGTT 180 

TAGATTGTCA AGAAAAGTGG GCCAGAGGCC CCACCTCACA CTAGGACGGC AATTGCCTCT 240 

CATTAGTATC TCAGGCACCA TGGGTCTTAT TTGGTGTCAT AAGAAACACC CTCAACAAAG 300

TAATGAACCC TCAGCCTCCA GCTTCTCTTC TTCGGGATTC TTCTTAGGGC CTCCTTTTTC 360 

CTTTTATGTT TCCAGTACCC TGAATTTCTT ATTCCCATCC CCCATTAAAA TCTGCTTCAA 420 

AGAAAAAACA AGAAGGACAC ATTCACTTTA AGATCCAAAT GAATGATAAG AGCTTAAAAC 480 
ATTATACTTA TCAGTATTAT TTGCATTTTT ATAGAAACCA AAACCATATT TCAACAAC 



Seq ID NO: 310 DNA sequence 
Nucleic Acid Accession #: NM_018622
Coding sequence: 1..1140



1 11 21 31 41 51

| | | | | |

ATGGCGTGGC GAGGCTGGGC GCAGAGAGGC TGGGGCTGOG GCCAGGCGTG GGGTGCGTOG 60 

GTGGGCGGCC GCAGCTGCGA GGAGCTCACT GCGGTCCTAA CCCCGCCGCA GCTCCTCGGA 120 

OGCAGGTTTA ACTTCTTTAT TCAACAAAAA TGCGGATTCA GAAAAGCACC CAGGAAGGTT 180

GAACCTCGAA GATCAGACCC AGGGACAAGT GGTGAAGCAT ACAAGAGAAG TGCTTTGATT 240

CCTCCTGTGG AAGAAACAGT CTTTTATCCT TCTCCCTATC CTATAAGGAG TCTCATAAAA 300 

CCTTTATTTT TTACTGTTGG GTTTACAGGC TGTGCATTTG GATCAGCTGC TATTTGGCAA 360 




TATGAATCAC TGAAATCCAG GGTCCAGAGT TATTTTGATQ GTATAAAAGC TGATTGGTTG 420
GATAGCATAA GACCACAAAA AGAAGGAGAC TTCAGAAAGG AGATTAACAA GTGGTGGAAT 480
AACCTAAGTG ATGGCCAGCG GACTGTGACA GGTATTATAG CTGCAAATGT CCTTGTATTC 540
TGTTTATGGA GAGTACCTTC TCTGCAGCGG ACAATGATCA GATATTTCAC ATCGAATCCA 600
GCCTCAAAGG TCCTTTGTTC TCCAATGTTG CTGTCAACAT TCAGTCACTT CTCCTTATTT 660
CACATGGCAG CAAATATGTA TGTTTTGTGG AGCTTCTCTT CCAGCATAGT GAACATTCTG 720
GGTCAAGAGC AGTTCATGGC AGTGTACCTA TCTGCAGGTG TTATTTCCAA TTTTGTCAGT 780
TACCTGGGTA AAGTTGCCAC AGGAAGATAT GGACCATCAC TTGGTGCATC TGGTGCCATC 840
ATGACAGTCC TCGCAGCTGT CTGCACTAAG ATCCCAGAAG GGAGGCTTGC CATTATTTTC 900
CTTCCGATGT TCACGTTCAC AGCAGGGAAT GCCCTGAAAG CCATTATCGC CATGGATACA 960
GCAGGAATGA TCCTGGGATG GAAATTTTTT GATCATGCGG CACATCTTGG GGGAGCTCTT 1020
TTTGGAATAT GGTATGTTAC TTACGGTCAT GAACTGATTT GGAAGAACAG GGAGCCGCTA 1080
GTGAAAATCT GGCATGAAAT AAGGACTAAT GGCCCCAAAA AAGGAGGTGG CTCTAAGTAA

Seq ID NO: 311 Protein sequence:
Protein Accession #: NP_061092.2

1 11 21 31 41 51
| | | | | |
MAWRGWAQRG WGCGQAWGAS VGGRSCEELT AVLTPPQLLG RRFNFFIQQK CGFRKAPRKV 60
EPRRSDPGTS GEAYKRSALI PPVEETVFYP SPYPIRSLIX PLPFTVGPTG CAFGSAAIWQ 120
YESLKSRVQS YFDGIKADWL DSIRPQKEGD FRKEINKWWN NLSDGQRTVT GIIAANVLVP 180
CLWRVPSLQR TMIRYFTSNP ASKVLCSPML LSTFSHFSLP HMAANMYVLW SFSSSIVNIL 240
GQEQFMAVYL SAGVISNFVS YLGKVATGRY GPSLGASGAI MTVLAAVCTK IPEGRLAIIP 300
LPWFTFTAGN ALKAIIAMDT AGMILGWKFF DHAAHLGGAL PGIWYVTYGH ELIWKNREPL 360
VKIWHEIRTN GPKKGGGSK

Seq ID NO: 312 DNA sequence
Nucleic Acid Accession #: NM_000625
Coding sequence: 195..3656



1 11 21 31 41 51
| | | | | |
CTCTCGGCCA CCTTTGATGA GGGGACTGGG CAGTTCTAGA CAGTCCCGAA GTTCTCAAGG 60
CACAGGTCTC TTCCTGGTTT GACTGTCCTT ACCCCGGGGA GGCAGTGCAG CCAGCTGCAA 120
GCCCCACAGT GAAGAACATC TGAGCTCAAA TCCAGATAAG TGACATAAGT GACCTGCTTT 180
GTAAAGCCAT AGAGATGGCC TGTCCTTGGA AATTTCTGTT CAAGACCAAA TTCCACCAGT 240
ATGCAATGAA TGGGGAAAAA GGCATCAACA ACAATGTGGA GAAAGCCCCC TGTGCCACCT 300
CCAGTCCAGT GACACAGGAT GACCTTCAGT ATCACAACCT CAGCAAGCAG CAGAATGAGT 360
CCCCGCAGCC CCTCGTGGAG ACGGGAAAGA AGTCTCCAGA ATCTCTGGTC AAGCTGGATG 420
CAACCCCATT GTCCTCCCCA CGGCATGTGA GGATCAAAAA CTGGGGCAGC GGGATGACTT 480
TCCAAGACAC ACTTCACCAT AAGGCCAAAG GGATTTTAAC TTGCAGGTCC AAATCTTGCC 540
TGGGGTCCAT TATGACTCCC AAAAGTTTGA CCAGAGGACC CAGGGACAAG CCTACCCCTC 600
GAGATGAGCT TCTACCTCAA GCTATCGAAT TTGTCAACCA ATATTACGGC TCCCTCAAAG 660
AGGCAAAAAT AGAGGAACAT CTGGCCAGGG TGGAAGCGGT AACAAAGGAG ATAGAAACAA 720
CAGTAACCTA CCAACTGACG GGAGATGAGC TCATCTTCGC CACCAAGCAG GCCTGGCGCA 780
ATGCCCCACG CTGCATTGGG AGGATCCAGT GGTCCAACCT GCAGGTCTTC GATGCCCGCA 840
GCTGTTCCAC TGCCCGGGAA ATGTTTGAAC ACATCTGCAG ACACGTGCGT TACTCCACCA 900
ACAATGGCAA CATCAGGTCG GCCATCACCG TGTTCCCCCA GCGGAGTGAT GGCAAGCACG 960
ACTTCCGGGT GTGGAATGCT CAGCTCATCC GCTATGCTGG CTACCAGATG CCAGATGGCA 1020
GCATCAGAGG GGACCCTGCC AACGTGGAAT rCACTCAGCT GTGCATCGAC CTGGGCTGGA 1080
AGCCCAAGTA CGGCCGCTTC GATGTGGTCC CCCTGGTCCT GCAGGCCAAT GGCCGTGACC 1140
CTGAGCTCTT CGAAATCCCA CCTGACCTTG TGCTTGAGGT GGCCATGGAA CATCCCAAAT 1200
ACGAGTGGTT TCGGGAACTG GAGCTAAAGT GGTACGCCCT GCCTGCAGTG GCCAACATGC 1260
TGCTTGAGGT GGGCGGCCTG GAGTTCCCAG GGTGCCCCTT CAATGGCTGG TACATGGGCA 1320
CAGAGATCGG AGTCCGGGAC TTCTGTGATG TCCAGCGCTA CAACATCCTG GAGGAAGTGG 1380
GCAGGAGAAT GGGCCTGGAA ACGCACAAGC TGGCCTCGCT CTGGAAAGAC CAGGCTGTCG 1440
TTGAGATCAA CATTGCTGTG CTCCATAGTT TCCAGAAGCA GAATGTGACC ATCATGGACC 1500
ACCACTCGGC TGCAGAATCC TTCATGAAGT ACATGCAGAA TGAATACCGG TCCCGTGGGG 1560
GCTGCCCGGC AGACTGGATT TGGCTGGTCC CTCCCATGTC TGGGAGCATC ACCCCCGTGT 1620
TTCACCAGGA GATGCTGAAC TACGTCCTGT CCCCTTTCTA CTACTATCAG GTAGAGGCCT 1680
GGAAAACCCA TGTCTGGCAG GACGAGAAGC GGAGACCCAA GAGAAGAGAG ATTCCATTGA 1740
AAGTCTTGGT CAAAGCTGTG CTCTTTGCCT GTATGCTGAT GCGCAAGACA ATGGCGTCCC 1800
GAGTCAGAGT CACCATCCTC TTTGCGACAG AGACAGGAAA ATCAGAGGCG CTGGCCTGGG 1860
ACCTGGGGGC CTTATTCAGC TGTGCCTTCA ACCCCAAGGT TGTCTGCATG GATAAGTACA 1920
GGCTGAGCTG CCTGGAGGAG GAACGGCTGC TGTTGGTGGT GACCAGTACG TTTGGCAATG 1980
GAGACTGCCC TGGCAATGGA GAGAAACTGA AGAAATCGCT CTTCATGCTG AAAGAGCTCA 2040
ACAACAAATT CAGGTACGCT GTGTVTGGCC TCGGCTCCAG CATGTACCCT CGGTTCTGOG 2100
CCTTTGCTCA TGACATTGAT CAGAAGCTGT CCCACCTGGG GGCCTCTCAG CTCACCCCGA 2160
TGGGAGAAGG GGATGAGCTC AGTGGGCAGG AGGACGCCTT CCGCAGCTGG GCCGTGCAAA 2220
CCTTCAAGGC AGCCTGTGAG ACGTTTGATG TCCGAGGCAA ACAGCACATT CAGATCCCCA 2280
AGCTCTACAC CTCCAATGTG ACCTGGGACC CGCACCACTA CAGGCTOGTG CAGGACTCAC 2340
AGCCTTTGGA CCTCAGCAAA GCCCTCAGCA GCATGCATGC CAAGAACGTG TTCACCATGA 2400
GGCTCAAATC TOGGCAGAAT CTACAAAGTC CGACATCCAG CCGTGCCACC ATCCTGGTGG 2460
AACTCTCCTG TGAGGATGGC CAAGGCCTGA ACTACCTGCC GGGGGAGCAC CTTGGGGTTT 2520
GCCCAGGCAA CCAGCCGGCC CTGGTCCAAG GCATCCTGGA GCGAGTGGTG GATGGCCCCA 2580
CACCCCACCA GGCAGTGCGC CTGGAGGCCC TGGATGAGAG TGGCAGCTAC TGGGTCAGTG 2640
ACAAGAGGCT GCCCCCCTGC TCACTCAGCC AGGCCCTCAC CTACTTCCTG GACATCACCA 2700
CACCCCCAAC CCAGCTGCTG CTCCAAAAGC TGGCCCAGGT GGCCACAGAA GAGCCTGAGA 2760
GACAGAGGCT GGAGGCCCTG TGCCAGCCCT CAGAGTACAG CAAGTGGAAG TTCACCAACA 2820
GCCCCACATT CCTGGAGGTG CTAGAGGAGT TCCCGTCCCT GCGGGTGTCT GCTGGCTTCC 2880
TGCTTTCCCA GCTCCCCATT CTGAAGCCCA GGTTCTACTC CATCAGCTCC CCCCGGGATC 2940
ACACGCCCAC GGAGATCCAC CTGACTGTGG CCGTGGTCAC CTACCACACC CGAGATGGCC 3000
AGGGTCCCCT GCACCACGGC GTCTGCAGCA CATGGCTCAA CAGCCTGAAG CCCCAAGACC 3060
CAGTGCCCTG CTTTGTGCGG AATGCCAGCG GCTTCCACCT CCCCGAGGAT CCCTCCCATC 3120
CTTGCATCCT CATCGGGCCT GGCACAGGCA TCGCGCCCTT CCGCAGTTTC TGGCAGCAAC 3180
GGCTCCATGA CTCCCAGCAC AAGGGAGTGC GGGGAGGCCG CATGACCTTG GTGTTTGGGT 3240
GCCGCCGCCC AGATGAGGAC CACATCTACC AGGAGGAGAT GCTGGAGATG GCCCAGAAGG 3300
GGGTGCTGCA TGCGGTGCAC ACAGCCTATT CCCGCCTGCC TGGCAAGCCC AAGGTCTATG 3360
TTCAGGACAT CCTGCGGCAG CAGCTGGCCA GCGAGGTGCT CCGTGTGCTC CACAAGGAGC 3420
CAGGCCACCT CTATGTTTGC GGGGATGTGC GCATGGCCCG GGACGTGGCC CACACCCTGA 3480
AGCAGCTGGT GGCTGCCAAG CTGAAATTGA ATGAGGAGCA GGTCGAGGAC TATTTCTTTC 3540
AGCTCAAGAG CCAGAAGCGC TATCACGAAG ATATCTTTGG TGCTGTATTT CCTTACGAGG 3600
CGAAGAAGGA CAGGGTGGCG GTGCAGCCCA GCAGCCTGGA GATGTCAGCG CTCTGAGGGC 3660
CTACAGGAGG GGTTAAAGCT GCCGGCACAG AACTTAAGGA TGGAGCCAGC TCTGCATTAT 3720
CTGAGGTCAC AGGGCCTGGG GAGATGGAGG AAAGTGATAT CCCCCAGCCT CAAGTCTTAT 3780
TTCCTCAACG TTGCTCCCCA TCAAGCCCTT TACTTGACCT CCTAACAAGT AGCACCCTGG 3840
ATTGATCGGA GCCTC

Seq ID NO: 313 Protein sequence:
Protein Accession #: NP_000616

1 11 21 31 41 51
| | | | | |
MACPWKFLFK TKFHQYAMNG EKGINNNVEK APCATSSPVT QDDLQYHNLS KQQNESPQPL
VETGKKSPES LVKLDATPLS SPRHVRIKNW GSGMTFQDTL HHKAKGILTC RSKSCLGSIM
TPXSLTRGPR DKPTPPDELL PQAIEFVNQY YGSLKEAKIE EHLARVEAVT KEIETTVTYQ
LTGDELIFAT KQAWRNAPRC IGRIQWSNLQ VFDARSCSTA REMFEHICRH VRYSTNNGNI
RSAITVFPQR SDGKHDFRVW NAQLIRYAGY QMPDGSIRGD PANVEFTQLC IDLGWKPKYG
RFDWPLVLQ ANGRDPELFE IPPDLVLEVA MEHPKYEWFR ELELKWYALP AVANMLLBVG
GLEFPGCPFN GWYMGTEIGV RDFCDVQRYN ILEEVGRRMG LETHKLASliW KDQAWEINI
AVLHSFQKQN VTIMDHHSAA ESFMKYMQNE YRSRGGCPAD WIWLVPPMSG SiTPVFHQEM
LNYVLSPFYY YQVEAWKTHV WQDEKRRPKR REIPLKVLVK AVLFACMLMR KTMASRVRVT
ILFATETGKS EALAWDLGAL FSCAFNPKW CMDKYRLSCL EEERLLLWT STFGNGDCPG
NGEKLKKSLF MLKELNNKFR YAVFGLGSSM YPRFCAFAHD IDQKL5HLGA SQLTPKGEGD
HLSGQEDAFR SWAVQTFKAA CETFDVRGKQ HIQIPKLYTS NVTWDPHHYR LVQDSQPLDL
SKALSSMHAK NVFTMRLKSR QNLQSPTSSR ATILVELSCE DGQGLKYLPG EHLGVCPGNQ
PALVQGILER WDGPTPHQA VRLEALDESG SYWVSDKRLP PCSLSQALTY FLDITTPPTQ
LLLQKLAQVA TEEPERQRLB ALOQPSEYSK WKFTNSPTFL EVLEBFPSLR VSAGFXtLSQIi
PILKPRPYSI SSPRDHTPTE IHLTVAWTY HTRDGQGPLH HGVCSTWLNS LKPQDPVPCP
VRNASGFHLP EDPSHPCILI GPGTGIAPPR SFWQQRLHDS QHKGVRGGRM TliVFGCRRPD
EDHIYQEEML EMAQKGVLHA VHTAYSRLPG KPKVYVQDIL RQQLASEVLR VLEKEPGHLY
VCGDVRMARD VAHTLKQLVA AKLKLNEEQV EDYFFQLKSQ KRYHEDIFGA VPPYEAKKDR
VAVQPSSLEM SAL



Seq ID NO: 314 DNA sequence 
Nucleic Acid Accession #: XM_087254 
Coding sequence: 47..2332



1 11 21 31 41 51
| | | | | |
AGAGTACGTG TTTACAGATA AAACTGGTAC ACTGACAGAA AATGAGATGC AGTTTCGGGA 60
ATGTTCAATT AATGGCATGA AATACCAAGA AATTAATGGT AGACTTGTAC CCGAAGGACC 120
AACACCAGAC TCTTCAGAAG GAAACTTATC TTATCTTAGT AGTTTATCCC ATCTTAACAA 180
CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAGT CCTGAAAATG AAACTGAACT 240
AATTAAAGAA CATGATCTCT TCTTTAAAGC AGTCAGTCTC TGTCACACTG TACAGATTAG 300
CAATGTTCAA ACTGACTGCA CTGGTGATGG TCCCTGGCAA TCCAACCTGG CACCATCGCA 360
GTTGGAGTAC TATGCATCTT CACCAGATGA AAAGGCTCTA GTAGAAGCTG CTGCAAGGAT 420
TGGTATTGTG TTTATTGGCA ATTCTGAAGA AACTATGGAG GTTAAAACTC TTGGAAAACT 480
GGAACGGTAC AAACTGCTTC ATATTCTGGA ATTTGATTCA GATCGTAGGA GAATGAGTGT 540
AATTGTTCAG GCACCTTCAG GTGAGAAGTT ATTATTTGCT AAAGGAGCTG AGTCATCAAT 600
TCTCCCTAAA TGTATAGGTG GAGAAATAGA AAAAACCAGA ATTCATGTAG ATGAATTTGC 660
TTTGAAAGGG CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAGAGTATGA 720
GGAAATAGAT AAACGCATAT TTGAAGCCAG GACTGCCTTG CAGCAGCGGG AAGAGAAATT 780
GGCAGCTGTT TTCCAGTTCA TAGAGAAAGA CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840
AGACAGACTA CAAGATAAAG TTCGAGAAAC TATTGAAGGA TTGAGAATGG CTGGTATCAA 900
AGTATGGGTA CTTACTGGGG ATAAACATGA AACAGCTGTT AGTGTGAGTT TATCATGTGG 960
CCATTTTCAT AGAACCATGA ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCGAGTG 1020
TGCTGAACAA TTGAGGCAGC TTGCCAGAAG AATTACAGAG GATCATGTGA TTCAGCATGG 1080
GCTGGTAGTG GATGGGACCA GCCTATCTCT TGCACTCAGG GAGCATGAAA AACTATTTAT 1140
GGAAGTTTGC AGAAATTGTT CAGCTGTATT ATGCTGTCGT ATGGCTCCAC TGCAGAAAGC 1200
AAAAGTAATA AGACTAATAA AAATATCACC TGAGAAACCT ATAACATTGG CTGTTGGTGA 1260
TGGTGCTAAT GACGTAAGCA TGATACAAGA AGCCCATGTT GGCATAGGAA TCATGGGTAA 1320
AGAAGGAAGA CAGGCTGCAA GAAACAGTGA CTATGCAATA GCCAGATTTA AGTTCCTCTC 1380
CAAATTGCTT TTTGTTCATG GTCATTTTTA TTATATTAGA ATAGCTACCC TTGTACAGTA 1440
TTTTTTTTAT AAGAATGTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTGTTT 1500
GTTTTCTCAG CAAACATTGT ATGACAGCGT GTACCTGACT TTATACAATA TTTGTTTTAC 1560
TTCCCTACCT ATTCTGATAT ATAGTCTTTT GGAACAGCAT GTAGACCCTC ATGTGTTACA 1620
AAATAAGCCC ACCCTTTATC GAGACATTAG TAAAAACCGC CTCTTAAGTA TTAAAACATT 1680
TCTTTATTGG ACCATCCTGG GCTTCAGTCA TGCCTTTATT TTCTTTTTTG GATCCTATTT 1740
ACTAATAGGG AAAGATACAT CTCTGCTTGG AAATGGCCAG ATGTTTGGAA ACTGGACATT 1800
TGGCACTTTG GTCTTCACAG TCATGGTTAT TACAGTCACA GTAAAGATGG CTCTGGAAAC 1860
TCATTTTTGG ACTTGGATCA ACCATCTCGT TACCTGGGGA TCTATTATAT TTTATTTTGT 1920
ATTTTCCTTG TTTTATGGAG GGATTCTCTG GCCATTTTTG GGCTCCCAGA ATATGTATTT 1980
TGTGTTTATT CAGCTCCTGT CAAGTGGTTC TGCTTGGTTT GCCATAATCC TCATGGTTGT 2040
TACATGTCTA TTTCTTGATA TCATAAAGAA GGTCTTTGAC CGACACCTCC ACCCTACAAG 2100
TACTGAAAAG GCACAGCTTA CTGAAACAAA TGCAGGTATC AAGTGCTTGG ACTCCATGTG 2160
CTGTTTCCCG GAAGGAGAAG CAGCGTGTGC ATCTGTTGGA AGAATGCTGG AACGAGTTAT 2220
AGGAAGATGT AGTCCAACCC ACATCAGCAG ATCATGGAGT GCATCGGATC CTTTCTATAC 2280
CAACGACAGG AGCATCTTGA CTCTCTCCAC AATGGACTCA TCTACTTGTT AAAGGGGCAG 2340
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TAGTACTTTG TGGGAGCCAO TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 2400 

ATGGCCACAC TAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 2460 

GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC AOCCTTAATT 2520 

TGAATCTGAA CATGTTAAAA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT 2580 

TGTCCCTTGT GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG GCCATTATAT 2640 

TTTAATATAA ATGTAGAAAA AAGAGAGAAA TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT TATGCTGAAA GTTTGCCTTG 2760 

AGAACTCTAT TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGAAAA GTTAATGTGA 2820 

ATACTGAGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATTCATTAAT GCATTCTGAG 2880 

TTCACAGAGC AAATTAGGAG AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940 

ATTTATACCA ATTCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAGGGTATA TCATATATAC AAATCAGGAA TCAGGTCOGT TCACOGAACT TCAAATTGAT 3060 

GTTTACTAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

TAGCATATTA TTAATTTAAT GTCTTTATCA TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180 

ACATATTTAA ATTTGCTTTT TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240 

CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 

TATGTTTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 

CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420 

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 3480 

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

AATTTATCAA GTAGTTCAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATGTTGTC 3720 

TTAAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780 

TTGTATACAT ATTAAGATAA TGGTTAAATG CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG CTAA7GTAAC ATTTGTCTGT TCCAGTGATG 4020 

TAAGGATATT AAGTTATTAA GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080 

GATATTTCAT AGCTGGATTT AGGAAGATCT GTTATTCTGG AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAAAATCTC TTATAAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

GCCATCAAAT AAACTGAGTA CTGACACCAO ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

TGACCAACTG CAGCAAGACA GGAGGTCAGC TOGCCTATAA TGGTGCTTAA AGTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCOGTTT 4560 

TTAAAGCAAA ATTATCTTGT GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620 

CAATGCAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680 

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 4800 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4860 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920 

GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4980 

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA 5040 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

CAGAGGCAAA TCAACCCTAG GAAATACTTG CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220 

AGGTCATTTC TACTGGAAAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTCG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460 

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAATGTT GTTCCTATAA AAAATAGATC ATAACTCATG ATATGTTTGT 5640 

AATCATGGTA ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATATTGT AGCAATACTT 5700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT TAAAGTTTAT 5820 
GAAGTTATAT TTATCAAATA AAAACTTTCC TATAT 

Seq ID NO: 315 Protein sequence:
Protein Accession #: XP_087254



1 11 21 31 41 51 

| | | | | |

MQFRECSING MKYQEINGHL VPSGPTPDSS EGNLSYLSSL SHLNNLSHLT TSSSFRTSPE 60 

NETELIKEHD LFFKAVSLCH TVQISNVQTD CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 120 

AAARIGIVFI GNSEETMEVK TLGKLERYKL LHILEFDSDR RRMSVIVQAP SGEKLLFAKG 180 

AESSILPKCI GGEIEKTRIH VDEFALKGLR TLCIAYRKFT SKEYEEXDKR IFEARTALQQ 240 

REEKLAAVFQ FIEKDLILLG ATAVEDRLQD KVRETIEALR MAGIKVWVLT GDKHETAVSV 300 

SLSOGHFHRT MNILELINQK SDSECAEQLR QLARRITEDH VIQHGLWDG TSLSLALREH 360 

EKLFMEVCRN CSAVLCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQBAHVGI 420 

GIMGKEGKQA ARNSDYAIAR FKFLSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 480 

QFYCLFSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTL YRDISKNRLL 540 

SIKTFLYWTI LGFSHAFIFF FGSYLLIGKD TSLLGNGQMF GNWTFGTLVF TVMVITVTVK 600 

MALETHFWTW INHLVTWGSI IFYFVFSLFY GGILWPFLGS QNMYFVFIQL LSSGSAWFAI 660 

ILMWTCLFL DIIKKVFDRH LHPTSTEKAQ LTETNAGIKC LDSMCCFPEG EAACASVGRM 720 
LERVIGRCSP THISRSWSAS DPFYTNDRSI LTLSTMDSST C 

Seq ID NO: 316 DNA sequence
Nucleic Acid Accession #: NM_004473
Coding sequence: 661..1791

1 11 21 31 41 51

| | | | | |

CTCGCCAGCG GTCCGOGGGG CTGGAGACCC ACGCCGTGGA GAGGACCAGC CTCAGGTCGC 60
CCCGCCTGGG CCCGCGCCCC GACCTCGCTO CCCOCGCCTC GCCTCTCTGC CCGTGGCGCT 120
TACCGCCACC TTGGCCTCGG GGGCAGGGCA TGGOCGGCCC CCGCCAGATC GCCCAGCGCC 180
AGTACTAACT GCCCTOGCTC TGGCCTTCGA GCCCGAAGCC TCTTCTGOGC GCACAACCTA 240
GGCAGTAATC CTAAACTAGC GGGCACCACA GACCAGCTGC AGCCACCCCA ACCCAGGGAT 300
CACTTCCGGA CCCCTOGACC GCCCGGCACC AGCGOGCAAG GGACCCTTCA GCCGGAGACC 360
AGAGTCCAGT CCOGGTOGCG AGGCCACCGC CGCTGCCCGC CTCGAGAAGC ACAACGCGGG 420
CTGAGCCGTC GGCTAGOGGG TCACTCCCGA GCCTCTGTCT GCACCGOGCC AGCCCCAGAC 480
CACGGACGCT GAGCCTCCAG CGCGCGCCAG CCTGGGCOGC TGGGCTCTCC GGGCCAGCCC 540
GCGAOGATCC CCTGAGCTCT CCGCAGAAGG GCCGAGCGTC CGTTCCGGGG ACGCCAGGCC 600
CGCCCCCGCC CCCCGACAGC CGCGGGGATC CAGAGCCGGG GGGTGOGGGA CGCCCGCGCC 660
ATGACTGCCG AGAGCGGGCC GCCGCCGCOG CAGCCGGAGG TGCTGGCTAC CGTGAAGGAA 720
GAGCGCGGCG AGAOGGCAGC AGGGGCCGGG GTCCCAGGGG AGGCCACGGG CCGCGGGGCG 780
GGOGGGCGGC GCCGCAAGCG CCCCCTGCAG CGOGGGAAGC CGCCCTACAG CTACATCGCG 840
CTCATCGCCA TGGCCATCGC GCAOGCGCCC GAGCGCCGCC TCACGCTGGG CGGCATCTAC 900
AAGTTCATCA CCGAGCGCTT CCCCTTCTAC OGCGACAACC CCAAAAAGTG GCAGAACAGC 960
ATCCGCCACA ACCTCACACT CAACGACTGC TTCCTCAAGA TCCCGCGCGA GGCCGGCCGC 1020
CCGGGTAAGG GCAACTACTG GGCGCTOQAC CCCAACGCGG AGGACATGTT CGAGAGCGGC 1080
AGCTTCCTGC GCCGCCGCAA GCGCTTCAAG CGCTCGGACC TCTCCACCTA CCCGGCTTAC 1140
ATGCACGACG CGGCGGCTGC CGCAGCCGCC GCTGCCGCAG CCGCCGCCGC CGCCGCOGCC 1200
GCOGCCATCT TCCCAGGCGC GGTGCCCGCC GCGCGCCCCC CCTACCOGGG CGCCGTCTAT 1260
GCAGGCTACG CGCCGCCGTC GCTGGCCGCG COGCCTCCAG TCTACTACCC CGCGGCGTCG 1320
CCCGGCCCTT GCCGCGTCTT CGGCCTGGTT CCTGAGCGGC CGCTCAGCCC AGAGCTGGGG 1380
CCCGCACOGT CGGGGCCCGG CGGCTCTTGC GCCTTTGCCT CCGCCGGCGC CCCCGCTACC 1440
ACCACCGGCT ACCAGCCCGC AGGCTGCACC GGGGCCCGGC CGGCCAACCC CTCTGCCTAT 1500
GCGGCTGCCT ACGCGGGCCC CGACGGCGCG TACCCGCAGG GCGCCGGCAG TGOGATCTTT 1560
GCCGCTGCTG GCCGCCTGGC GGGACCCGCT TCGCCCCCAG CGGGCGGCAG CAGTGGCGGC 1620
GTGGAGACCA CGGTGGACTT CTACGGGCGC ACGTCGCCCG GCCAGTTCGG AGCGCTGGGA 1680
GCCTGCTACA ACCCTGGCGG GCAGCTCGGA GGGGCCAGTG CAGGCGCCTA CCATGCTCGC 1740
CATGCTGCCG CTTATCCCGG TGGGATAGAT OGGTTOGTGT CCGCCATGTG AGCCAGCGTA 1800
GGGACGAAAA CTCATAGACA CATOGGCTGT TCACACGTTC CCCGCAACCT GAGAACGAAC 1860
AGGAATGGAG AGAGGACTCA ACTGGGACCC ACGTGGAAAA GACCGAGCAG GCCACAGAGG 1920
CTCGGTCTCC CCGCGCACAG CGTAGGCACC CTGTGTACTC TGTAAACGGG AGGAGGTGGG 1980
GCGAGGCAGC CAGAGCCCTT GGACTGGCAC AGGGACCCTC GATGGAGCGA AGCCCTCAAA 2040
CGGGATGCTT TCTGGCATTC TATOGGGGAG GGTCCTTGGC GGTAACCAGA GGGCAGCGTA 2100
GTGTCAACAC CAGAGACCAG GATCCAAATT GTGGGGAATC AGTTTCAGCC TTCCATGTGC 2160
TGCCGGAACT CGGGCCTTTT TACGCGGTTC GTCCTCTAGT GCCTTTAACT GCGTTACTAC 2220
AATAAAAGGC TGCGGCAGCG CCTTTCTTCT TAAAGTGAGG AGGACAAATT TGCAAAAGAA 2280
ATAGGCTTTT CTTCTTTTTT AAATTGGAGA AATCTCTGCT CTGGTTGACC TGGGCTGGTT 2340
TTCCCTGTCT CTGAGAACTT GAGACCTAGC TCCGAGTTGA ACTGTGCGTC AGCACTCCAG 2400
TCCCATCACC TGAACCTTCA GTCTCCCCCA TCTGTTACAC TAGAGGGCTG CAGGACTCTA 2460
TCCACCGCCC CCGGGTTATC ATTCAGGGCC CCATCATCTT GGATGCTGCC CTGCGTATTT 2520
GGCAGCAATG GTGGGCCACC CAGGGCCTCT GAGTAGCCAC CCAAAGCCTA GCCGCTGTTC 2580
TAGGGAACGG AAAAGAGTTC ATGGCCAAGC GTCTAACCTA AAGTCCCAGG ATTGGCTCCA 2640
GGCAGCAATT ATATCATAAC TTATTGAACT TTTGAGCAGG ACGTGCTGGT AATTTCATGG 2700
CTGTTACTGC CCAGTCATAA ATCTGCTTTT CCATTATAAG GCAGAGAGAA GTACATTCGT 2760
TCATTTGTCC ACTGTTTCTT GTCATCACGC AGCCCTGGAC CCAAAGGGTG AACTAAAGTT 2820
TAAGGAGATG AGAGGATTCA AGGAGCCCGT TGGTGACGCC TTTCAGTAGC TGGGGAGGGC 2880
TCTTCCATCC CCAGCACCOC CTGCTACACC TCAGCAGCCT CCCCCATGCA AAAAGGAAAG 2940
AGAAAAATTA AGTTAGGGCA GTCAGTAAAG TGAGCTTTAG AAAGAAACTG GAATTTTAAC 3000
TTCATTTTGT ATCTTGCTTA AGTAGCAGGC TCACTAAAAT TAGAGAAAGT CCAATAACTC 3060
TCCCCCTTTC CCTTGAGAAA TCTTTAAGTT TCGATTCTGG AGCAAAAACT TTCAGCATTA 3120
AATATTTCAG AGGCTCCATT CACAGCTTTC AGATAAACTG GAGTGTTCAG ATGGACTGTT 3180
TTAATAAAAA TCTTTGAGCA AGTGAGTTAT GGCAAGAGAA ACTCAGCCTC TTTCTGTATA 3240
AACTTAACAG GGAAGGGCTG GGGTGTGAAA AAGAAGATTG TATGAAAACC ATTGGTAATT 3300
TTTATTTTTT ATTTTTGGGA CTGCACTATC CTGTTCACGA AGACATGTGA ACTTGGTTCA 3360
GTCCAAATGG GGATTTGTAT AAACCAGTGC TCTCCATTAG AAATATGGTG CAAGCCACAT 3420
ATGTAATTTT AAATATTCTA GTAGCCACAT TAATAAAGTN AAAAGAAACA AAAAAAAAAA 3480
AA

Seq ID NO: 317 Protein sequence:
Protein Accession #: NP_004464

1 11 21 31 41 51
| | | | | |
FKHLTHYROI DTRANSCRIP TIONFACTOR TTFMTAESGP PPPQPEVLAT VKEERGETAA 60
GAGVPGEATG RGAGGRRRKR PLQRGKPPYS YIALIAMAIA HAPERRLTLG GIYKPITERF 120
PFYRDNPKKW QNSIRHNLTL NDCPLKXPRE AGRPGKGNYH ALDPKAEDMP ESGSFLRRRK 180
RFKRSDLSTY PAYMHDAAAA AAAAAAAAAA AAAAAIPPGA VPAARPPYPG AVYAGYAPPS 240
LAAPPPVYYP AASPGPCRVF GLVPERPLSP ELGPAPSGPG GSCAFASAGA PATTTGYQPA 300
GCTGARPAHP SAYAAAYAGP OGAYPQGAGS AIFAAAGRIA GPASPPAGGS SGGVETTVDF 360
YGRTSPGQFG ALGACYNPGG QLGGASAGAY HARHAAAYPG GIDRFVSAM



Seq ID NO: 318 DNA sequence 
Nucleic Acid Accession #: NM_005688 
Coding sequence: 126..4439



1 11 21 31 41 51
| | | | | |
CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120
AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240
GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300
TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360
GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420
ACCCAGTGGA CAATGCTGGG CTTTTTTOCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA OGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGQC IGGCTTCAGT GGACCAOCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGAOGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAAT6TTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTGGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CAOGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCGACAGCTG 4200 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTCQCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 319 Protein sequence: 
Protein Accession #: NP_005679 



I 

MKDIDIGKEY 
LDASMHSQLR 
VAHKKGELSM 
LSIVCLMITQ 
ALNYRTGVRL 
PWAILGMIY 
VLTYIKFIKM 
HMTLGFDLTA 
NKPASPHIKI 
VLABQK6HLL 
KTSLISAILG 
CCLRPDliAIL 
NHIFNSAIRK 
PNNLLLGETP 
VPWSVYGVYI 
VSDSMKDNPH 
KFFDTTPTGR 
LVILFSVfcHI 
LDDNQAPFFL 
GLFQPTVRLA 
RYRENLPLVL 
IGLADLRSKL 



MLTIAHRLHT 



11 
I 

IIPSPGYRSV 
ILDEEHPKGK 
EDVWSLSKHE 
LAGFSGPAFM 
RGAILTMAFK 
NVIILGPTGP 
YAWVKAFSQS 
AQAFTWTVF 
EMKNATLAWD 
LDSDERPSPE 
QMTLIjEGSIA 
PSSDLTEIGE 
HLKSKTVLFV 
PVEINSKKET 
QAAGGPLAFL 
MQYYASIYAL 
ILNRFSKDMD 
VSRVLIRELK 
FTCAMRWLAV 
SET EAR FT SV 
KKVSFTIKPK 
SIIPQEPVLF 
SVGERQLLCI 
VLGSDRIMVL 



21 
I 

RERTSTSGTH 
YHHGLSALKP 
SSDVNCRRLE 
VKHLLEYTQA 
KILKLKNIKE 
LGSAVFILFY 
VQKIREEERR 
NSMTFALKVT 
SSHSSIQNSP 
EEEGKHIHLG 
ISGTFAYVAQ 
RGANLSGGQR 
THQLQYLVDC 
SGSQKKSQDK 
VIMALFMLNV 
SMAVMLILKA 
EVDVRLPFQA 
RLDNITQSPF 
RLDLISIALI 
ERINHYIKTL 
EKIGIVGRTG 
SGTVRSNLDP 
ARALLRHCKI 
AQGQWEFDT 



31 



IRTTSKHQHP 
RLWQEELNEV 
TESNLGYSLL 
KSLGELINIC 
PAMMFASRLT 
ILEKAGYFQG 
PFSVKSLSEA 
KLTPKMKKDK 
HLRLQRTLHS 
QAWILNATLR 
QRISLARALY 
DEVIFMKEGC 
GPKTGSVKKE 
GSTAFSTWWL 
IRGWFVKGT 
EMFIQNVILV 
LSHITSSIQG 
TTTGLMIVLM 
SLEAPARIKN 
SGKSSLGMAL 
FNQYTEDQIW 
LILDEATAAM 
PSVLLSNDSS 



Seq ID NO: 320 DNA sequence 

Nucleic Acid Accession #: AK022089.1 

Coding sequence: 181-1488 



AGCAGTTGCA 
ACATCCGAGG 

ccccaccccc 
atggctccct 
gacatggatt 
tgtgggctga 
gaggctacgt 
gagaagtgga 
aaagcgtggg 

CTTCCAGTTC 
TGGGAGCTCA 
GTCAGGAAAA 
GATAATATGG 
GTCAAGAGAA 
GATCGAGTAG 
AGTGAAGTTG 
AGCGAAAGTG 
GATAAGCTCT 
GATGCGGAAG 
GATTTGGAGA 
AAAGAGATTA 
GCCAAGGAAT 
AGAGGGAAGG 
GTATTTAGCA 
CAGGACTCCG 
CTTTCTGACC 
ATATAACACT 
GAAATGTGTT 



11 
I 

CAACTTCCAG 
AGAAGAGAAA 
ACCCTCAGAT 
TTGGAAGAAA 
CAGAAGAGAA 
CTAAACGCAC 
TTGGAGAGAA 
GAGGCTCCGA 
GAGATGAGCA 
CTTTGTGGCG 
GCCCAGCAAA 
CTTTCCGGAA 
AGACATTAGT 
TGAAAGAGCT 
AAAATGATGG 
AGCAAAATCT 
ATGGAATTGA 
CTGCTGAAAT 
GGGAAGCTGC 
AAAGCATGAA 
AATACAGTGA 
TCAATTCACT 
AATCTGAGGT 
ATTACACAAA 
AAACAACAGT 
TGCTTTCATG 
CAAAAAAATG 
GTTTCTG 



21 
I 

CAACTTTCTC 
GCTTCCAGCC 
CACTTTAAGA 
CTTGCTAAAG 
GGAAATTGTG 
CACCTCTGCT 
ACGATTTCTT 
AAGGGTTCTT 
GCCCAATATG 
GACAGCTGAA 
CTACATGAAG 
ACTGGCTAAA 
TCATCTGATC 
GGATCTGGAA 
AGAAAACTAT 
AGACTTGCAG 
ACAGCTGGAA 
AGAAAAAGAG 
AAGTGAACTG 
AGCTGGTTTG 
CTCATTGCTT 
TCACATTAGC 
TCCCAGTAGC 
TGACACAGAC 
AGGAGATGTG 
TTTTAATGTT 
TAAATCATAT 



31 

I 

AGCCGGCTAC 
CTCCTCCCTT 
TAATTTCTTT 
ACTCGGCATA 
GTTTGGGTTT 
GATGTCATCC 
CTGGGGAAGC 
CCTCCACTAA 
CAATTTGTTT 
GCCAAATTAG 
ACTTTACCAC 
ATTAAGCAGG 
ATTTCCCAGG 
ATTGAAAAGT 
GTTCAGGATG 
TATGAGGAAA 
GAACGACTGA 
GTAAAAAGTG 
GAAAGCTCTA 
AAAATTCACT 
CAGATGAAAG 
AACAAAGATG 
AATGGGGAGA 
TCGGACACTG 
GTGCTGTTGT 
TGTTTAATTT 
TGTAGTATTC 



41 
I 

TRPIiECQDAL 
VDNAGIiFSCM 
GPDAASLRRV 
LVLGLLLTEI 
SNDGQRMFEA 
AYFRRKCVAA 
ITVGVAPIW 
SVAVDRFKSIj 
RASRGKKEKV 
IDLEIQEGKL 
DNILFGKEYD 
SDRSIYILDD 
ITERGTHEEL 
KAVKPEEGQL 
SYWIKQGSGN 
LRASSRLHDE 
FFCVGMIAGV 
LATIHAYNKG 
HGQIPPAYAG 
KAPSPDWPQE 
FRLVELSGGC 
DALERTHMKE 
DTETDLLIQB 
RFYAMFAAAE 



41 

I 

TAATGAGCTG 
CACCCTGGAA 
ATTCGTTTGC 
AAAACAGATC 
GCCAAGAAGA 
AGGCTTTGCT 
CCAGTGATTA 
CTAGAATCCT 
TGGTTAAAGC 
TGCAAAACAC 
CAGAT AAAC A 
ACACAGTTTC 
ACCATACTAT 
GTGAAGCTAA 
CATATTTAAT 
ACCAGACTCT 
AATATTACCG 
TTTGCATTGA 
ATTTAGAGAG 
CTCATTTGAG 
CAAAAGAATA 
GGTGCCAGTT 
TTCCTCCCTT 
GTATCAGTTC 
CAACATAGTT 
AATAGGAAAC 
AATAGTTAAT 



51 
1 

ETAARAEGLS 
TFSWLSSLAR 
VWIFCRTRLI 
VRSWSLALTW 
AAVGSLLAGG 



Seq ID NO: 321 Protein sequence: 
Protein Accession #: NP_005438.1 



1 
I 

MAPFGRNLLK 
EATFGEKRFL 
LPVPLWRTAE 
DNMETLVHLI 
SEVEQNLDLO 
DAEGEAASEL 
AKEFNSLHIS 
QDSETTVGDV 



11 
I 

TRHKNRSPTK 
LGKPSDYCII 
AKLVQNTEKL 
ISQDHTIHQQ 
YEENQTLEDL 
ESSNLESVKC 
NKDGCQLKEN 
VU*ST 



21 
I 

DMDSEEKEIV 
EKWRGSERVL 
WELSPANYMK 
VKRMKELDLE 
SESDGIEQLE 
DLEKSMKAGL 
RAKESEVPSS 



31 

I 

VWVCQEEKLV 
PPLTRILKLW 
TLPPDKQKRI 
IEKCEAKFHL 
ERLKYYRILI 
KIHSHLSGIQ 
NGEIPPFTQR 



41 

1 

CGLTKRTTSA 
KAWGDEQPNM 
VRKTFRKIiAK 
DRVENDGENY 
DKLSAEIEKE 
KEIKYSDSLL 
VFSNYTNDTD 



VIASWTFSV 
FLHEEVHMIK 
RQLQRTEHQA 
VGICGSVGSG 
EERYNSVLNS 
PLSALDAHVG 
MNLNGDYATI 
VQLEEKGQGS 
TTVTRGNETS 
IiFRRILRSPM 
FPWFLVAVGP 
QEFLHRYQEb 
IiAISYAVQLT 
GEVTFENAEM 
IKIDGVRISD 
CIAQLPLKLE 
TIREAFADCT 
NKVAVKG 



51 
I 

AAAGCCAGGA 
ATCCAGACAC 
CCGACAGACC 
TCCAACTAAA 
GAAGCTTGTC 
TGAGGAACAT 
CTGCATCATA 
GAAGCTTTGG 
AGATGCTTTT 
AGAAAAATTG 
AAAAAGAATA 
TCATGATCGA 
TCATCAGCAA 
GTTCCATCTT 
GCCCAGTTTC 
GGAGGACCTG 
AATACTCATT 
TATAAATGAA 
TGTTAAGTGT 
TGGCATCCAG 
TGAACTCCTG 
AAAGGAAAAC 
TACTCAAAGA 
TAACCACAGT 
CCAATGGCTC 
CTCATTTTAA 
AAAAACTCGA 



51 
I 

DVIQALLEEH 
QFVLVKADAF 
IKQDTVSHDR 
VQDAYLMPSF 
VKSVCIDINE 
QMKAKEYELL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 322 DNA sequence 

Nucleic Acid Accession #: NM_030920.1 

Coding sequence: 317-1123 


AGCATTGAAG 
GGCGCGTGCG 
AGTTAGAGTC 

AAAATTTTG6 
AGAGCAAX3AG 
CCCGGAGGAG 
AGGCCTGAAT 
TTCGCTGGCC 
AATTTCTGGA 
GAGTGGAAAC 
GAAAAGTCTT 
TTTTGAACTA 
GCCGGACTCT 
AAATGAAGCT 
TGAGGATGAG 
AGTGGGCCTC 
TGTTGAAGAA 
GAAACGAGAT 
TTCTCTAATG 
AGCTATCCCT 
TTGCCTTATC 
ATTTTCATTG 
TTCAAGCTTA 
GGTGTAGTAT 
AATATAAAAT 
GGAATCTTTA 
TTTGGTTGCT 
TTCAGCAACA 
TAGTGGTGAC 
CATGCAGGTG 
GTTTTTAGGT 
GTTTCACTCC 
CTTTAGTCTT 
AAGAAATTGA 
TCTCCTTACC 
AAAGAAAAGC 
GCCTGTTGAT 
TTGGATACCA 
TAATAGCATT 
TTAAAATTAA 
AGTGAGTGTG 
CTCCTCTTTT 
TAAAATTTTT 
TTAATTGAAG 
TACTGCTTCT 
CTTGTGGAGA 
TAGTGAGGGG 
ATTCCCTGTT 
ATTTTTAAAA 
ATTAGAGTAC 
TTAGCTTTGT 
AGTTTTCGGT 
CTCAAGAATC 
AGCCATGACA 



11 
I 

GGGAAGGAAC 
TGOGTGTGTG 
CCAACTCTTG 
TAGTGGGCGT 
GTTGGGGGTA 
GAGGACATGG 
GTGACAGAGT 
GATACTTTCA 
CGGCTTCCCA 
GGCTTGGAAG 
AAAATAAAAG 
GACCTGTTTA 
CTGCAGCAAA 
GAAGAGGAGG 
GGTCCACCGG 
GATGAAGATG 
TCATACTTAA 
GGGGAAGAAG 
GCTGAAGACG 
TTTCTGGGTG 
ACAGAAGATA 
ATTCCAAATA 
ACTCCCATTG 
CTGTATTAGT 
GGTGCATTTC 
GACAATAGTC 
GTTTGAAGGC 
TTTTTGTCAC 
TAGTTCATGG 
TGATAAGATT 
AGCCCTTTTG 
GTGCATTCTT 
CTCCCGCTCT 
ATACCTCAAC 
GTATTTTCTG 
ATAAATGTTT 
AGGGGTGCTT 
ATATTAATAA 
CTCTGCAAAG 
TAATTGGAAT 
GATTTGTTTT 
TGTTTCCTTA 
GCTATGGAGG 
TCCCTTTGTT 
TTATGCTTCT 
ACAATGTAAA 
AACTAAAAAT 
ATTGGAATCC 
CTAGTTCCTA 
TGAAATCTAT 
TTTGGTTTGA 
GGGTGGTATT 
CCTATTTTAA 
TCCTTAAAAA 
GTGTTAAAAA 



21 
I 

TGCGGGTGTG 
TGCGCGCGCT 
GACTCCATTT 
TTATATTTGC 
TTGGGGAAGG 
AGATGAAGAA 
TAGTCCTTGA 
AAGAACTAGA 
GCTTAAATAA 
TCCTGGCAGA 
ATCTCAGTAC 
ACTGTGAGAT 
TCACATACTT 
ATGATGAGGA 
AAGGATATGA 
AAGATGAAGC 
TGAAAGAAGA 
AGGAAGAAGA 
ATGGAGAGGA 
TGCAATAGAG 
ATGTGTAACT 
AGAACTAGTC 
TGGAATTCCC 
CATTTTTAGC 
ATTCCTTATT 
TCTTGAGTGG 
AGTTACCTTT 
AAGTAACTTG 
ATTTTGTGGA 
TTATTTTTGA 
TCAGGCTGCA 
TTTCTTCTTA 
CTCAAAAGGA 
TACGTTTCAG 
ACATAAGAAT 
CTTGGTTGAC 
TTGGAAGCAG 
GACATTAAAG 
TATTTCTAAC 
ATACTAGGCA 
CAAGTGGATG 
CATGGCTACT 
CTCCATGTTC 
TTGAATATTT 
ATACTGGGAC 
ATGTATGACT 
GTTGTACAAC 
CCAAAAAGAT 
ACAATTCTCA 
ATATTGACTT 
AAACAACACT 
TTCATGCAAA 
TGCTCTTTGT 
CTTTGAAGTT 
CAAAC 




CAGGAAAGGG 
GAAGATTAAC 
TAATTGCCTG 
ATTTCTGAGT 
ACTTCGAAAA 
GAAATGTCCA 
AGTAGAAGCT 
CACAAACCTG 
AGATGGATTT 
TGGAGATGAA 
GGAAGAGGAG 
AGGTTCAGAG 
AATTCAGGAT 
GGAAGAAGGA 
AGAAGATGAC 
TGATCACATC 
TTTTATAGGA 
TGTTAATGAT 
TAGCAATTTA 
CCATAATTAA 
TATAGATTAA 
TAAGTTGGTT 
TTTTTTTTTT 
GAAAATAGAA 
GGTTCTATTC 
AGGAAAAATT 
AATCATGACA 
GCAATTCCTT 
ACTTGGGAAA 

ATATTATCAA 
AACATCTAAG 
TGTTAGTGTT 
TCAAATTTTA 
CTTTAATTCC 
GCTGGAAAAG 
TCCATTAAAA 
AAATAAAATA 
AAGGCAATGG 
GTAAGTTTTT 
ATATTTAAAT 
TGGTGTTTTA 
TGACCGAAAG 
AACATTTTTC 
TTACATACTA 
TCTTATCAAT 
TAGAGCCTCC 
TAAGTAAGGG 
ATGGCAGTAT 
AATACTTTTG 



41 

I 

GTGTGTGTGT 
CAAGGAGGTG 
TCTTTCTCCC 
ATTCATTTCT 
AAAAGGAGAG 
CTGGAGTTAA 
TGTGTCAATG 
ATGGCTAATG 
TTGGAGCTTA 
AATCTTACCT 
CTGCAAAATC 
GAAGATTATA 
GATCAGGAGG 
GATGATGAAG 
GAGGAAGAGG 
TTGGGAGAGG 
GAAGAAGATG 
GGTCTTCGAG 
TAGATCATTC 
TTTGTTTCTT 
AAAGTGTGGT 
CATATTGTAT 
TTTAGACTTA 
AACATGATCA 
CTGAAATTAC 
ATTTTTTTAG 
TTTTTTTTTG 
GCAGAATAGT 
AGTAATATGG 
GCTTATACTA 
TGCCGATGGT 
TATGATCACC 
CTTGTGAAAC 
GTTTTAAATA 
TACAGTTTTA 
ACAATATTAG 
CCTCAAAAGT 
ATGTTGGCCT 
CAGTTTTAAA 
TATTTGAAAC 
GTAGAAAAAT 
TAATGAGTAT 
CTTTTTAAAT 
AAGAAGTTAG 
ACTGAGTATA 
AAGTAAAAAT 
AAAACCCTTG 
TTCTGCTTTT 
TGCCAGATTA 
CATCTTACTG 
AGATAACTTT 
TGGGTTTTAT 
GTATATATTG 
TGCAACTGTG 



51 
I 

ATGTGTGTGC 
GGGGCAGCTG 
CCACACCTAT 
AAATCTCTTA 
TAGTAGCTGA 
GGAACAGATC 
GGGAAATTGA 
TGGAACTAAG 
GTGATAATAT 
ACCTCAATCT 
TTAAAAATTT 
GAGAAAGTAT 
ATAATGAAGC 
AGGAAGAGGA 
AAGAGGAGGA 
GAGAAGAGGA 
ATGATGACTA 
GGGAGAAGAG 
TAAGACCAGA 
CATGTACGAT 
TTTACTATTT 
GTAGAGAAAA 
ATTTTTTAAA 
CTTTTAAACA 
AGTTTGCTAT 
AGGTGATCCA 
ACTAAGAGTG 
AAAGGTTCTA 
TTCATGGATT 
AGTCCAGAGA 
TGTTTATTTT 
TTCCCTTCTT 
CCAGGAAAAC 
AGTGAAGTAG 
TGCAGTAAGC 
TGGGATGAAG 
CGGAACAATT 
CTCAAATGAT 
ACAGATATAA 
TAAATTGACA 
ATTTGGGATA 
ACAAGTATAT 
CTTGGCTATC 
TGTCAGCAAA 
GTACTGCTGC 
TATGATGTTA 
GGGATAAGTT 
AAAAACTGAA 
CAAAATACTT 
TGCAATCAAA 
TAAGACTTAT 
ATTTTGTAGA 
TGTTAAGTTC 
TTTTGAATAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 



Seq ID NO: 323 Protein sequence: 
Protein Accession #: NP_112182.1 

1 11 21 31 41 51 

| | | | | | 

MEMKKKXNLE LRNRSPEEVT ELVLDNCLCV NGEIEGLNDT FKELBFLSMA HVELSSLARL 

PSLNKLRKLE LSDNIISGGL EVLAEKCPNL TYLNLSGNKI KDLSTVEALQ NLKNLKSLDL 

FNCEITNLED YRESIFELLQ QITYLDGFDQ EDNEAPDSEE EDDEDGDEDD EEEEEKEAGP 

PEGYEEEEEE EEEEDEDEDE DEDEAGSELG EGEEEVGLSY LMKEEIQDEE DDDDYVEEGE 
EEEEEEEGGL RGEKRKRDAE DDGEEEDD 

Seq ID NO: 324 DNA sequence 
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224-2722 

1 11 21 31 41 51 

| | | | | | 

TCCTCTGCGT CCCGCCCCGG GAGTGGCTGC GAGGCTAGGC GAGCCGGGAA AGGGGGCGCC 60 

GCCCAGCCCC GAGCCCCGCG CCCCGTGCCC CGAGCCOGGA GCCCCCTGCC CGOGGCGGCA 120 

CCATGCGCGC CGAGCCGGCG TGACCGGCTC CGCCCGCGGC CGCCCCGCAG CTAGCCCGGC 180 

GCTCTCGCCG GCCACACGGA GCGGCGCCCG GGAGCTATGA GCCATGAAGC CGCCCGGCAG 240 

CAGCTCGCGG CAGCCGCCCC TGGCGGGCTG CAGCCTTGCC GGCGCTTCCT GCGGCCCCCA 300 

ACGCGGCCCC GCCGGCTCGG TGCCTGCCAG CGCCCCGGCC CGCACGCCGC CCTGCCGCCT 360 

GCTTCTCGTC CTTCTCCTGC TGCCTCCGCT CGCCGCCTCG TCCCGGCCCC GCGCCTGGGG 420 

GGCTGCTGCG CCCAGCGCTC CGCATTGGAA TGAAACTGCA GAAAAAAATT TGGGAGTCCT 480 

GGCAGATGAA GACAATACAT TGCAACAGAA TAGCAGCAGT AATATCAGTT ACAGCAATGC 540 

AATGCAGAAA GAAATCACAC TGCCTTCAAG ACTCATATAT TACATCAACC AAGACTCGGA 600 


AAGCCCTTAT CACGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660 

CCATCTGGCC CAGGCAAGCT TCCAGATTGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720 

CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 

ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 840 

AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTTCAT GGCATGTTTG AAGATGATAC 900 

CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 

GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 1080 

AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 

TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 

AAAGTCCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 1260 

CCTGGTGGCT GTAGAGACCT GGACTGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 

GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 1380 

GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 

TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC ATCATGGAGG AAACAGGGGT 1620 

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 1740 

AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 1920 

GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 1980 

GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2040 

CAGAGACAAC CAGTGTCAGT ACATCTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 

GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 2220 

TCGAGCTCCA CGTATTGGTC AACTTCAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 2280 

AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGGCAGGGAC 2520 

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2580 

GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2640 

TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 

TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2760 

ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTQ TAAACAAAAC 2820 

CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2880 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2940 

GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGO 3000 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 

Seq ID NO: 325 Protein sequence: 
Protein Accession #: NP_003803 

1 11 21 31 41 51 

| | | | | | 

MKPPGSSSRQ PPLAGCSLAG ASCGPQRGPA GSVPASAPAR TPPCRLLLVL LLLPPLAASS 60 

RPRAWGAAAP SAPHWNETAE KNLGVLADED NTLQQNSSSN ISYSNAMQKE ITLPSRLIYY 120 

INQDSESPYH VLDTKARHQQ KHNKAVHLAQ ASFQIEAFGS KFILDLILNN GLLSSDYVEI 180 

HYENGKPQYS KGGEHCYYHG SIRGVKDSKV ALSTCNGLHG MFEDDTFVYM IEPLELVHDE 240 

KSTGRPHIIQ KTLAGQYSKQ MKNLTMERGD QWPFLSELQW LKRRKRAVNP SRGIFEEMKY 300 

LELMIVNDHK TYKKHRSSHA HTNNFAKSVV NLVDSIYKEQ LNTRVVLVAV ETWTEKDQID 360 

ITTNPVQMLH EFSKYRQRIK QHADAVHLIS RVTFHYKRSS LSYFGGVCSR TRGVGVNEYG 420 

LPMAVAQVLS QSLAQNLGIQ WEPSSRKPKC DCTESWGGCI MEETGVSHSR KFSKCSILEY 480 

RDFLQRGGGA CLFNRPTKLF EPTECGNGYV EAGEECDCGF HVECYGLCCK KCSLSNGAHC 540 

SDGPCCNNTS CLFQPRGYEC RDAVNECDIT EYCTGDSGQC PPNLHKQDGY ACNQNQGRCY 600 

NGECKTRDNQ CQYIWGTKAA GSDKFCYEKL NTEGTEKGNC GKDGDRWIQC SKHDVFCGFL 660 

LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHVVLD DDTDVGYVED GTPCGPSMMC 720 

LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 780 
KDEGPKGPSA TNLIIGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQQG PI 

Seq ID NO: 326 DNA sequence 

Nucleic Acid Accession #: AK074418.1 

Coding sequence: 244-1515 

1 11 21 31 41 51 

| | | | | | 

CTTTCTCCAA GACGGCCGGC CATGCTCTCC TCCTCTGCCA GTCTCCTCCA CCACTCTCTA 60 

ACCTGAGAGC CTGTGGAACC TGCCCGTCTC CCCTCCTCCA TCAGACACAC CTGCCTAGGA 120 

AACAGATGGA AAAAGTGAGG GACCGGTGAG TGACTTGCTG CTAAAGTTTA TACCAGATGC 180 

AAATGACAGA GCTGGAGTTC TGCTGTGCCT GGAAAGGACC TCGGAAGTCT TCTAAGGAGA 240 

GTCATGGCGT ATTACCAGGA GCCTTCAGTG GAGACCTCCA TCATCAAGTT CAAAGACCAG 300 

GACTTTACCA CCTTGCGGGA TCACTGCCTG AGCATGGGCC GGACGTTTAA GGATGAGACA 360 

TTCCCCGCAG CAGATTCTTC CATAGGCCAG AAGCTGCTCC AGGAAAAACG CCTCTCCAAT 420 

GTGATATGGA AGCGGCCACA GGATCTACCA GGGGGTCCTC CTCACTTCAT CCTGGATGAT 480 

ATAAGCAGAT TTGACATCCA ACAAGGAGGC GCAGCTGACT GCTGGTTCCT GGCAGCACTG 540 

GGATCCTTGA CTCAGAACCC ACAGTACAGG CAGAAGATCC TGATGGTCCA AAGCTTTTCA 600 

CACCAGTATG CTGGCATTTT CCGTTTCCGG TTCTGGCAAT GTGGCCAGTG GGTGGAAGTG 660 

GTGATTGATG ACCGCCTACC TGTCCAGGGA GATAAATGCC TCTTTGTGCG TCCTCGCCAC 720 

CAAAACCAAG AGTTCTGGCC CTGCCTGCTG GAGAAGGCCT ATGCCAAGCT GCTOGGATCC 780 

TATTCCGATC TGCACTATGG CTTCCTCGAG GATGCCCTGG TGGACCTCAC AGGAGGCGTG 840 

ATCACCAACA TCCATCTGCA CTCTTCCCCT GTGGACCTGG TGAAGGCAGT GAAGACAGCG 900 

ACCAAGGCAG GCTCCCTGAT AACCTGTGCC ACTCCAAGTG GGCCAACAGA TACAGCACAG 960 

GCGATGGAGA ATGGGCTGGT GAGTCTCCAT GCCTACACTG TGACTGGGGC TGAGCAGATT 1020 

CAATACCGAA GGGGCTGGGA AGAAATTATC TCCCTGTGGA ACCCCTGGGG CTGGGGCGAG 1080 

ACCGAATGGA GAGGGCGCTG GAGTGATGGG TCTCAGGAGT GGGAGGAAAC CTGTGATCCG 1140 



CGGAAAAGCC AGCTACATAA GAAACGGGAA GATGGCGAGT TTTGGATGTC GTGTCAAGAT 1200 

TTCCAACAGA AATTCATCGC CATGTTTATA TGTAGCGAAA TTCCAATTAC CCTCGACCAT 1260 

GGAAACACAC TCCACGAAGG ATGGTCCCAA ATAATGTTTA GGAAGCAAGT GATTCTAGGA 1320 

AACACTGCAG GAGGACCTCG GAATGATGCT CAATTCAACT TCTCTGTGCA AGAGCCAATG 1380 

GAAGGCACCA ATGTTGTCGT GTGCGTCACA GTTGCTGTCA CACCATCAAA TTTGAAAGCA 1440 

GAAGATGCAA AATTTCCACT CGATTTCCAA GTGATTCTGG CTGGCTCACA GAAACACTGT 1500 

CCAAAGCTCA AATAATAAAT TCCGCCGCAA CTTCACCATG ACTTACCATC TGAGCCCTGG 1560 

GAACTATGTT GTGGTTGCAC AGACACGGAG AAAATCAGCG GAGTTCTTGC TCCGAATCTT 1620 

CCTGAAAATG CCAGACAGTG ACAGGCACCT GAGCAGCCAT TTCAACCTCA GAATGAAGGG 1680 

AAGCCCTTCA GAACATGGCT CCCAACAAAG CATTTTCAAC AGATATGCTC AGCAGGTATG 1740 

GTAOCTAGCA CCCAGGGGCC TTACGTGGGA TTGGAGAAAG GGGACCTGAG GGAGGGACAG 1800 

CCCTCACAGG CCCTTACTGG GATGCAGAGA GGAGAAGTGA CTTGATGGAC TATTTTACCT 1860 

GCCTCTCTTC CTGGATCGTC TCCAGAACTG CTGTGGCTGC CAAGCTCGGT AGAGACGTGG 1920 

CGCCCCACCC AGTCTCATCC GGGGGACTTC AAGCTGGAAT GCAGAGCTTA GAAAGGGAGG 1980 

GGATAATTAT GGGGTGTGAG GTGCATTGCC CTCTAAATCT TTAAACAAGC AATTGGCAGT 2040 

ACCCCGTGAA ACCTTTCCTP CTCCTACTCG GCCACCTCCC ACCAACCTGG CATCGTTCCT 2100 

CCCGGGAGCT AGCCAGCTTC AGAAAGCACA TACAGCATCC TTGCTGCCAA ACCACCTATG 2160 

TGCACACAGG ATTTCCTTAA TGGCTTAATA AACTGTTATA AAGAACTCCT TGACTTGTCA 2220 

GAATAAAATA GCTGCCAGGG GCTCTGCACA ATGAGCCTCT TACCGTTAAA AAAAAAAAAA 2280 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 327 Protein sequence: 
Protein Accession #: BAB85075.1 

1 11 21 31 41 51 

| | | | | | 

MAYYQEPSVE TSIIKFKDQD FTTLRDHCLS MGRTFKDETF PAADSSIGQK LLQEKRLSNV 60 

IWKRPQDLPG GPPHFILDDI SRFDIQQGGA ADCWFLAALG SLTQNPQYRQ KILMVQSFSH 120 

QYAGIFRFRF WQCGQWVEVV IDDRLPVQGD KCLFVRPRHQ NQEFWPCLLE KAYAKLLGSY 180 

SDLHYGFLED ALVDLTGGVI TNIHLHSSPV DLVKAVKTAT KAGSLITCAT PSGPTDTAQA 240 

MENGLVSLHA YTVTGAEQIQ YRRGWEEIIS LWNPWGWGET EWRGRWSDGS QEWEETCDPR 300 

KSQLHKKRED GEFWMSCQDF QQKFIAMFIC SEIPITLDHG NTLHEGWSQI MFRKQVILGN 360 

TAGGPRNDAQ FNFSVQEPME GTNVVVCVTV AVTPSNLKAE DAKFPLDFQV ILAGSQKHCP 420 
KLK 



Seq ID NO: 328 DNA sequence 

Nucleic Acid Accession #: BC017490.1 

Coding sequence: 74-2788 

1 11 21 31 41 51 

| | | | | | 

GTGGGTCACG TGAACCACTT TTCGCGCGAA ACCTGGTTGT TGCTGTAGTG GCGGAGAGGA 60 

TCGTGGTACT GCTATGGCGG AATCATCGGA ATCCTTCACC ATGGCATCCA GCCCGGCCCA 120 

GCGTCGGCGA GGCAATGATC CTCTCACCTC CAGCCCTGGC CGAAGCTCCC GGCGTACTGA 180 

TGCCCTCACC TCCAGCCCTG GCCGTGACCT TCCACCATTT GAGGATGAGT CCGAGGGGCT 240 

CCTAGGCACA GAGGGGCCCC TGGAGGAAGA AGAGGATGGA GAGGAGCTCA TTGGAGATGG 300 

CATGGAAAGG GACTACCGCG CCATCCCAGA GCTGGACGCC TATGAGGCGG AGGGACTGGC 360 

TCTGGATGAT GAGGACGTAG AGGAGCTGAC GGCCAGTCAG AGGGAGGCAG CAGAGCGGGC 420 

CATGCGGCAG CGTGACCGGG AGGCTGGCCG GGGCCTGGGC CGCATGCGCC GTGGGCTCCT 480 

GTATGACAGC GATGAGGAGG ACGAGGAGCG CCCTGCCCGC AAGCGCCGCC AGGTGGAGCG 540 

GGCCACGGAG GACGGCGAGG AGGACGAGGA GATGATCGAG AGCATCGAGA ACCTGGAGGA 600 

TCTCAAAGGC CACTCTGTGC GCGAGTGGGT GAGCATGGOG GGCCCCCGGC TGGAGATCCA 660 

CCACCGCTTC AAGAACTTCC TGCGCACTCA CGTCGACAGC CACGGCCACA ACGTCTTCAA 720 

GGAGCGCATC AGCGACATGT GCAAAGAGAA CCGTGAGAGC CTGGTGGTGA ACTATGAGGA 780 

CTTGGCAGCC AGGGAGCACG TGCTGGCCTA CTTCCTGCCT GAGGCACCGG CGGAGCTGCT 840 

GCAGATCTTT GATGAGGCTG CCCTGGAGGT GGTACTGGCC ATGTACCCCA AGTACGACCG 900 

CATCACCAAC CACATCCATG TCCGCATCTC CCACCTGCCT CTGGTGGAGG AGCTGCGCTC 960 

GCTGAGGCAG CTGCATCTGA ACCAGCTGAT CCGCACCAGT GGGGTGGTGA CCAGCTGCAC 1020 

TGGOGTCCTG CCCCAGCTCA GCATGGTCAA GTACAACTGC AACAAGTGCA ATTTCGTCCT 1080 

GGGTCCTTTC TGCCAGTCCC AGAACCAGGA GGTGAAACCA GGCTCCTGTC CTGAGTGCCA 1140 

GTCGGCCGGC CCCTTTGAGG TCAACATGGA GGAGACCATC TATCAGAACT ACCAGCGTAT 1200 

CCGAATCCAG GAGAGTCCAG GCAAAGTGGC GGCTGGCCGG CTGCCCCGCT CCAAGGACGC 1260 

CATTCTCCTC GCAGATCTGG TGGACAGCTG CAAGCCAGGA GACGAGATAG AGCTGACTGG 1320 

CATCTATCAC AACAACTATG ATGGCTCCCT CAACACTGCC AATGGCTTCC CTGTCTTTGC 1380 

CACTGTCATC CTAGCCAACC AGGTGGCCAA GAAGGACAAC AAGGTTGCTG TAGGGGAACT 1440 

GACCGATGAA GATGTGAAGA TGATCACTAG CCTCTCCAAG GATCAGCAGA TCGGAGAGAA 1500 

GATCTTTGCC AGCATTGCTC CTTCCATCTA TGGTCATGAA GACATCAAGA GAGGCCTGGC 1560 

TCTGGCCCTG TTCGGAGGGG AGCCCAAAAA CCCAGGTGGC AAGCACAAGG TACGTGGTGA 1620 

TATCAACGTG CTCTTGTGCG GAGACCCTGG CACAGCGAAG TCGCAGTTTC TCAAGTATAT 1680 

TGAGAAAGTG TCCAGCCGAG CCATCTTCAC CACTGGCCAG GGGGCGTCGG CTGTGGGCCT 1740 

CACGGCGTAT GTCCAGCGGC ACCCTGTCAG CAGGGAGTGG ACCTTGGAGG CTGGGGCCCT 1800 

GGTTCTGGCT GACGGAGGAG TGTGTCTCAT TGATGAATTT GACAAGATGA ATGACCAGGA 1860 

CAGAACCAGC ATCCATGAGG CCATGGAGCA ACAGAGCATC TCCATCTCGA AGGCTGGCAT 1920 

CGTCACCTCC CTGCAGGCTC GCTGCACGGT CATTGCTGCC GCCAACCCCA TAGGAGGGCG 1980 

CTACGACCCC TCGCTGACTT TCTCTGAGAA CGTGGACCTC ACAGAGCCCA TCATCTCACG 2040 

CTTTGACATC CTGTGTGTGG TGAGGGACAC CGTGGACCCA GTOCAGGACG AGATGCTGGC 2100 

CCGCTTCGTG GTGGGCAGCC ACGTCAGACA CCACCCCAGC AACAAGGAGG AGGAGGGGCT 2160 

GGCCAATGGC AGCGCTGCTG AGCCCGCCAT GCCCAACACG TATGGCGTGG AGCCCCTGCC 2220 

CCAGGAGGTC CTGAAGAAGT ACATCATCTA CGCCAAGGAG AGGGTCCACC CGAAGCTCAA 2280 

CCAGATGGAC CAGGACAAGG TGGCCAAGAT GTACAGTGAC CTGAGGAAAG AATCTATGGC 2340 

GACAGGCAGC ATCCCCATTA CGGTGCGGCA CATCGAGTCC ATGATCCGCA TGGCGGAGGC 2400 

CCACGCGCGC ATCCATCTGC GGGACTATGT GATCGAAGAC GACGTCAACA TGGCCATCCG 2460 

CGTGATGCTG GAGAGCTTCA TAGACACACA GAAGTTCAGC GTCATGCGCA GCATGCGCAA 2520 

GACTTTTGCC CGCTACCTTT CATTCCGGCG TGACAACAAT GAGCTGTTGC TCTTCATACT 2580 

GAAGCAGTTA GTGGCAGAGC AGGTGACATA TCAGCGCAAC CGCTTTGGGG CCCAGCAGGA 2640 

CACTATTGAG GTCCCTGAGA AGGACTTGGT GGATAAGGCT CGTCAGATCA ACATCCACAA 2700 
CCTCTCTGCA TTTTATGACA GTGAGCTCTT 
AAGGAAAATG ATCCTGCAGC AGTTCTGAGG 
TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT 
TGAACTCGGG GTACTAGGGT CAGGGCTTAT 
TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT 
TGTCTTACTT GGTTGCTGAA CATCTTGCCA 
TTGGATCAGA GCTGCTGAGT TCAGGATGCC 
ATGGATGTCA GGAGAGCTGC TGCCCTCTTG 
TGCCTTTGGC CAGAGAGCTG GTTGAAGATG 
TCTGTGCCCC TGTGGTGGAA GAGGGCACGA 
GTCGCAGGGG TGGGATGTGA GTCATGCGGA 
TTGCTCCCTG TCTGTTTCCC CACTCTCTTA 
TAATTTTTAA TAAAGTTGAA TAAAATATAA 

Seq ID NO: 329 Protein sequence: 
Protein Accession #: AAH17490.1 



CAGGATGAAC 
CCCTATGCCA 
GCTTTATGGA 
AGCAGGATGT 
CTCACCTTTG 
CCTCCGAGTG 
TGCGTGTGGT 
GCGTGAGTTG 
TTTGTAATCG 
CAGTGCCAGC 
TTATCCACTC 
TTTGTGCATT 
AAAAAAAAAA 



MAESSESPTM 
GPLEBEEDGE 
DREAGRGLGR 
SVREWVSMAG 
EHVLAYFLPE 
HLNQI»IRTSG 
PEVNMEETIY 
NYDGSLNTAN 
IAPSIYGHED 
SRAIPTTGQG 
HEAMEQQSIS 
CWRDTVDPV 
KKYIIYAKER 
ELROYVZEDD 
AEQVTYQRNR 
LQQF 



11 

I 

ASSPAQRRRG 
ELIGDGMERD 
MRRGLLYDSD 
PRLEIHHRPK 
APAELLQIFD 
WTSCTGVLP 
QNYQRIRIQE 
GFPVFATVIL 
IKRG1ALALP 
ASAVGLTAYV 
ISKAGIVTSL 
QDEMIARPW 
VHPKLNQMDQ 
VNMAIRVMUE 
PGAQQDTIEV 



21 
I 

NDPLTSSPGR 
YRAIPELDAY 
EEDEBRPARX 
NFLRTHVDSH 
EAALEWLAM 
QLSMVKYNCN 
SPGXVAAGRL 
ANHVAKKDNK 
GGEPKNPGGX 
QRHPVSREWT 
QARCTVIAAA 
GSHVRHHPSN 
DKVAKMYSDL 
SPIDTQKFSV 
PEKDLVDKAR 



31 

I 

SSRRTDALTS 
EAEGLALDDE 
RRQVERATED 
GHNVPKERIS 
YPKYDRITNH 
KCNFVIiGPFC 
PRSXDAILIA 
VAVGELTDED 
HKVRGDINVL 
LEAGALVLAD 
NPIGGRYDPS 



AAGTTCAGCC 
TCCATAAGGA 
CACAAAACCA 
CTGGCTGCAC 
GGTGGGATGC 
CTTTGTCTCC 
TTAGGTGTTA 
CGTATTCAGG 
TTTTCAGTCT 
GCAGCGTTCT 
GCCACAGTTA 
OGGTTTGGTT 
AAAAAA 



41 
I 

SPGRDLPPFE 
DVEELTASQR 



Seq ID NO: 330 DNA sequence 
Nucleic Acid Accession #: M17254 
Coding sequence: 257-1645 



GTCCGCGCGT 
CGGGTCGCAC 
CTTTGGAGAC 
AAAGATGGCA 
TGGCTTACTG 
CTTATCAGTT 
GGCTAAGACA 
CCCACGCGTC 
GGAATGTAAC 
CAAAGGCGGG 
GGAGGAGAAG 
AGCAGATCCT 
AGAATATGGC 
GTGCAAGATG 
TCTCTCACAT 
TGATAAAGCC 
TGAGCCCCCC 
TGCTCAACCA 
TTATCAGATT 
GCTTTGGCAG 
GGAAGGCACC 
AGAGCGGAAG 
CTATGACAAO 
CCACGGGATC 
CTCAGACCTC 
GCCCCACCCT 
CTGGAATTCA 
TTCTCATCTG 
CACCAGCCCA 
TGAAAAAAGC 
GAGGGAGTTA 
GGACATATCA 
AAGGACAAAG 
AATCCCACTA 
AACATACCGT 
TCAAAAACAA 
ACTGCATGGC 
CAGCTTTCTC 
ACTATGAACT 
AGGGGTGAAG 
TCTCAAGCAA 
ACTCGAGGGT 
GTAATGGAGA 
TGTCAAATGA 
TCATTATGTG 



11 
I 

GTCCGCGCCC 
TAACTCCCTC 
CCGAGGAAAG 
GAACCAAGGG 
AAGGACATGA 
GTGAGTGAGG 
GAGATGACCG 
CCTCAGCAGG 
CCTAGCCAGG 
AAGATGGTGG 
CACATGCCAC 
AGGCTATGGA 
CTTCCAGACG 
ACCAAGGACG 
CTCCACTACC 
TTACAAAACT 
AGGAGATCAG 
TCTCCTTCCA 
CTTGGACCAA 
TTCCTCCTGG 
AACGGGGAGT 
AGCAAACCCA 
AACATCATGA 
GCCCAGGCCC 
CCGTACATGG 
CCAGCCCTCC 
CCAACTGGGG 
GGCACTTACT 
TCGCCAGAAA 
TTTACTGGGG 
CTGAAGTCTT 
TCTGTGGACT 
TGCCAAAGAA 
ATGCAAACTG 
TTATAATGCC 
GAGAAAAGAC 
ATGTGCTGTT 
AAACTGTGAA 
AAAAGGTGGG 
AAGGAGGAGG 
TGAAGACTGG 
TCATGCAGTC 
AAGGGAAGTA 
AAATTTTAAC 
GGGGCTTTGT 



21 
I 

GCGTGTGCCA 
GGCGCCGACG 
CCGTGTTGAC 
CAACTAAAGC 
TTCAGACTGT 
ACCAGTCGTT 
CGTCCTCCTC 
ATTGGCTGTC 
TGAATGGCTC 
GCAGCCCAGA 
CCCCAAACAT 
GTACAGACCA 
TCAACATCTT 
ACTTCCAGAG 
TCAGAGAGAC 
CTCCACGGTT 
CCTGGACCGG 
CAGTGCCCAA 
CAAGTAGCCG 
AGCTCCTGTC 
TCAAGATGAC 
ACATGAACTA 
CCAAGGTCCA 
TCCAGCCCCA 
GCTCCTATCA 
CCGTGACATC 
GTATATACCC 
ACTAAAGACC 
CTCTATCGGA 
CTGGGGAAGG 
ACTACAGAAA 
GACCTTGTAA 
AGTGGTCTTA 
GGATGAAACT 
ATTTTAAGGA 
ACGAGAGAGA 
TTGGTTGAAA 
GATGACCCAA 
ACTGAGGATG 
AAGAGGCAGA 
ACTCAGGACA 
AGTGTTATAC 
GTAGAATTCA 
TGGAATTGTC 
TCTCCACAGG 



RKESMATGSI 
MRSMRKTFAR 
QINIHNLSAF 



31 

I 

GCGCGCGTGC 
GOGGCGCTAA 
CAAAAGCAAG 
CGTCAGGTTC 
CCCGGACCCA 
GTTTGAGTGT 
CAGCGACTAT 
TCAACCCCCA 
AAGGAACTCT 
CACCGTTGGG 
GACCACGAAC 
TCTGCGGCAG 
GTTATTCCAG 
GCTCACCCCC 
TCCTCTTCCA 
AATGCATGCT 
TCACGGCCAC 
AACTGAAGAC 
CCTTGCAAAT 
GGACAGCTCC 
GGATCCOGAC 
CGATAAGCTC 
TGGGAAGGGC 
CCCCCCGGAG 
CGCCCACCCA 
TTCCAGTTTT 
CAACACTAGG 
TGGCGGAGGC 
GAACATGAAT 
AAGCCGGGGA 
TGAGGAGGAT 
AAGACAGTGT 
AGAAATGTAT 
AAAGCAATAG 
AAACTACCTG 
CTGTGGCCCA 
TCAAATACAT 
AGTTTCCAAC 
TGTATAGAGT 
GAAGGAGGAG 
TTTGGGGACT 
CAAACCCAGT 
GAAACAAAAA 
TGATATTTAA 
GTCAGGTAAG 



DMCKENRESL 
IHVRISHLPL 
QSQNQEVKPG 
DIiVDSCKPGD 
VKMITSLSKD 
LCGDPGTAXS 
RGVCLIDEFD 
LTPSENVDLT 
AAEPAMPNTY 
PITVRHIESM 
YIjSPRRDNNE 
YDSELFRMNK 



41 
I 

CTTGGCCGTG 
CCTCTOGGTT 
ACAAATGACT 
TGAACAGCTG 
GCAGCTCATA 
GCCTACGGAA 
GGACAGACTT 
GCCAGGGTCA 
CCTGATGAAT 
ATGAACTACG 
GAGCGCAGAG 
TGGCTGGAGT 
AACATCGATG 
AGCTACAACG 
CATTTGACTT 
AGAAACACAG 
CCCACGCCCC 
CAGOGTCCTC 
CCAGGCAGTG 
AACTCCAGCT 
GAGGTGGCCC 
AGCOGCGCCC 
TACGCCTACA 
TCATCTCTGT 
CAGAAGATGA 
TTTGCTGCCC 
CTCCCCACCA 
TTTTCCCATC 
CAAAAGTGCC 
AGAGATCCAA 
GCTAAAAATG 
ATGTAGAAGC 
AAACT7TAGA 
AAACAACACA 
TATTTAAAAA 
TCAACA6ACG 
TCCGTTTGAT 
TCCTTTACAG 
GAGCGTGTGA 
ACCAGGCTGG 
GTGTACAATG 
GTTAGGAGAA 
TGCGCATCTC 
GAGAAACATT 
AGATGGCCTT 



ACGACCTGAA 


2760 


TTCCTTGGGA 


2820 


GAGCACTTGA 


2880 


CTGGCATGAC 


2940 


CTTGCCAGTG 


3000 


ACTCAGTACC 


3060 


GCCTTCTTAC 


3120 


CTGCTTTTGC 


3180 


CCTGCAGGTT 


3240 


GGGCTCCTCA 


3300 


TCAGCTGCCA 


3360 


TCTGTAGTTT 


3420 


51 
1 




1 

DESEGLLGTE 


60 


EAAERAMRQR 


120 


IENLEDIiKGH 


180 


WNYEDLAAR 


240 


VEELRSliRQL 


300 


SCPECQSAGP 


360 


EIELTGIYHN 


420 


QQIGEKIFAS 


480 


QFLKYIEKVS 


540 


KMNDQDRTSI 


600 


EPIlSRFDIIi 


660 


GVEPLPQEVL 


720 


IRMAEAHARI 


780 


LXtLFILKQLV 


840 


FSHDLKRKMI 


900 


51 

1 




1 

CGCGCOGAGC 


60 


ATTCCAGGAT 


120 




180 




240 


TCAAGGAAGC 


300 




360 


CCAAGATGAG 


420 


CCATCAAAAT 


480 


GCAt^TC^TGGC 


540 


GCAGCTACAT 


600 


TTATCGTGCC 


660 


GGGCGGTGAA 


720 


GGAAGGAACT 


780 


V. w>wVL V^V *• w w 4 


840 


CAGATGATGT 


900 


ATTTACCATA 


960 


AGTCGAAAGT 


1020 


nU * * nw\ £ ww 


1080 


GCCAGATCCA 


1140 


GCATCAGCTG 


1200 


GGOGCTGGGG 


1260 


TGOGTTACTA 


1320 


A* "^"I'C^tA 1 u ^ u r 


1380 


ACAAGTACCC 


1440 


ACTTTGTGGC 


1500 


CAAACCCATA 


1560 


GCCATATGCC 


1620 


AGCGTGCATT 


1680 


TCAAGAGGAA 


1740 


AGACTCTTGG 


1800 


TCACGAATAT 


1860 


ATGAAGTCTT 


1920 


GTAGAGTTTG 


1980 


GTTTTGACCT 


2040 


T AGTTTCATA 


2100 


TTY3AT ATf3 C?A 


2160 


GGACAGCTGT 


2220 


TATTACCGGG 


2280 


TTGTAGACAG 


2340 


GAAAGAAACT 


2400 


AGTTATGGAG 


2460 


AGGACACAGC 


2520 


TTTCTTTGTT 


2580 


CAGGACCTCA 


2640 


CTTGGCTGCC 


2700 
ACAATCAGAA ATCACGCAGG CATTTTGGGT AGGCGGCCTC 
AACGCTGTGC GTTTGTCAGA ATGAAGTATA CAAGTCAATG 
ATAATTATAT AACTTATGCA TTTATACACT AOGAGTTGAT 
CGACAAAAGA QACAATCGAT ATAATGTGGC CTTGAATTTT 
TACAATATGA AGTTATTAGT TCTTAGAATG CAGAATGTAT 
TAGCATGGCA AATCAGATTT ATACAGGAGT CTGCATTTGC 
TTGCTTAATG AAAACATGTG CTGAATGTTG TGGATTTTGT 
GGAACTTGTO CAAGGGAGAG CCAAGGAAAT AGGATGTTTG 

Seq ID NO: 331 Protein sequence 
Protein Accession #: AAA52398 



1 
I 

MIQTVPDPAA 
QDWLSQPPAR 
PPPNMTTNER 
DDFQRLTPSY 
SAWTGHGHPT 
LELLSDSSNS 
MTKVHGKRYA 
LPVTSSSFFA 



11 
I 

HIKBALSWS 
VTIKMECNPS 
RVIVPADPTL 
NADILLSHLH 
PQSKAAQPSP 
SCITWEGTNG 
YKFDFHGIAQ 
APNPYWNSPT 



21 
I 

EDQSLFECAY 
QVNGSRNSPD 
WSTDHVRQHL 
YLRETPLPHL 
STVPKTEDQR 
EPKMTDPDEV 



31 
I 

GTPHLAKTEM 
BCSVAKGGKM 
EWAVKEYQLP 
TSDDVDKALQ 
PQLDPYQILG 



GGIYPNTRLP 



LYKYPSDLPY 
TSHMPSHLGT 



CAGTTTTCCT 
TTTTTCCCCC 
CTCGGCCAGC 
AACTCTGTAT 
GTAATAAAAT 
ACTTTTTTTA 
GTTATAATTT 
GCACCC 



41 

I 

TASSSSDYGQ 
VGSPDTVGMN 
DVNILLFQNI 
NSPRLMHARN 
PTSSRLANPG 
PNMNYDKLSR 
MGSYHAHPQK 
YY 



TTGAGTCGCG 
TTTTTATATA 
CAAAGACACA 
GCTTAATGTT 
AAGCTTGGCC 
GTGACTAAAG 
ACTTTGTCCA 



51 

i 

TSKMSPRVPQ 
YGSYMEEKHM 
DGKELCKMTK 
TDLPYEPPRR 
SGQIQLWQPL 
ALRYYYDKNI 
MNFVAPHPPA 



2760 
2820 
2880 
2940 
3000 
3060 
3120 



Seq ID NO: 332 DNA sequence 
Nucleic Acid Accession #: NM_000020 
Coding sequence: 283-1794 



AGGAAACGGT 
AGAAACATTT 
GAGCGAGCCC 
CCAGCGCTGG 
AGGCTAGCGC 
AGGAAAGGCC 
TCTCGGGGCC 
CGGGGGGCCT 
CGGGGCTGCG 
CACTACTGCT 
CAACCTCCTT 
CTGGCCTTGC 
GAGGAGAAGC 
TCTGAGCAGG 
GGCTCAGGGC 
TGTGTGGGAA 
GCCGTCAAGA 
AACACAGTAT 
CGCAACTCGA 
GACTTTCTGC 
GCATGCGGCC 
GCCCACCGCG 
GCCGACCTGG 
AACCCGAGAG 
ACGGACTGCT 
GAGATTGCCC 
GATGTGGTGC 
CAGACCCCCA 
ATGATGCGGG 
AAGACACTAC 
AGCACCTGAT 
CTATCTGGGT 
TGCTCGGCCC 
G7CTGGCCTG 
CAGCATGGTG 
GTGCCAAGCC 
CCCTTGATCA 
CCCTGGCACA 
CCCATCAGTT 
TCCTCAACAA 
ACTAGGGCAT 
AAAAGGGCAG 
GCCAAGCATG 
TTTGCTCCAT 
TTTTTTTTTT 
CCAGCTCACC 
GTAGCTGGGA 
CAGGGTTTCA 
ACCTCAGCCT 
TTGTTTCTTA 
CTAGTTCTCT 
ATGCTCCAGC 
CAAGGAGTGT 
CATGCCAGTG 
CTCGCCCTCT 
GCTTCCAAGG 
CCCTGGCTTC 
ATGGGCTCTA 



11 

I 

TTATTAGGAG 
TTGCTCCAGC 
CTCCCCGGCT 
CGGTGCAACT 
CCCGCCACCC 
TTCTGATGCT 
CGCTGGTGAC 
GGTGCACAGT 
GGAACTTGCA 
GCGACAGCCA 
CGGAGCAGCC 
TGGCCCTGGT 
AGOGTGGCCT 
GCGACACGAT 
TCCCCTTCCT 
AAGGCCGCTA 
TCTTCTCCTC 
TGCTCAGACA 
GCACGCAGCT 
AGAGACAGAC 
TGGCGCACCT 
ACTTCAAGAG 
GCCTGGCTGT 
TGGGCACCAA 
TTGAGTCCTA 
GCCGGACCAT 
CCAATGACCC 
CCATCCCTAA 
AGTGCTGGTA 
AAAAAATTAG 
TCCTTTCTGC 
AGAGGTAGTG 
CCAGCCCACC 
CTCAAAGCGG 
CACCCCCTAC 
AGGGAATCCC 
ACCCCACTGC 
CACTTCCCTG 
TCTCTCTGTG 
GAGTGCAGCT 
TAAATCCTAA 
GTCAGATGGG 
GCAGGGGGAA 
GTGACAAAAG 
GACACGGAGT 
GCAACGTCTA 
TTACAGGCAC 
CCATGCTGGC 
CCCAAAGTGC 
TCTACATATT 
GACACTTCAG 
CCCTGGCAAT 
CTGGAGCACC 
GCCACCCTTG 
CTGTGGCATA 
CTCAAAAGAA 
AGGCCCACAC 
GAGAGACACA 



21 
I 

GGAGTGGTGG 
CCCCATCCCA 
CCAGCCCGGT 
GCGGCCGCGC 
GCAGAGCGGG 
GCTGATGGCC 
CTGCACGTGT 
AGTGCTGGTG 
CAGGGAGCTC 
CCTCTGCAAC 
GGGAACAGAT 
GGCCCTGGGT 
GCACAGCGAG 
GTTGGGGGAC 
GGTGCAGAGG 
TGGOGAAGTG 
GAGGGATGAA 
CGACAACATC 
GTGGCTCATC 
GCTGGAGCCC 
GCACGTGGAG 
CCGCAATGTG 
GATGCACTCA 
GOGGTACATG 
CAAGTGGACT 
CGTGAATGGC 
CAGCTTTGAG 
CCGGCTGGCT 
CCCAAACCCC 
CAACAGTCCA 
CTGCAGGGGG 
TGAGTGTGGT 
CAGCCAAAAA 
CAGGCTCCCT 
CACTCCCGGG 
AGTCCCAGAC 
CCCACCAGAG 
CCAGGCCTCA 
GATTTGTATC 
TGCTGAATGT 
GAGGTCCTAC 
CAAGGCCCAG 
GGTCAGTGGG 
CAGGCCTGTC 
TTCGCTCTTG 
CCTCCCAGGT 
ATGCCACCAT 
CATGCTGGTT 
TGGGGTTACA 
GGAAGATTTG 
CCTATATCAC 
TTGCCTCAAG 
TCCTAGTCTA 
GGCTCAGACA 
GTCTTCTCTG 
ATTTGGCTCC 
CCCTGGGCCA 
CAGAAAGTTT 



31 
I 

AGCTGGGCCA 
GTCCCGGGAG 
CCGGGGCOGC 
GGTGGAGGGG 
CCCAGAGGGA 
TTGGTGACCC 
GAGAGCCCAC 
CGGGAGGAGG 
TGCAGGGGGC 
CACAACGTGT 
GGCCAGCTGG 
GTCCTGGGCC 
CTGGGAGAGT 
CTCCTGGACA 
ACAGTGGCAC 
TGGGGGGGCT 
CAGTCCTGGT 
CTAGGCTTCA 
ACGCACTACC 
CATCTGGCTC 
ATCTTCGGTA 
CTGGTCAAGA 
CAGGGCAGCG 
GCACCCGAGG 
GACATCTGGG 
ATCGTGGAGG 
GACATGAAGA 
GCAGACCCGG 
TCTGCCCGAC 
GAGAAGCCTA 
CTGGGGGGGT 
GTGTGCTGGG 
TACAGCTGGG 
GACGCCTGGC 
ACAGGATGCA 
TCAGAGCCCG 
CTGCCAGGGT 
GCCTCTAGCA 
TCAGCTCCAT 
CAGCTGCCTG 
TGAGGTGTGG 
GACTTTCAGA 
TGTCAAGAGA 
TCAGGACCTT 
TTGTCCAGGC 
TCAAATCATT 
GCCTGGCTAA 
CTCGAACTCC 
GGTGTGAGCC 
GTCCTGATGT 
AGCTAACTTC 
ATGGGGGTTT 
AGTCTGCAAG 
GCTCTGGGCC 
CCCCAGGACT 
ATCCAAGAAG 
GGSCCAGAGA 
GGGCATTTGG 



41 

I 

GGCAGGAAGA 
GCTGCCGCGC 
GCCGGACCCC 
AGGTGGCCCC 
CCATGACCTT 
AGGGAGACCC 
ATTGCAAGGG 
GGAGGCACCC 
GCCCCACCGA 
CCCTGGTGCT 
CCCTGATCCT 
TGTGGCATGT 
CCAGTCTCAT 
GTGACTGCAC 
GGCAGGTTGC 
TGTGGCACGG 
TCCGGGAGAC 
TCGCCTCAGA 
ACGAGCACGG 
TGAGGCTAGC 
CACAGGGCAA 
GCAACCTGCA 
ATTACCTGGA 
TGCTGGAGGA 
CCTTTGGCCT 
ACTATAGACC 
AGGTGGTGTG 
TCCTCTCAGG 
TCACCGCGCT 
AAGTGATTCA 
GGGGGGCAGT 
GATGGGCAGC 
CTGAAACCTG 
TCTCTCCCCA 
AAAGAGGCTC 
GGCCTGCACT 
GGCACAGGGC 
TAAGCTCCAG 
GATGCCTTGG 
AGAGAGCTGG 
CAGGATCACA 
TTAACTGAGA 
CCCAGGTCTG 
TTCTTTTCTT 
TAGAGTGCAA 
CTCTTGCCTC 
TTTTGTATAT 
TGACCTCAGG 
ATCGCGCCTG 
CCTTTGAGGC 
YTCAGTCTCA 
GAAAATAACT 
CTCCAGTTCT 
TTTTGACCAC 
GCAGGGOGGC 
GCTCCAGCTC 
GTGTGTCTCA 
GAAATTTTCA 



51 
I 

CGCTGGAATA 
CAGCTGCGCC 
AGCCCGCCGT 
GGTCCGCCGA 
GGGCTCCCCC 
TGTGAAGCCG 
GCCTACCTGC 
CCAGGAACAT 
GTTOGTCAAC 
GGAGGCCACC 
GGGCCCOGTG 
CCGACGGAGG 
CCTGAAAGCA 
CACAGGGAGT 
CTTGGTGGAG 
TGAGAGTGTG 
TGAGATCTAT 
CATGACCTCC 
CTCCCTCTAC 
TGTGTCCGCG 
ACCAGCCATT 
GTGTTGCATC 
CATCGGCAAC 
GCAGATCCGC 



ACCCTTCTAT 
TGTGGATCAG 
CCTAGCTCAG 
GOGGATCAAG 
ATAGCCCAGG 
GGATGGTGCC 
TGOGCCTGCC 
ATCCCCTGCT 
CCCCTATGGC 
CAGAGTCAGA 
TTGCCCCCTG 
CCTGTCCAGC 
AGAGCCAGGG 
GCTTTCTGTC 
GGCCTGACTT 
GGCCAGTGGA 
GGATATCGAG 
ACCCCGGATG 
TTTTCCTTCT 
TGGCATGATC 
AGACTCCCGA 
TTAGTAGAAA 
TGTTCCACCT 
GCCAGGACCT 
TTCTTTAGCT 
TCTATTCCTT 
TTACCTGACT 
TGCCTAAAAC 
AAGCCAGCCC 
TTCCTCCAAG 
CCCTACTGGC 
GGAGAATTCA 
AGGRTGTATG 



60 
120 
180 
240 
300 
360 
420 
462 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 






TATGGYTCAC 
GAAGTGGATT 
GACAAGGACA 
GCAGTGAAGA 
GCATCTTATG 
CATTGTGCAA 
TGGATGGGCT 
AATCCACCCA 
GGAACAAACT 
TGGAAAATCC 
GAGAAGGG6G 
ATGCTTCTGT 
AGACGCTGTT 
ATGGTTAAAT 



GTATGGWGCA 
G6A6G6GAGC 
GCCCCAAGGT 
AAGCTCTCCC 
TGTGTCTTCC 
GGCTOGGAAG 
AGGTTCCCAG 
GCCCACGAAT 
CCTGCTGAGA 
CTAAGAGAAG 
CCCAATGGCC 
CTGAGTGCAG 
TGTGGGAGCA 
CCTGAAAAAA 



GGTTGTCCTG 
TTGAGGAATA 
TGGGAAGACC 
CGCTCCTGCT 
ACCATCCTCA 
AGAACCAGGA 
ATCATTAGGG 
CATCTCCCTC 
CCCCACAGCC 
GCCTGGGGGA 
AGGGAGTGAA 
GAAGGTGTTC 
CTGGGCTCAT 
AAAAAAAAA 



GTCCYKGGGT 
TAAGGAGCGG 
TGGCCTTAGT 
GTAATGACCC 
TGGTGGCACT 
AGTGAAACTG 
CAGAGTTTGC 
TTTGAAGGAT 
AGAAACTGAA 
MAGGAAKTGG 
GGAGGTGGCG 
CAGGGTCGAA 
GCCTGGCACA 



GCAGGGAAGT 
GGGTGGAGAC 
CGTCCTCAGC 
AGAGTAGCCT 
TTTCTAGGCC 
GGTGAAAACA 
ACGTCCTCTG 
TTTWATTTCT 
AGCAGCAGCT 
AGTGACAGGG 
TTGCTGAGAG 
ATTACACTTC 
CAATAGGTCT 



GGGCTGCAGG 
TCAGGCTATG 
CTAGGGCAGG 
CCCCAGGCCG 
TGTCTCCCAG 
GAAAGCTCAA 
GTTCACTGGG 
ACTGGGTTTT 
CCCCAAAGCC 
GACAGGTAGA 
CAGTCTGCAC 
TCGTACCTGG 
GCAATAAACC 



3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 



Seq ID NO: 333 Protein sequence 
Protein Accession #: NP_000011 



MTLGSPRKGL 
RHPQEHRGCG 
LILGPVLALL 
DCTTGSGSGL 
RETEIYNTVL 
RLAVSAACGL 
YLDIGNNPRV 
YRPPFYDWP 
TALRIKKTLQ 



11 

I 

LMkLMALVTQ 
NLHRELCRGR 
ALVALGVLGL 
PPLVQRTVAR 
LRHDNILGFI 
AHLHVEIFGT 
GTKRYMAPEV 
NDPSFEDMKK 
KISNSPEKPK 



21 
I 

GDPVKPSRGP 
PTEFVNHYCC 
WHVRRRQEKQ 
QVALVECVGK 
ASDMTSRNSS 
QGKPAIAHRD 
LDEQIRTDCF 
WCVDQQTPT 
VIQ 



31 

I 

LVTCTCESPH 
DSHLCNHNVS 
RGLHSELGES 
GRYGEVWRGL 
TQLWLITHYH 
FKSRNVLVKS 
ESYKWTDIWA 
IPNRLAADPV 



41 
I 

CKGPTCRGAW 
LVLEATQPPS 
SLILKASEQG 
WHGESVAVKI 
EHGSLYDFLQ 
NLQCCIADLG 
FGLVLWEIAR 
LSGLAQMMRE 



51 
I 

CTWLVREEG 
EQPGTDGQLA 
DTMLGDLLDS 



RQTLEPHLAL 
LAVMHSQGSD 
RTIVNGIVED 
CWYPNPSARL 



Seq ID NO: 334 DNA sequence 

Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 108-329 



1 

I 

GGCACGAGCT 
AGCGGCTCCG 
ACATCGAAGA 
AAGTGAAGTT 
AAGAACGTTC 
AAGAAAAAGG 
AGAACTAGTT 
TGAAATTAAA 
GTGCTACTCA 
ACAATTATGG 
GCTTCAAATA 



11 

I 

CGTGCCGGCC 
CTGCCAGAGC 
TTTGCCAGAG 
GCAGAGACAA 
TGGAGAGGAT 
CAGCTGTGTT 
TGTTTTAGTT 
AGGAGACTTT 
TCTTTGCTCA 
AAATAAGAAC 
AAGTTTTGTC 



21 
I 

TTCAGTTGTT 
TAGCCCGAGC 
AAGGAAAAAC 
CAAGTGTCTA 
CCTCTAGTAA 
ATTTCATAAA 
TTCCCAGATA 
CTTAAGCACC 
CTATGCAGTC 
ATTACTTGAG 
TT 



31 

I 

TCGGGACGCG 
CCGGTTCTGG 
TGAAAATGGA 
AATGTTCTGA 
AGGGAATTCC 
TAACTTGGGA 
AAACCAACAT 
ATATAGATAG 
TTTTTTAAGA 
CATGACACTT 



41 
I 

CCGAGCTTCG 
GGCGAAAATG 
AGTTGAGCAG 
AGAAATAAAG 
AGAAGACAAG 
GAAACTGCAT 
GCTTTTTAAG 
GGTTATGTAT 
GAGGAGAGAG 
CTTTCAGTAT 



CCGCTCTTCC 

CTTCGCAAAG 
AACTATATTG 
AACCCCTTTA 
CCTAAGTGGA 
GAAGGAAGAA 
AAAAGCATAT 
TATCAGATGT 
ATTGCTTGAT 



Seq ID NO: 335 Protein sequence 
Protein Accession #: NP_004117.1 



1 11 21 31 41 51 

I I I I I I 

MPALHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPLVKGIPED 

KNPFKEKGSC VIS 



Seq ID NO: 336 DNA sequence 
Nucleic Acid Accession #: NM_005795 
Coding sequence: 555-1940 



GCACGAGGGA 
CAAGCTCTGC 
TTCCCACCTT 
TGAGAATATT 
AAGAAATTCT 
GACAATTGTG 
GAATAATAAA 
AAAGAAAACT 
ACAAGGTTGC 
ATTTGGGCTT 
TTATGATTCT 
TTACTAGAAA 
CCATTCAACA 
ACGATGTTGC 
ATCCATCAGA 
CAAGCAACAG 
AGACTGCACT 
TGCTTATCTC 
TACACAAAAA 
CTGCAGTGGC 
AGTTCATTCA 
ACCTACACAC 
ATTTTCTTGG 



11 
I 

ACAACCTCTC 
TAACTGAATC 
GCTTGTGGGT 
TCACAAAGAA 
TAAAGACAAT 
CATATCGTCT 
AACCCATACT 
ACTACAACTT 
TATAAAACAA 
AATGATGGAG 
TGTTACAGCA 
TAAAATCATG 
AGCAGAAGGC 
AGCAGGAACT 
AAAAGTTACA 
AACATGGACA 
AAATTTGTTT 
GCTTGGCATA 
TCTGTTCTTC 
CAACAACCAG 
TCTTTACCTG 
ACTCATTGTG 
CTGGGGATTT 



21 
I 

TCTCTSCAGC 
TCATCCTAAT 
AAATCTCTTC 
TTTCCTTAAG 
GTCAAATATG 
AATAATAAAA 
AGCCTATAGA 
GACAAGACTG 
GATTGCTACA 
AAAAAGTGTA 
GAATTAGAAG 
ACAGCTCAAT 
GTTTACTGCA 
GAATCAATGC 
AAGATCTGTG 
AATTATACCC 
TACCTGACCA 
TTCTTTTATT 
TCATTTGTTT 
GCCTTAGTAG 
ATCGGCTGTA 
GTGGCCGTGT 
CCACTGATTC 



31 

I 

AGAGAGT6TC 
TGCAGGATCA 
TGCGGAATCT 
AGCTGGACTG 
ATCCAAGAGA 
ACCCATACTA 
AAACAATATT 
CTGCAAACTT 
ACTTCTAGTT 
CCCTGTATTT 
AGAGTCCTGA 
ATGAATGTTA 
ACAGAACCTG 
AGCTCTGCCC 
ACCAAGATGG 
AGTGTAATGT 
TAATTGGACA 
TCAAGAGCCT 
GTAACTCTGT 
CCACAAATCC 
ATTACTTTTG 
TTGCAGAGAA 
CTGCTTGTAT 



41 
I 

ACCTCCTGCT 
CATTGCAAAG 
CAGAAAGTAA 
GGTCTTGACC 
AAATGTGATT 
GCCTATAGAA 
TGAAAGATTG 
CAATTGGTCA 
TATGTTATAC 
TCTGGTTCTC 
GGACTCAATT 
CCAAAAGATT 
GGATGGATGG 
TGATTACTTT 
AAACTGGTTT 
TAACACCCAC 
CGGATTGTCT 
AAGTTGCCAA 
TGTAACAATC 
TGTTAGTTGC 
GATGCTCTGT 
GCAACATTTA 
ACATGCCATT 



51 
I 

TTAGGACCAT 

CTTTCACTCT 

AGTTCCATCC 

CCTGGAATTT 

TGAGTCTGGA 

AACAATATTT 

CTACCACTAA 

CCACAACTTG 

AGCATATTTC 

TTGCCTTTTT 

CAGTTGGGAG 

ATGCAAGACC 

CTCTGCTGGA 

CAGGACTTTG 

AGACATCCAG 

GAGAAAGTGA 

ATTGCATCAC 

AGGATTACCT 

ATTCACCTCA 

AAAGTGTCCC 

GAAGGCATTT 

ATGTGGTATT 

GCTAGAAGCT 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 




TATATTACAA TGACAATTGC TGGATCAGTT CTGATACCCA TCTCCTCTAC ATTATCCATG 1440 

GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT GTTAAATATT GTA0GC6TTC 1500 

TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA TCTQTACATG AAAGCTGTGA 1560 

GAGCTACTCT TATCTTGGTO CCATTGCTTG GCATTGAATT TGTGCTGATT CCATGGOGAC 1620 

CTGAAGGAAA GATTQCAGAG GAGGTATATG ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860 

GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

CAGAAAATTT ATATAATTGA AAATAGAAGG ATGGTTGTCT CACTGTTTGG TGCTTCTCCT 1980 

AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

ATCCAGCTCT ATGTGGGAAA AAAGAAATCC TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

CACTATGCCT GATGTGAOGC TACTAACCTG ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220 

ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

GACCTAGCTA AGGTCTATAA ACATGAAGGG AAAATTAGCT TTTAGTTTTA AAACTCTTTA 2400 

TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

CTATGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATCTTGTGGC ATATCCATTG TGGAAACTGG ATGAACAGGA TGTATAATAT GCAATCTTAC 2640 

TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATGCTACAAA ACACCTTGTC AACCTCTTCC 2700 

TGTCTTACCA AACAGTGGGA GGGAATTCCT AGCTGTAAAT ATAAATTTTG CCCTTCCATT 2760 

TCTACTGTAT AAACAAATTA GCAATCATTT TATATAAAGA AAATCAATGA AGGATTTCTT 2820 

ATTTTCTTGG AATTTTGTAA AAAGAAATTG TGAAAAATGA GCTTGTAAAT ACTCCATTAT 2880 

TTTATTTTAT AGTCTCAAAT CAAATACATA CAACCTATGT AATTTTTAAA GCAAATATAT 2940 

AATGCAACAA TGTGTGTATG TTAATATCTG ATACTGTATC TGGGCTGATT TTTTAAATAA 3000 
AATAGAGTCT GGAATGCT 

Seq ID NO: 337 Protein sequence 
Protein Accession #: NP_005786.1 

1 11 21 31 41 51 

I I I I I I 

MEKKCTLYFL VLLPFFMILV TAELEESPED SIQLGVTRNK IMTAQYECYQ KIMQDPIQQA 60 

EGVYCNRTWD GWLCWNDVAA GTESMQLCPD YFQDFDPSEK VTKICDQDGN WFRHPASNRT 120 

WTNYTQCNVN THEKVKTALN I*FYLTIIGHG LSIASLLISL GIFFYFKSLS CQRITLHKNL 180 

FFSFVCNSW TIIHLTAVAN NQALVATNPV SCKVSQPIHL YLMGCNYFWM LCEGIYLHTL 240 

IWAVPAEKQ HLMWYYFLGW GFPL1PACIH AIARSLYYND NCWISSDTHli LY1IHGPICA 300 

ALLVNLFFLL NIVRVLITKL KVTHQABSNL YMKAVRATLI LVPLLGIEFV LIPWRPEGKI 360 

AEEVYDYIMH ILMHFQGLLV STIFCFFNGB VQAILRRNWN QYKIQFGNSF SNSEALRSAS 420 
YTVSTISDGP GYSHDCPSEH LNGKSIHDIE NVLLKPENLY N 

Seq ID NO: 338 DNA sequence 
Nucleic Acid Accession #: NM_001795 
Coding sequence: 25-2379 

1 11 21 31 41 51 

I I I I I I 

GCACGATCTG TTCCTCCTGG GAAGATGCAG AGGCTCATGA TGCTCCTCGC CACATCGGGC 60 

GCCTGCCTGG GCCTGCTGGC AGTGGCAGCA GTGGCAGCAG CAGGTGCTAA CCCTGCCCAA 120 

CGGGACACCC ACAGCCTGCT GCCCACCCAC CGGCGCCAAA AGAGAGATTG GATTTGGAAC 180 

CAGATGCACA TTGATGAAGA GAAAAACACC TCACTTCCCC ATCATGTAGG CAAGATCAAG 240 

TCAAGCGTGA GTCGCAAGAA TGCCAAGTAC CTGCTCAAAG GAGAATATGT GGGCAAGGTC 300 

TTCCGGGTCG ATGCAGAGAC AGGAGACGTG TTCGCCATTG AGAGGCTGGA CCGGGAGAAT 360 

ATCTCAGAGT ACCACCTCAC TGCTGTCATT GTGGACAAGG ACACTGGTGA AAACCTGGAG 420 

ACTCCTTCCA GCTTCACCAT CAAAGTTCAT GACGTGAACG ACAACTGGCC TGTGTTCACG 480 

CATCGGTTGT TCAATGCGTC CGTGCCTGAG TCGTCGGCTG TGGGGACCTC AGTCATCTCT 540 

GTGACAGCAG TGGATGCAGA CGACCCCACT GTGGGAGACC ACGCCTCTGT CATGTACCAA 600 

ATCCTGAAGG GGAAAGAGTA TTTTGCCATC GATAATTCTG GACGTATTAT CACAATAACG 660 

AAAAGCTTGG ACCGAGAGAA GCAGGCCAGG TATGAGATCG TGGTGGAAGC GCGAGATGCC 720 

CAGGGCCTCC GGGGGGACTC GGGCACGGCC ACCGTGCTGG TCACTCTGCA AGACATCAAT 780 

GACAACTTCC CCTTCTTCAC CCAGACCAAG TACACATTTG TCGTGCCTGA AGACACCCGT 840 

GTGGGCACCT CTGTGGGCTC TCTGTTTGTT GAGGACCCAG ATGAGCCCCA GAACCGGATG 900 

ACCAAGTACA GCATCTTGCG GGGCGACTAC CAGGACGCTT TCACCATTGA GACAAACCCC 960 

GCCCACAACG AGGGCATCAT CAAGCCCATG AAGCCTCTGG ATTATGAATA CATCCAGCAA 1020 

TACAGCTTCA TCGTCGAGGC CACAGACCCC ACCATCGACC TCCGATACAT GAGCCCTCCC 1080 

GCGGGAAACA GAGCCCAGGT CATTATCAAC ATCACAGATG TGGACGAGCC CCCCATTTTC 1140 

CAGCAGCCTT TCTACCACTT CCAGCTGAAG GAAAACCAGA AGAAGCCTCT GATTGGCACA 1200 

GTGCTGGCCA TGGACCCTGA TGCGGCTAGG CATAGCATTG GATACTCCAT CCGCAGGACC 1260 

AGTGACAAGG GCCAGTTCTT CCGAGTCACA AAAAAGGGGG ACATTTACAA TGAGAAAGAA 1320 

CTGGACAGAG AAGTCTACCC CTGGTATAAC CTGACTGTGG AGGCCAAAGA ACTGGATTCC 1380 

ACTGGAACCC CCACAGGAAA AGAATCCATT GTGCAAGTCC ACATTGAAGT TTTGGATGAG 1440 

AATGACAATG CCCCGGAGTT TGCCAAGCCC TACCAGCCCA AAGTGTGTGA GAACGCTGTC 1500 

CATGGCCAGC TGGTCCTGCA GATCTCOGCA ATAGACAAGG ACATAACACC ACGAAACGTG 1560 

AAGTTCAAAT TCACCTTGAA TACTGAGAAC AACTTTACCC TCACGGATAA TCACGATAAC 1620 

ACGGCCAACA TCACAGTCAA GTATGGGCAG TTTGACCGGG AGCATACCAA GGTCCACTTC 1680 

CTACCCGTGG TCATCTCAGA CAATGGGATG CCAAGTCGCA CGGGCACCAG CACGCTGACC 1740 

GTGGCCGTGT GCAAGTGCAA CGAGCAGGGC GAGTTCACCT TCTGCGAGGA TATGGCCGCC 1800 

CAGGTGGGCG TGAGCATCCA GGCAGTGGTA GCCATCTTAC TCTGCATCCT CACCATCACA 1860 

GTGATCACCC TGCTCATCTT CCTGCGGOGG CGGCTCCGGA AGCAGGCCCG CGCGCACGGC 1920 

AAGAGCGTGC CGGAGATCCA CGAGCAGCTG GTCACCTACG ACGAGGAGGG CGGCGGCGAG 1980 

ATGGACACCA CCAGCTACGA TGTGTCGGTG CTCAACTCGG TGCGCCGCGG CGGGGCCAAG 2040 

CCCCCGCGGC CCGCGCTGGA CGCCCGGCCT TCCCTCTATG CGCAGGTGCA GAAGCCACCG 2100 

AGGCACGCGC CTGGGGCACA CGGAGGGCCC GGGGAGATGG CAGCCATGAT CGAGGTGAAG 2160 

AAGGACGAGG CGGACCACGA CGGCGACGGC CCCCCCTACG ACACGCTGCA CATCTACGGC 2220 

TACGAGGGCT CCGAGTCCAT AGCCGAGTCC CTCAGCTCCC TGGGCACCGA CTCATCCGAC 2280 

TCTGACGTGG ATTACGACTT CCTTAACGAC TGGGGACCCA GGTTTAAGAT GCTGGCTGAG 
CTGTACGGCT CGGACCCOCG GGAGGAGCTG CTGTATTAGG CGGCCGAGGT CACTCTGGGC 
CTGGGGACCC AAACCCCCTG CAGCCCAGGC CAGTCAGACT CCAGGCACCA CAGCCTCCAA 
AAATGGCAGT GACTCCCCAG CCCAGCACCC CTTCCTCGTG GGTCCCAGAG ACCTCATCAG 
CCTTGGGATA GCAAACTCCA GGTTCCTGAA ATATCCAGGA ATATATGTCA GTGATGACTA 
TTCTCAAATG CTGGCAAATC CAGGCTGGTG TTCTGTCTGG GCTCAGACAT CCACATAACC 
CTGTCACCCA CAGACCGCCG TCTAACTCAA AGACTTCCTC TGGCTCCCCA AGGCTGCAAA 
GCAAAACAGA CTGTGTTTAA CTGCTGCAGG GTCTTTTTCT AGGGTCCCTG AACGCCCTGG 
TAAGGCTGGT GAGGTCCTGG TGCCTATCTG CCTGGAGGCA AAGGCCTGGA CAGCTTGACT 
TGTGGGGCAG GATTCTCTGC AGCCCATTCC CAAGGGAGAC TGACCATCAT GCCCTCTCTC 
GGGAGCCCTA GCCCTGCTCC AACTCCATAC TCCACTCCAA GTGCCCCACC ACTCCCCAAC 
CCCTCTCCAG GCCTGTCAAG AGGGAGGAAG GGGCCCCATG GCAGCTCCTG ACCTTGGGTC 
CTGAAGTGAC CTCACTGGCC TGCCATGCCA GTAACTGTGC TGTACTGAGC ACTGAACCAC 
ATTCAGGGAA ATGCTTATTA AACCTTGAAG CAACTGTGAA TTCATTCTGG AGGGGCAGTG 
GAGATCAGGA GTGACAGATC ACAGGGTGAG GGCCACCTCC ACACCCACCC CCTCTGGAGA 
AGGCCTGGAA GAGCTGAGAC CTTGCTTTGA GACTCCTCAG CACCCCTCCA GTTTTGCCTG 
AGAAGGGGCA GATGTTCCCG GAGATCAGAA GACGTCTCCC CTTCTCTGCC TCACCTGGTC 
GCCAATCCAT GCTCTCTTTC TTTTCTCTGT CTACTCCTTA TCCCTTGGTT TAGAGGAACC 
CAAGATGTGG CCTTTAGCAA AACTGACAAT GTCCAAACCC ACTCATGACT GCATGACGGA 
GCCGAGCATG TGTCTTTACA CCTCGCTGTT GTCACATCTC AGGGAACTGA CCCTCAGGCA 
CACCTTGCAG AAGGAAGGCC CTGCCCTGCC CAACCTCTGT GGTCACCCAT GCATCATTCC 
ACTGGAACGT TTCACTGCAA ACACACCTTG GAGAAGTGGC ATCAGTCAAC AGAGAGGGGC 
AGGGAAGGAG ACACCAAGCT CACCCTTCGT CATGGACOGA GGTTCCCACT CTGGCAAAGC 
CCCTCACACT GCAAGGGATT GTAGATAACA CTGACTTGTT TGTTTTAACC AATAACTAGC 
TTCTTATAAT GATTTTTTTA CTAATGATAC TTACAAGTTT CTAGCTCTCA CAGACATATA 
GAATAAGGGT TTTTGCATAA TAAGCAGGTT GTTATTTAGG TTAACAATAT TAATTCAGGT 
TTTTTAGTTG GAAAAACAAT TCCTGTAACC TTCTATTTTC TATAATTGTA GTAATTGCTC 
TACAGATAAT GTCTATATAT TGGCCAAACT GGTGCATGAC AAGTACTGTA TTTTTTTATA 
CCTAAATAAA GAAAAATCTT TAGCCTGGGC AACAAAAAAA 






Seq ID NO: 339 Protein sequence 
Protein Accession #: NP_001786 



MQRLMMLLAT 
NTSLPHHVGK 
VIVDKDTGEN 
PTVGDHASVM 
TATVLVTLQD 
DYQDAFTIET 
INITDVDEPP 
VTKKGDIYNE 
KPYQPKVCEN 
GQFDREHTKV 
WAILLCILT 
SVLNSVRRGG 
DGPFYDTLHI 
ELLY 



11 
I 

SGACLGLLAV 
IKSSVSRKNA 
LETPSSFTIK 
YQILKGKEYP 
INDNFPFPTQ 
NPAHNEGIIK 
IFQQPFYHFQ 
KELDREVYPW 
AVHGQLVLQI 
HFLPWISDN 
ITVITLLIFL 
AKPPRPALDA 
YGYEGSESIA 



21 
I 

AAVAAAGANP 
KYLLKGEYVG 
VHDVNDNWPV 
AIDNSGRIIT 
TKYTPWPED 
PMKPLDYEYI 
LKBNQKKPLI 
YNLTVEAKEL 
SAIDKDITPR 
GMPSRTGTST 
RRRLRKQARA 
RPSLYAQVQK 
ESLSSLGTDS 



31 
I 

AQRDTHSLLP 
KVFRVDAETG 
FTHRLFNASV 
ITXSLDREKQ 
TRVGTSVGSL 
QQYSFIVEAT 
GTVLAMDPDA 
DSTGTPTGKE 
NVKFKFTLNT 
LTVAVCKCNE 
HGKSVPEIHE 
PPRHAPGAHG 
SDSDVDYDFL 



41 
I 

THRRQKRDWI 
DVFAIERLDR 
PESSAVGTSV 
ARYEIWEAR 
FVEDPDEPQN 
DPTIDLRYMS 
ARHSIGYSIR 
SIVQVHIEVL 
ENNFTLTDNH 
QGEFTPCEDM 
QLVTYDEEGG 
GPGEMAAMZE 
HDWGPRFKML 



51 
I 

WNQMHIDEEK 
ENISEYHLTA 
ISVTAVDADD 
DAQGLRGDSG 
RMTKYSILRG 
PPAGNRAQVI 
RTSDKGQFFR 
DENDNAPEFA 
DNTANITVKY 
AAQVGVSIQA 
GEMDTTSYDV 
VKKDEADHDG 
AELYGSDPRE 



Seq ID NO: 340 DNA sequence 
Nucleic Acid Accession #: NM_003088 
Coding sequence: 112-1593 



GCGGAGGGTG 
CCCGCCACCC 
AACGGCACAG 
CTGACGGCCG 
CAGATCTGGA 
AGCCACCTGG 
GTGCCCGGTC 
CAGTCCGAGG 
CAGACGGTGT 
ATCTACAGTG 
GCCGTGGACC 
CAGCGCTACA 
GCGCGCCCCG 
CGCGACTGCG 
AAGGCCACCA 
GTGCTGCAGG 
AATCAGGACG 
AAGTGTGCCT 
CAGTCCACCG 
CGCATCACAC 

CCCATCATCG 
CTGGACGCCA 
AACATCAAAG 
AGCGGCGACA 
AAGGTGGGCG 
ACCGTGGACC 
CCACATGGCG 
GGCGGGAGGC 
CCTGTCGCCC 



11 

I 

CGTGCGGGCC 
ACCTCCCGGG 
CCGAGGCGGT 
AGGCGTTCGG 
CGCTGGAGCA 
GCCGCTACCT 
CCGACTGCCG 
CGCACCGGCG 
CCCCOGCCGA 
TCACCCGTAA 
GCGACGTGCC 
GCGTGCAGAC 
AGCCGGCCAC 
AGGGCCGTTA 
AGGTGGGCAA 
CGGCCAACGA 
AGGAGACCGA 
TCCGTACCCA 
CCTCCAGCAA 
TGAGGGCGTC 
TGGAGACAGC 
TGTTCCGCGG 
ACCGCTCCAG 
ACTCCACAGG 
CTCCTGTGGA 
GGCGCTACCT 
CCGCCTCGCT 
GCTCCTGCCA 
AAGCCCCCTT 
CTATGGACTC 



21 



GCCGCGCAGC 
GCAGATCCAG 
GTTCAAGGTG 
GCCCCCTGAC 
GGCGGCGGAC 
TTTCCTCATC 
CTACTTCGGC 
GAAGTGGAGC 
GCGCTAOGCG 
CTGGGGCGTC 
CGCCGACCAC 
TGGCTACACG 
CCTGGCGCCG 
GGACGAGCTC 
GAGGAACGTG 
CCAGGAGACC 
CACGGGCAAG 
GAATGCCAGC 
CAATGGCAAG 
AGGGGACTCA 
GGAGCATGGC 
CTATGACGTC 
CAAATACTGG 
CTTCTTCTTC 
GAAGGGCGAC 
CTGGGAGTAC 
ACCCTCCCTG 
GCCTTTCAAA 
CCCACTCTCC 



31 
I 

AACAAAGGAG 
GGCCTCTCGT 
TTCGGCCTCA 
AAOGCGTCCG 
GAGGCGGGCA 
AAGGACGGCA 
GTGGCGCACG 
GGCACCGAGG 
GTGCACATCG 
CACCTGAGOG 
GACTCGCTCA 
CGCTTCCTGC 
CTGGAGTTCC 
TCGGGGCCCA 
TTTGCTCTGG 
TCCACGCGCC 
TTCCAGCTGG 
TACTGGACGC 
TGCTACTTTG 
TTTGTGACCT 
GAGCTCTTCC 
TTCATCGGCT 
TTCCAGCTGG 
ACGGTGGGCA 
GAGTTCTGCG 
CACGCAGGCG 
TAGGOCCGGC 
CTAACCCCTT 
CTGGAAACCC 
CCTCCGCCCG 



41 

I 

CAGGGGCGCC 
CTACTGCCAC 
TCAACTGCGG 
CCAGCAGCCT 
GCGCGGCCGT 
ACGTGACCTG 
ACGACGGTCG 
ACCGCCTGTC 
CCATGCACCC 
CGCGGCCGGC 
TCACCCTCGC 
GCCACGACGG 
GCTCCGGCAA 
GCGGCACGCT 
AGCAGAGCTG 
AGGGTATGGA 
AGATCGACCG 
TGACGGCCAC 
ACATCGAGTG 
CCAAGAAGAA 
TCATGAAGCT 
GCCGCAAGGT 
AGTTCAACGA 
GTGACTCCGC 
ACTATAACAA 
TCCTGAAGGC 
CCGTCCTTCC 
CTCCGCCAGG 
CAGAGAAAAC 
GGTTCCCTAC 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



51 
I 

GCCGCAGGGA 
CATGACCGCC 
CAACAAGTAC 
GAAGAAGAAG 
GTGCCTGCGC 
CGAGCGCGAG 
CTGGTCGCTG 
CTGCTTCGCG 
TCAGGTCAAC 
CGACGAGATC 
CTTCCAGGAC 
GCGCCTGGTG 

CAAGGCGGGC 
CGCCCAGGTC 
CCTGTCTGCC 
CGACACCAAA 
CGGGGGCGTG 
GCGTGACCGG 
TGGGCAGCTG 
CATCAACCGC 
CACGGGCACC 
TGGCGCCTAC 
GGTCACCAGC 
GGTGGCCATC 
CTCGGCGGAA 
CCGCCCCTGC 
TGGGCTCCAG 

TCCCCTOGGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 




TCAGCGGCTG CGQCCTGGCC CTGGGAGGGA TTTCAGATGC 
CGGGGCQAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG 
GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTOGGC 
TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA 
CTGTCAGTGG CCCTCCCTGG TGCACTGTCC COGAAACCCC 
CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT 
CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG 
ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG 
GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT 
CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG 
GTAGTAGOGA GTGATCTGGC GGGGGGOGTC TCAGCACCCT 
CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG 
GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC 

cgccccctcc gggagccctg gggtgagcog ccggggcccc 
tccccaacat gcatctcact ctgggtgtct tggtctttta 
tataactcta aacgcccatg atagtagctt caaactggaa 
agtctgc 

Seq ID NO: 341 Protein sequence 
Protein Accession #: NP_003079 






CCCTGCCCTC 
CTCTGAGCCT 
GTGTAGTGTA 

gctgggcaca 
tgcttgggaa 
ggtggctgga 
agcggcaggg 
gcgggogggt 
gactggaagc 
cgtcccaggc 

CCCCAGGGGG 
CCTGGGCTGC 
GGGTGGATGA 
CCTGCTGCCA 
TTTTTTGTAA 
ATAGCGAAAT 



TTGTCTGCCA 
TATTTCTCTG 
ACTGGAATCT 
TGTCCCAAGC 
GGGAAGCTGT 
AACAGCCCCT 
CGTGAOGGCC 
AGGGGTGTGG 
AGAAAATGAC 
AAGCCTGGCT 
TGCATCTCAG 
CGACACCTGG 
AGCCAGGOGT 
GCCTCCCCCG 
GTGTCATTTG 
AAAATAACTC 



MTANGTAEAV 
CLRSHLGRYL 
CFAQTVSPAE 
FQDQRYSVQT 
KAGKATKVGK 
DTKKCAFRTH 
GQLAASVETA 
GAYNIKDSTG 
SAETVDPASL 



11 
I 

QIQFGLINCG 
AADKDGNVTC 
KWSVHIAMHP 
ADHRPLRHDG 
DELFALBQSC 
TGKYWTLTAT 
GDSBLFLMKL 
KYWTVGSDSA 
WEY 



21 
I 

NKYLTAEAFG 
EREVPGPDCR 
QVNIYSVTRK 
RLVARPEPAT 
AQWLQAANE 
GGVQSTASSK 
INRPIIVFRG 
VTSSGDTPVD 



31 
I 

FKVNASASSL 
PLIVAHDDGR 
RYAHLSARPA 
GYTLEFRSGK 
RNVSTRQGMD 
NASCYFDIEW 
EHGFIGCRKV 
FFFEFCDYNK 



41 



I 

KKKQIWTLEQ 
WSLQSEAHRR 
DEIAVDRDVP 
VAFRDCEGRY 
L5ANQDEETD 
RDRRITLRAS 
TGTLDANRSS 
VAIKVGGRYL 



Seq ID NO: 342 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 660-1705 



CGCTCCGCAC 
GGGATTCTTG 
CCGGGTTGGG 
CTCGAGCGGG 
CCGGCGTTCG 
CACCCCACTG 
GCTACCATGA 
ACCGTGCGTG 
CCCGGAGGGG 
TGCACCCTGG 
CTCGAAAGCG 
TGCGCGGTAC 
ACCTGCGOGC 
GCCCCGGGGC 
TGGACTTCAG 
CAGTTACTTG 
TGTGTCCCTG 
TAGACGACTT 
GCCGCTCTTG 
CCAGGCGCCC 
TCGAOGAGAA 
TTCCTGAGAT 
AAGCCGAGTC 
CGACTTCCTC 
TGAGCACAGC 
TCTGCTTTCA 
TGGAGAGTGA 
GGGTGAAAGT 
CCCCTCTTGG 



11 

I 

ACATTTCCTG 
GCAGTTGGGG 
GAAGCCAGCT 
ACAGATCCAA 
CCCTGTGCCT 
CCGACCGTGC 
AGCGGCAGGC 
CGGGCGCCGA 
GCTCCAAAGA 
AGAACGAGCC 
ACACGCTGCA 
TCCAGGCCAC 
CAACGGCTAC 
CGCCTCTAAC 
TCCACCTGGG 
CATCGOGGAC 
CCCCGGGAGG 
GGGAGGCTTT 
TGTGACCAGT 
GCCGGCCACT 
GCTGGGAGAG 
TCCTCGATGG 
AAAGGCCACT 
TGCCACTCCT 
AGTAGTAGTG 
CGAAAGCCCC 
TCCTGAGCCC 
CGGGGACTGT 
CTCTAGTGAT 



21 
I 

TCGCGGCCTA 
GGTCCGTCGG 
GTAGAGGGCG 
GTTGGGAGCA 
CCTCTGGCAG 
TGGCTGCTCG 
GGCCGAGGAG 
GCTGCGCGCT 
CCTGCTGTTC 
TTTGCGGGGT 
GTGGGTGGAG 
CGGTGGGGTC 
CTGTGCAAGT 
TTGAGCTATC 
ACCGAGGTGA 
GAAATCGGOG 
TACCTCCGTG 
GCCTGCGAAT 
GGGGAAGGAC 
GCAACCAGCC 
ACACCACTTG 
GGATCACAGA 
ATCACCCCAT 
CAGGCTTTCG 
TTGGTGATCT 
TCTTCCCAGC 
GCTGCTTTGG 
GATCTGOGGG 
GCATAG 



31 

I 

AGGGAAACTG 
GAGCGAGGGC 
GTGACCGCGC 
GCTCTGOGTG 
GCGCTCTGGC 
GCCTCGGGGG 
GCCTGCATCC 
GTGCTCGOGC 
TGGGTCGCAC 



GAGCCCCAAC 
GAGCCCGCAG 
ACCAGTTTGA 
GCGCGCCCTT 
GTGCGCTCTG 
CTCGCTGGGA 
CTGGCAAATG 
GTGCTAOGGG 
AGCCGACCCT 
CCGTGCCGCA 
TCCCTGAACA 
GCACGATGTC 
CAGGGAGCGT 
ACTCCTCCTC 
TGACCATGAC 
CAAGGAAGGA 
GCTCCAGTTC 
ACAGAGCAGA 



41 
I 

TTGGCCGCTG 
GGAGGGGAAG 
TCCAGACACA 
CGGGGCCTCA 
CCGGGCGGGG 
CCTGCTACAG 
TGCGAGGTGG 
TCCTGCGGGC 
TGGAGOGCAG 
TGTCCTCCGA 
GCTCCTGCAC 
CTGGAAGGAG 
GGTCTTGTGT 
CCAGCTGCAC 
COGGGGACAG 
CAAACTCTCG 
CGCAGAGCTC 
CTTCGAGCTG 
TGGGGGGACC 
GAGAACATGG 
AGACAATTCA 
TACCCTTCAA 
GATTTCCAAG 
TGCOGTGGTC 
AGTACTGGGG 
GTCTATGGGC 
TGCACATTGC 
GGGTGCCTTG 



Seq ID NO: 343 Protein sequence 
Protein Accession #: FGENESH predicted 



1 
I 

MGKDFMTKTP 
IARIYKELEQ 
GLGKPAVEGG 
PHCRPCWLLG 
RRGLQRPAVL 
RGTPGHRWGR 
DFSPPGTEVS 
DDLGGPACEC 
DEKLGETPLV 
TSSATPQAFD 
ESDPEPAALG 



11 
I 

KAFATKAKID 
IYKKKKPTKT 
DRAPDTALRP 
LGGLLQPAPR 
GRTGAQAFPL 
ARSWKEMRCH 
ALfCRGQLPIS 
ATGFELGKDG 
PEQDNSVTSI 
SSSAWFIFV 



21 
I 

KWDLIKLKSF 
LRTHFLSRPK 
RAGQIQVGSS 
YHEAAGGRGG 
HPGERAFAGF 
LRANGYLCKY 
VTCIADEIGA 
RSCVTSGEGQ 
PEIPRWGSQS 
STAWVLVIL 
VKVGDCDLRD 



31 



CTAKETIIRV 
GNCWPLGPRG 



LHPARWGAQH 
LLAVLRPRRS 
QFEVLCPAPR 
RWDKLSGDVL 
PTLGGTGVPT 
TMSTLQMSLQ 
TKTVLGLVKL 
RAEGALLAES 



41 
I 

NSQPTDWQKT 
DSWQLGGPSG 
GVRPVPPLAG 
RACGRRAARC 
RKRHAAVGGG 
PGAASNLSYR 
CPCPGRYLRA 
RRPPATATSP 
AESKATITPS 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



51 
I 

PPDEAGSAAV 
YFGGTEDRLS 
WGVDSLITLA 
LAPSGPSGTL 
QETFQLEIDR 
NGKFVTSKKN 
YDVFQLEFND 
KGDBAGVLKA 



51 

I 

GGCCCGOGGG 
GGAGGGGGAA 
GCTCTGCGTC 
GAGAATGAGG 
CGGCGGCGAA 
CCTGCACCAC 
GGCGCTCAGC 
AGGCCCAGGG 
GCGTTCCCAC 
CCCOGGCGGT 
GGCGOGGAGA 
ATGCGATGCC 
CCTGCGCCGC 
AGCGCCGCTC 
CTCCCGATCT 
GGCGATGTGT 
CCTAACTGCC 
GGGAAGGACG 
GGGGTGCCCA 
CCAATCAGGG 
GTAACATCTA 
ATGTCCCTTC 
TTTAATTCTA 
TTCATATTTG 
CTTGTCAAGC 
CCGCCGGGCC 
ACAAACAATG 
CTGGCGGAGT 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



51 
I 

FAIYPSDKGV 
ARAEGKGGGT 
ALARAGRRRT 
ARAPAGRPRA 
APTLLHRAEM 
APFQLHSAAL 
GKCAELPNCL 
VPQRTWPIRV 
GSVISKFNST 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



PLGSSDA 



Seq ID NO: 344 DNA sequence 
Nucleic Acid Accession #: NM_012072 
Coding sequence: 149-2107 

1 11 21 31 41 51 

AAAGCCCTCA GCCTTTGTGT CCTTCTCTGC GCCGQAGT6Q CTGCAGCTCA CCCCTCAGCT 60 

CCCCTTGQGQ CCCAGCTGGG AGCCGAGATA GAAGCTCCTG TCGCCGCTGG GCTTCTCGCC 120 

TCCCGCAGAG GGCCACACAG AGACCGGGAT GGCCACCTCC ATGGGCCTGC TGCTGCTGCT 180 

GCTGCTGCTC CTGACCCAGC CCGGGGCGGG GACGGGAGCT GACACGGAGG CGGTGGTCTG 240 

CGTGGGGACC GCCTGCTACA CGGCCCACTC GGGCAAGCTG AGCGCTGCCG AGGCCCAGAA 300 

CCACTGCAAC CAGAACGGGG GCAACCTGGC CACTGTGAAG AGCAAGGAGG AGGCCCAGCA 360 

CGTCCAGCGA GTACTGGCCC AGCTCCTGAG GOGGGAGGCA GCCCTGACGG CGAGGATGAG 420 

CAAGTTCTGG ATTGGGCTCC AGCGAGAGAA GGGCAAGTGC CTGGACCCTA GTCTGCCGCT 480 

GAAGGGCTTC AGCTGGGTGG GCGGGGGGGA GGACACGCCT TACTCTAACT GGCACAAGGA 540 

GCTCCGGAAC TCGTGCATCT CCAAGOGCTG TGTGTCTCTG CTGCTGGACC TGTCCCAGCC 600 

GCTCCTTCCC AACCGCCTGC CCAAGTGGTC TGAGGGCCCC TGTGGGAGCC CAGGCTCCCC 660 

CGGAAGTAAC ATTGAGGGCT TCGTGTGCAA GTTCAGCTTC AAAGGCATGT GCCGGCCTCT 720 

GGCCCTGGGG GGCCCAGGTC AGGTGACCTA CACCACCCCC TTCCAGACCA CCAGTTCCTC 780 

CTTGGAGGCT GTGCCCTTTG CCTCTGCGGC CAATGTAGCC TGTGGGGAAG GTGACAAGGA 840 

CGAGACTCAG AGTCATTATT TCCTGTGCAA GGAGAAGGCC CCCGATGTGT TCGACTGGGG 900 

CAGCTCGGGC CCCCTCTGTG TCAGCCCCAA GTATGGCTGC AACTTCAACA ATGGGGGCTG 960 

CCACCAGGAC TGCTTTGAAG GGGGGGATGG CTCCTTCCTC TGCGGCTGCC GACCAGGATT 1020 

CGGGCTGCTG GATGACCTGG TGACCTGTGC CTCTCGAAAC CCTTGCAGCT CCAGCCCATG 1080 

TCGTGGGGGG GCCACGTGCG TCCTGGGACC CCATGGGAAA AACTACACGT GCCGCTGCCC 1140 

CCAAGGGTAC CAGCTGGACT CGAGTCAGCT GGACTGTGTG GACGTGGATG AATGCCAGGA 1200 

CTCCCCCTGT GCCCAGGAGT GTGTCAACAC CCCTGGGGGC TTCCGCTGCG AATGCTGGGT 1260 

TGGCTATGAG CCGGGOGGTC CTGGAGAGGG GGCCTGTCAG GATGTGGATG AGTGTGCTCT 1320 

GGGTCGCTCG CCTTGCGCCC AGGGCTGCAC CAACACAGAT GGCTCATTTC ACTGCTCCTG 1380 

TGAGGAGGGC TACGTCCTGG CCGGGGAGGA CGGGACTCAG TGCCAGGACG TGGATGAGTG 1440 

TGTGGGCCCG GGGGGCCCCC TCTGCGACAG CTTGTGCTTC AACACACAAG GGTCCTTCCA 1500 

CTGTGGCTGC CTGCCAGGCT GGGTGCTGGC CCCAAATGGG GTCTCTTGCA CCATGGGGCC 1560 

TGTGTCTCTG GGACCACCAT CTGGGCCCCC OGATGAGGAG GACAAAGGAG AGAAAGAAGG 1620 

GAGCACCGTG CCCCGCGCTG CAACAGCCAG TCCCACAAGG GGCCCCGAGG GCACCCCCAA 1680 

GGCTACACCC ACCACAAGTA GACCTTCGCT GTCATCTGAC GCCCCCATCA CATCTGCCCC 1740 

ACTCAAGATG CTGGCCCCCA GTGGGTCCTC AGGCGTCTGG AGGGAGCCCA GCATCCATCA 1800 

CGCCACAGCT GCCTCTGGCC CCCAGGAGCC TGCAGGTGGG GACTCCTCCG TGGCCACACA 1860 

AAACAACGAT GGCACTGACG GGCAAAAGCT GCTTTTATTC TACATCCTAG GCACCGTGGT 1920 

GGCCATCCTA CTCCTGCTGG CCCTGGCTCT GGGGCTACTG GTCTATCGCA AGCGGAGAGC 1980 

GAAGAGGGAG GAGAAGAAGG AGAAGAAGCC CCAGAATGCG GCAGACAGTT ACTCCTGGGT 2040 

TCCAGAGCGA GCTGAGAGCA GGGCCATGGA GAACCAGTAC AGTCCGACAC CTGGGACAGA 2100 

CTGCTGAAAG TGAGGTGGCC CTAGAGACAC TAGAGTCACC AGCCACCATC CTCAGAGCTT 2160 

TGAACTCCCC ATTCCAAAGG GGCACCCACA TTTTTTTGAA AGACTGGACT GGAATCTTAG 2220 

CAAACAATTG TAAGTCTCCT CCTTAAAGGC CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

TGTTTGATGT TCCTGAAGTG GAAGCTGTGT GTTGGCGTGC CACGGTGGGQ ATTTCGTGAC 2340 

TCTATAATGA TTGTTACTCC CCCTCCCTTT TCAAATTCCA ATGTGACCAA TTCCGGATCA 2400 

GGGTGTGAGG AGGCTGGGGC TAAGGGGCTC CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

ATCTAAGAGG AAAAGGTGAG TTGCTCATGC TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520 

CCTAGGATGA AAACTAAATC AATTAATTAT TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TCAAAGGGAA CATGTTCGGA CTGGAAACAT TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

AGCACAAGTC TTGCTAAATG TGATACTGTT GACATCCTCC AGAATGGCCA GAAGTGCAAT 2700 

TAACCTCTTA GGTGGCAAGG AGGCAGGAAG TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

CAGGTGTTTG TGAAGTCACA TAATCTACGG GGCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880 

CACAGATACT TGAATTAATT CATCCAAATG TACTGAGGTT ACCACACACT TGACTACGGA 2940 

TGTGATCAAC ACTAACAAGG AAACAAATTC AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3000 

CCTCAGACAC CCTGCCTGTG GCCCCGOCTC CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

TGAACGGGAG ATGATGCACT GTGTTTTGAA AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

TCATAGTCCA CAGTTGATGC AGCATCCTGA GATTTTAAAT CCTGAAGTGT GGGTGGCGCA 3240 

CACACCAAGT AGGGAGCTAG TCAGGCAGTT TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360 

TTTTTACAGC AAAAACTGCT CAAAGCCATT TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TTGCAAATAT TTCTCCCTAT GATAATGCAG TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3480 

TCTCTCTCAC ACACACACAC ACACACACAC ACACACACAC AGAGACACGG CACCATTCTG 3540 

CCTGGGGCAC TGGAACACAT TCCTGGGGGT CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

TGAGTATCTC TGGGAGGCCT CATGTCTCCT GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

ACAGACAGAG GAAATGTGTC TCCCTCCAAG GCCCCAAAGC CTCAGAGAAA GGGTGTTTCT 3720 

GGTTTTGCCT TAGCAATGCA TCGGTCTCTG AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780 

CAAGGTGCAG GGTTAATACT CTTGCCAGTT TTGAAATATA GATGCTATGG TTCAGATTGT 3840 

TTTTAATAGA AAACTAAAGG GGCAGGGGAA GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

TCGATGGGGC ATTTGGAACT TCTTTTTAAA GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AACTCTGGTG TTTAACACTT AAGGGAGACA AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

GGCCACGAGA CTCTAGGTGA TGTGTGAAGC TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080 

CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CACTTAAATA AATGCAAATG CAACATTTCT CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TCATTTGGGG TGAAGGAGAC ATTTCTGTCC TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

GTATGATTCC TGGGATCCAA CGAGCCCTCC TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

GCCCAGGCCC ATCGTCTGTT CTCTGAATGC AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380 
GAACCCCTCT GTGGAACCCA CAAGGGGAGA AATGGGTGAT AAAGAATCCA GTTCCTCAAA 4440 

ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 
CCCACCACGG GGGGAGAGCC AGCAACCCAA CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 
AACCACTGGG CTCAAACACG TGCTTTATTC TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 
ATGGAAATTC TTGTTTGGGG GATCTTGGGG CTACAGTAGT GGGTAAACAA ATGCCCACCG 4680 
GCCAAGAGGC CATTAACAAA TCGTCCTTGT CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 
CACAGTGGGG AATCCAAGGG TCACAGTATG GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4800 



TCTOGCTAGA CACAGTGTTT CTGCCCAGGT GACCTGTTCA
CATGGGGACG GGGGAAGTTT TCACTTGGAG ATGGACACCA 
CCAAATAGGT CAATAATTCT GGGAGACTCT TGGAAAAAAC 
TCTCTCCCTC CCCTCATCCC ACATCTCAAA GCAGACAATG 
CACCCAGCTC GCCATGCCTA CTCATTCCTG AATTTCAGGT 
CTTCTTTGTC ATTTGAGAAA GGATGCAGGA GGACAATTCC 
CAGAAAAACC AGGGCAGGAC AGTTATCGAC AATGCATTAG 
TAGAGGGACT CCACCCCTGC TCAACAGCTT GGCTTCCAGG 
CTCTGCCTTC GGTGGCCCAC ACACCTAAGC GTCATCGTCA 
AACACATCTA CGTGTAGCAC TACGACGTTA TGTTTGGGTA 
AGGCTCTGAT TAAGGATGTG GGGAAGTGGG CTGCGGTCAC 
CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA GGAGCAAGGC 
TGCCATCTTC CCTGCGATCA GGCAAAAAAG TGGAATTAAA 
TGTGTTATGT CCATTTTGCA GGATGAACTG AGTTTAAAAG 
TTGCTTTGTC TTTTCCATCC TCATCACAAG CCCTTGTTTG 
TCTTTCGATG GATGGAGATG ATCATTAGGT ACTTTTGTTT 
ATTTCTGTGA AAACTAGGAG AACAGAGATG AGATTTGACA 
TAACACAGTC TTTTTAAAAC TAACATAGGA AAGCCTTTCC 
TCTCCATTGT CTAAATCAGG AAAACAGGAA AACACAGCTT 
TTAATGCCCC CTACATATTT CCATCACCTT GAACAATAGC 
TATGATCCCA GAAAACATCT GTCTCTACTT CGGCTGCAAA 
TGGTTTGTGC ATTTTCTCAA CTAAAAATAG AGATGATAAT 
CTAATCAAAG ACACTATTTT CATACTAGAT TCCTGAGACA 
TTTAAAAATA AATTGTGTTT TGGTCTGTTC TTGTAGATAA 
AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG CTCTTATCTG 
TTCAGCAGAT TTTGCCCACT ATTCCTCTGA GCTGAAGTTC 
GCTTGAATTA GATCCCTGCA AAGGCTTGCT CTGTGATGTC 
GTAATCACTT CATGAATGCT AAATGAGAAT GTAAGTATTT 
TTTGTTTGAC TAATTCTGGA ATTACAAGAT TTCTATGCAG 
ATGTTTCCCA AACTGTGAGG AGGGAAGGCT CAGAGATCGA 
CAAAATGGTG CTTTGAGGGT CAGCCTTTAG GAAGGTGCAG 
TCTGTTATGT GCCTATCCTA ATAAACTCTT AAACACATT 

Seq ID NO: 345 Protein sequence
Protein Accession #: NP_036204 






MATSMGLLLL 
ATVKSKEEAQ 
EDTPYSNWHK 
KFSFKGMCRP 
KEKAPDVFDW 
ASRNPCSSSP 
TPGGFRCECW 
DGTQCQDVDB 



SGVWREPSIH 
LGLLVYRKRR 



11 

i 

LLLLLTQPGA 
HVQRVLAQLL 
BLRNSCISKR 
LALGGPGQVT 
GSSGPLCVSP 
CRGGATCVLG 
VGYEPGGPGB 
CVGPGGPLCD 
GSTVPRAATA 
HATAASGPQE 
AKREBKKEKK 



21 
I 

GTGADTEAW 
RREAALTARM 
CVSLLLDLSQ 
YTTPFQTTSS 
KYGCNFNNGG 
PHGKNYTCRC 
GACQDVDECA 
SLCFNTQGSF 
SPTRGPEGTP 
PAGGDSSVAT 
PQNAADSYSW 



31 
I 

CVGTACYTAH 
SKFW1GLQRE 
PLLFNRLPKW 
SLEAVPFASA 
CHQDCFEGGD 
PQGYQLDSSQ 
LGRSPCAQGC 
HCGCLPGWVL 
KATPTTSRPS 
QNNDGTDGQK 
VPERAESRAM 



GCAGCAGAAC 
AGACAATGAA 
TGAATATATT 
TAAAGAGAGA 
GCCATCACTG 
CACAGATAAT 
AACTTGGTGA 
CAAGACCAAC 
TTGCCATAGC 
ATGTGGGGAT 

TTCAGGAAGG 
AAGTCAAACC 
AATTTTTTTT 
AGTGTCTTAT 
CAACCTTTAT 
AAAAAAAATT 
TATTATTTCT 
TCTAGCAGCT 
TTTAGCTTGG 
ACCCATGGTT 
CCGAATTCTC 
AATACTCACT 
TGCCCTTCTA 
CAAGGTGGCA 
TTTGCATAGA 
AGATGTAATT 
TTAAATGTGT 
GATTTACCTT 
GCTTCTCCTC 
CTTTGTTGTC 



41 

I 

SGKLSAAEAQ 
KGKCLDPSLP 
SEGPCGSPGS 
ANVACGEGDK 
GSFLCGCRPG 
LDCVDVDECQ 
TNTDGSFHCS 
APNGVSCTMG 
LSSDAPITSA 
LLLFYILGTV 
ENQYSPTPGT 



AAGCCAGGGC 
GATTTGTTGT 
CAGGACCAAC 
ACATCTCACA 
CTCTTTCTTT 
CTGAGGAATG 
GCATCCTCTG 
CACATCTGGT 
ATCATGATGC 
GAACTGCATG 
GCAAGGCCAC 
GCCAGCCACA 
TTTATATGCA 
TCTCTTCAAG 
CCCTGAGCAA 
TCCTGTAAAT 
GAATTAAAAA 
CTTCTTAGCT 
GCAAAATGGT 
GAATCTGAGA 
TAAATCTATA 
CATATATTCA 
GAAGGGCTTG 
TTTTAGGTAG 
AGCAGTTCTT 
TTTGGCTTAA 
GTAAATGTCA 
GTATTTCAAA 
CATCCTGTGC 
TGAGTTCTAA 
CTTTGAGCTT 



51 
I 

NHCNQNGGNL 
LKGFSWVGGG 
PGSNIEGFVC 
DETQSHYFLC 
FRIiLDDLVTC 
DSPCAQECVN 
CEEGYVLAGE 
PVSLGPPSGP 
PLKMLAPSGS 
VAILLLLALA 
DC 



Seq ID NO: 346 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: <1-966



CACAGCGCCC 
ACTTCGGGGG 
AGCCCGGACC 
CGCAAGATGG 
GCCGAGTGGA 
CTGCGAGCGC 
AAGACGCTCA 
AATAGCATGG 
ATGGACAGTT 
CAGCTGGGCT 
ATGCACCGCT 
ATGAACGGCT 
CTTGGCTCCA 
TCTTCCTCCC 
TATCTCCCCG 
CACTACCAGA 
ATGTGAGGGC 
TGGGAGGGGT 
AAAAA 



11 
I 

GCATGTACAA 
GCGGCGGCGG 
GCGTCAAGCG 
CCCAGGAGAA 
AACTTTTGTC 
TGCACATGAA 
TGAAGAAGGA 
CGAGCGGGGT 
ACGCGCACAT 
ACCCGCAGCA 
ACGACGTGAG 
CGCCCACCTA 
TGGGTTCGGT 
ACTCCAGGGC 
GCGCCGAGGT 
GCGGCCCGGT 
CGGACAGCGA 
GCAAAAGAGG 



21 
I 

CATGATGGAG 
CAACTCCACC 
GCCCATGAAT 
CCCCAAGATG 
GGAGACGGAG 
GGAGCACCCG 
TAAGTACACG 
CGGGGTGGGC 
GAACGGCTGG 
CCCGGGCCTC 
CGCCCTGCAG 
CAGCATGTCC 
GGTCAAGTCC 
GCCCTOCCAG 
GCCGGAACCC 
GCCCGGCACG 
ACTGGAGGGG 
AGAGTAAGAA 



31 
1 

ACGGAGCTGA 
GCGGCGGCGG 
GCCTTCATGG 
CACAACTCGG 
AAGCGGCCGT 
GATTATAAAT 
CTGCCCGGCG 

AGCAACGGCA 
AATGCGCACG 
TACAACTCCA 
TACTCGCAGC 
GAGGCCAGCT 
GCCGGGGACC 
GCCGCCCCCA 
GCCATTAACG 
GGAGAAATTT 
ACAGCATGGA 



41 
I 

AGCCGCCGGG 
CCGGCGGCAA 
TGTGGTCCCG 
AGATCAGCAA 
TCATCGACGA 
ACCGGCCCCG 
■GGCTGCTGGC 
GCGCGGGCGT 
GCTACAGCAT 
GCGCAGCGCA 
TGACCAGCTC 
AGGGCACCCC 
CCAGCCCCCC 
TCCGGGACAT 
GCAGACTTCA 
GCACACTGCC 
TCAAAGAAAA 
GAAAACCCGG 



Seq ID NO: 347 Protein sequence 
Protein Accession #: CAA83435



1 11 . 

I 1 
HSARMYNMME TELKPPGPQQ 
RKMAQENPKM HNSEISKRLG 
KTLMKKDKYT LPGGLLAPGG 
QLGYPQHPGL NAHGAAQMQP 
LGSMGSWKS EASSSPPWT 
HYQSGFVPGT AINGTLPLSH 



4860 
4920 
4980
5040 
5100 
5160 
5220 
5280
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



51 
I 

CCCGCAGCAA 
CCAGAAAAAC 
CGGGCAGCGG 
GCGCCTGGGC 
GGCTAAGCGG 
GCGGAAAACC 
CCCCGGCGGC 
GAACCAGCGC 
GATGCAGGAC 
GATGCAGCCC 
GCAGACCTAC 
TGGCATGGCT 
TGTGGTTACC 
GATCAGCATG 
CATGTCCCAG 
CCTCTCACAC 
ACGAGGGAAA 
TACGCTCAAA 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780
840 
900 
960 
1020 
1080 



21 31 41 51 

I I I I 

TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 
AEWKLLSETE KRPFIDEAKR LRALHMKBHP DYKYRPRRKT 
NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 
MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 
SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 
M 



60 
120 
180 
240 
300 






Seq ID NO: 348 DNA sequence 
Nucleotide Accession #: NM_002638
Coding sequence: 120-473 

1 11 21 31 41 51 

I I I I I I 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 

GCTGGACTGC ATAAAGATTO GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 

TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180

AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 

TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 

CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 

TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 

TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480 

CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 

TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

Seq ID NO: 349 Protein sequence: 
Protein Accession #: NP_002629

1 11 21 31 41 51

I I I I I I

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60

AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 



Seq ID NO: 350 DNA sequence 
Nucleic Acid Accession #: NM_007183 
Coding sequence: 75-2468 

1 11 21 31 41 51 

I I I I I I 

GAATTCCGGA CAGGACGTGA AGATAGTTGG GTTTGGAGGC GGCCGCCAGG CCCAGGCCCG 60 

GTGGACCTGC CGCCATGCAG GACGGTAACT TCCTGCTGTC GGCCCTGCAG CCTGAGGCCG 120 

GCGTGTGCTC CCTGGGGCTG CCCTCTGACC TGCAGCTGGA CCGCCGGGGC GCCGAGGGGC 180 

CGGAGGCCGA GCGGCTGCGG GCAGCCCGCG TCCAGGAGCA GGTCCGCGCC CGCCTCTTGC 240

AGCTGGGACA GCAGCCGCGG CACAACGGGG CCGCTGAGCC CGAGCCTGAG GCCGAGACTG 300 

CCAGAGGCAC ATCCAGGGGG CAGTACCACA CCCTGCAGGC TGGCTTCAGC TCTCGCTCTC 360 

AGGGCCTGAG TGGGGACAAG ACCTCGGGCT TCCGGCCCAT CGCCAAGCCG GCCTACAGCC 420 

CAGCCTCCTG GTCCTCCCGC TCCGCCGTGG ATCTGAGCTG CAGTCGGAGG CTGAGTTCAG 480 

CCCACAATGG GGGCAGCGCC TTTGGGGCOG CTGGGTACGG GGGTGCCCAG CCCACCCCTC 540 

CCATGCCCAC CAGGCCCGTG TCCTTCCATG AGCGCGGTGG GGTTGGGAGC CGGGCCGACT 600 

ATGACACACT CTCCCTGCGC TCGCTGCGGC TGGGGCCCGG GGGCCTGGAC GACCGCTACA 660 

GCCTGGTGTC TGAGCAGCTG GAGCCCGCGG CCACCTCCAC CTACAGGGCC TTTGCGTACG 720 

AGCGCCAGGC CAGCTCCAGC TCCAGCCGGG CAGGGGGGCT GGACTGGCCC GAGGCCACTG 780

AGGTTTCCCC GAGCCGGACC ATCCGTGCCC CTGCCGTGCG GACCCTGCAG CGATTCCAGA 840 

GCAGCCACCG GAGCCGCGGG GTAGGCGGGG CAGTGCCGGG GGCCGTCCTG GAGCCAGTGG 900 

CTCGAGCGCC ATCTGTGCGC AGCCTCAGCC TCAGCCTGGC TGACTCGGGC CACCTGCCGG 960 

ACGTGCATGG GTTCAACAGC TACGGTAGCC ACCGAACCCT GCAGAGACTC AGCAGCGGTT 1020 

TTGATGACAT TGACCTGCCC TCAGCAGTCA AGTACCTCAT GGCTTCAGAC CCCAACCTGC 1080 

AGGTGCTGGG AGCGGCCTAC ATCCAGCACA AGTGCTACAG CGATGCAGCC GCCAAGAAGC 1140

AGGCCCGCAG CCTTCAGGCC GTGCCTAGGC TGGTGAAGCT CTTCAACCAC GCCAACCAGG 1200 

AAGTGCAGCG CCATGCCACA GGTGCCATGC GCAACCTCAT CTACGACAAC GCTGACAACA 1260 

AGCTGGCCCT GGTGGAGGAG AACGGGATCT TCGAGCTGCT GCGGACACTG CGGGAGCAGG 1320

ATGATGAGCT TCGCAAAAAT GTCACAGGGA TCCTGTGGAA CCTTTCATCC AGCGACCACC 1380 

TGAAGGACCG CCTGGCCAGA GACACGCTGG AGCAGCTCAC GGACCTGGTG TTGAGCCCCC 1440 

TGTCGGGGGC TGGGGGTCCC CCCCTCATCC AGCAGAACGC CTCGGAGGCG GAGATCTTCT 1500 

ACAACGCCAC CGGCTTCCTC AGGAACCTCA GCTCAGCCTC TCAGGCCACT CGCCAGAAGA 1560 

TGCGGGAGTG CCACGGGCTG GTGGACGCCC TGGTCACCTC TATCAACCAC GCCCTGGACG 1620 

CGGGCAAATG CGAGGACAAG AGCGTGGAGA ACGCGGTGTG CGTCCTGCGG AACCTGTCCT 1680 

ACCGCCTCTA CGACGAGATG CCGCCGTCCG CGCTGCAGCG GCTGGAGGGT CGCGGCCGCA 1740

GGGACCTGGC GGGGGCGCCG CCGGGAGAGG TCGTGGGCTG CTTCACGCCG CAGAGCCGGC 1800 

GGCTGCGCGA GCTGCCCCTC GCCGCCGATG CGCTCACCTT CGCGGAGGTG TCCAAGGACC 1860 

CCAAGGGCCT CGAGTGGCTG TGGAGCCCCC AGATCGTGGG GCTGTACAAC CGGCTGCTGC 1920 

AGCGCTGCGA GCTCAACCGG CACACGACGG AGGCGGCCGC CGGGGCGCTG CAGAACATCA 1980

CGGCAGGCGA CCGCAGGTGG GCGGGGGTGC TGAGCCGCCT GGCCCTGGAG CAGGAGCGTA 2040 

TTCTGAACCC CCTGCTAGAC CGTGTCAGGA CCGCCGACCA CCACCAGCTG CGCTCACTGA 2100 

CTGGCCTCAT CCGAAACCTG TCTCGGAACG CTAGGAACAA GGACGAGATG TCCACGAAGG 2160 

TGGTGAGCCA CCTGATCGAG AAGCTGCCAG GCAGCGTGGG TGAGAAGTCG CCCCCAGCCG 2220 

AGGTGCTGGT CAACATCATA GCTGTGCTCA ACAACCTGGT GGTGGCCAGC CCCATCGCTG 2280 

CCCGAGACCT GCTGTATTTT GACGGACTCC GAAAGCTCAT CTTCATCAAG AAGAAGCGGG 2340 

ACAGCCCCGA CAGTGAGAAG TCCTCCCGGG CAGCATCCAG CCTCCTGGCC AACCTGTGGC 2400

AGTACAACAA GCTCCACCGT GACTTTCGGG CGAAGGGCTA TCGGAAGGAG GACTTCCTGG 2460 

GCCCATAGGT GAAGCCTTCT GGAGGAGAAG GTGACGTGGC CCAGCGTCCA AGGGACAGAC 2520 

TCAGCTCCAG GCTGCTTGGC AGCCCAGCCT GGAGGAGAAG GCTAATGACG GAGGGGCCCC 2580 

TCGCTGGGGC CCCTGTGTGC ATCTTTGAGG GTCCTGGGCC ACCAGGAGGG GCAGGGTCTT 2640 

ATAGCTGGGG ACTTGGCTTC CGCAGGGCAG GGGGTGGGGC AGGGCTCAAG GCTGCTCTGG 2700 

TGTATGGGGT GGTGACCCAG TCACATTGGC AGAGGTGGGG GTTGGCTGTG GCCTGGCAGT 2760 

ATCTTGGGAT AGCCAGCACT GGGAATAAAG ATGGCCATGA ACAGTCACAA AAAAAAAAAA 2820 
AAAAGGAATT C 

Seq ID NO: 351 Protein sequence 
Protein Accession #: NP_009114.1

1 11 21 31 41 51
I I I I I I

MQDGNFLLSA LQPEAGVCSL ALPSDLQLDR RGAEGPEAER LRAARVQEQV RARLLQLGQQ
PRHNGAAEPE PEAETARGTS RGQYHTLQAG FSSRSQGLSG DKTSGFRPIA KPAYSPASWS
SRSAVDLSCS RRLSSAHNGG SAFGAAGYGG AQPTPPMPTR PVSFHERGGV GSRADYDTLS
LRSLRLGPGG LDDRYSLVSE QLEPAATSTY RAFAYERQAS SSSSRAGGLD WPEATEVSPS
RTIRAPAVRT LQRFQSSHRS RGVGGAVPGA VLEPVARAPS VRSLSLSLAD SGHLPDVHGF
NSYGSHRTLQ RLSSGFDDID LPSAVKYLMA SDPNLQVLGA AYIQHKCYSD AAAKKQARSL
QAVPRLVKLF NHANQEVQRH ATGAMRNLIY DNADNKLALV EENGIFELLR TLREQDDELR
KNVTGILWNL SSSDHLKDRL ARDTLEQLTD LVLSPLSGAG GPPLIQQNAS EAEIFYNATG
FLRNLSSASQ ATRQKMRECH GLVDALVTSI NHALDAGKCE DKSVENAVCV LRNLSYRLYD
EMPPSALQRL EGRGRRDLAG APPGEVVGCF TPQSRRLREL PLAADALTFA EVSKDPKGLE
WLWSPQIVGL YNRLLQRCEL NRHTTEAAAG ALQNITAGDR RWAGVLSRLA LEQERILNPL
LDRVRTADHH QLRSLTGLIR NLSRNARNKD EMSTKVVSHL IEKLPGSVGE KSPPAEVLVN
IIAVLNNLVV ASPIAARDLL YFDGLRKLIF IKKKRDSPDS EKSSRAASSL LANLWQYNKL
HRDFRAKGYR KEDFLGP

Seq ID NO: 352 DNA sequence
Nucleic Acid Accession #: M31469
Coding sequence: 1-651



1 11 21 31 41 51
I I I I I I

ATGGCTGCGC AGGGAGAGCC CCAGGTCCAG TTCAAACTTG TATTGGTTGG TGATGGTGGT
ACTGGAAAAA CGACCTTCGT GAAACGTCAT TTGACTGGTG AATTTGAGAA GAAGTATGTA
GCCACCTTGG GTGTTGAGGT TCATCCCCTA GTGTTCCACA CCAACAGAGG ACCTATTAAG
TTCAATGTAT GGGACACAGC CGGCCAGGAG AAATTCGGTG GACTGAGAGA TGGCTATTAT
ATCCAAGCCC AGTGTGCCAT CATAATGTTT GATGTAACAT CGAGAGTTAC TTACAAGAAT
GTGCCTAACT GGCATAGAGA TCTGGTACGA GTGTGTGAAA ACATCCCCAT TGTGTTGTGT
GGCAACAAAG TGGATATTAA GGACAGGAAA GTGAAGGCGA AATCCATTGT CTTCCACCGA
AAGAAGAATC TTCAGTACTA CGACATTTCT GCCAAAAGTA ACTACAACTT TGAAAAGCCC
TTCCTCTGGC TTGCTAGGAA GCTCATTGGA GACCCTAACT TGGAATTTGT TGCCATGCCT
GCTCTCGCCC CACCAGAAGT TGTCATGGAC CCAGCTTTGG CAGCACAGTA TGAGCACGAC
TTAGAGGTTG CTCAGACAAC            GATGAGGATG ATGACCTGTG A



Seq ID NO: 353 Protein sequence 
Protein Accession #: AAA36546 



1 11 21 31 41 51 

I I I I I I

MAAQGEPQVQ FKLVLVGDGG TGKTTFVKRH LTGEFEKKYV ATLGVEVHPL VFHTNRGPIK
FNVWDTAGQE KFGGLRDGYY IQAQCAIIMF DVTSRVTYKN VPNWHRDLVR VCENIPIVLC
GNKVDIKDRK VKAKSIVFHR KKNLQYYDIS AKSNYNFEKP FLWLARKLIG DPNLEFVAMP
ALAPPEVVMD PALAAQYEHD LEVAQTTALP DEDDDL



Seq ID NO: 354 DNA sequence 
Nucleic Acid Accession #: NM_002820
Coding sequence: 304-831 



1 11 21 31 41 51
I I I I I I

CCGGTTCGCA AAGAAGCTGA CTTCAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG
CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA
CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT
TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT TTTTCCCTTT TTGCTCTTTC TGGCTGTGTG
GTTTGGAGAA AGCACAGTTG GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG
ACGATGCAGC GGAGACTGGT TCAGCAGTGG AGCGTCGCGG TGTTCCTGCT GAGCTACGCG
GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT CTCAGCCGCC GCCTCAAAAG AGCTGTGTCT
GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG ATTTACGGCG ACGATTCTTC
CTTCACCATC TGATCGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC GGAGGTGTCC
CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGATTTGG GTCTGATGAT
GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC
AAGACACCTG GGAAGAAAAA GAAAGGCAAG CCCGGGAAAC GCAAGGAGCA GGAAAAGAAA
AAACGGCGAA CTCGCTCTGC CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG GCTAGAAGGG
GACCACCTGT CTGACACCTC CACAACGTCG CTGGAGCTCG ATTCACGGTA ACAGGCTTCT
CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT GGAGCCTCCC TTCTGCCTTG
GCTTGGACAA ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG TAGCAATTGA
CAGAGAATAA CTCAGAATAT TGTCTGCCTT AAAGCAGTAC CCCCCTACCA CACACACCCC
TGTCCTCCAG CACCATAGAG AGGCGCTAGA GCCCATTCCT CTTTCTCCAC CGTCACCCAA
CATCAATCCT TTACCACTCT ACCAAATAAT TTCATATTCA AGCTTCAGAA GCTAGTGACC
ATCTTCATAA TTTGCTGGAG AAGTGTATTT CTTCCCCTTA CTCTCACACC TGGGCAAACT
TTCTTCAGTG TTTTTCATTT CTTACGTTCT TTCACTTCAA GGGAGAATAT AGAAGCATTT
GATATTATCT ACAAACACTG CAGAACAGCA TCATGTCATA AACGATTCTG AGCCATTCAC
ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT TTATTTTAAT GTAAAGAACT
TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT AAATTTAACT CTGGTTTCTA
CCAGCTCATA CAAAATAAAT GGTTTCTGAA AATGTTTAAG TATTAACTTA CAAGGATATA
GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG
CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA



Seq ID NO: 355 Protein sequence
Protein Accession #: NM_002820

1 11 21 31 



60
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380
1440 
1500 
1560 



51 



MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 



60 
120 




TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 

Seq ID NO: 356 DNA sequence 
Nucleic Acid Accession #: NM_017522
Coding sequence: 1-2100 






ATGGGCCTCC 
CTGCTGCTGC 
CAA6GGCCGG 
CCCTCTQTGT 
TGCCCCAAGA 
GAACGGTG6A 
ACTTGCACCA 
TGTGTACCTG 
GCCGGCTGTG 
ACATGTGTCC 
GAAGCTGGCT 
ATCTGCACTG 
GACCAGAAGA 
TGTGTCAATT 
CTGACCAAGA 
ACGAGTGCGG 
AATGTCGTGG 
TACCGTAAGA 
CTCATTGACG 
ATCTACTGGA 
CGACGCACTC 
CGAGGGTTCA 
AACGGTGTGG 
CTGGATCTGC 
ATTGACTTCA 
CCTTTTGGGA 
ATTTTCAGTG 
AACCCACATG 
GAGCTGAGTG 
TCCAGCCACT 
ATGAAGAGGT 
GCTGTTATCG 
CTGATCTGGA 
TACAGGAAAA 
ATTGGCCATG 
GGATGGGATC 
GTGTATGACT 
GATTTTTTTT 
ACATCCAAAG 
GCACTACCCA 
AATGGGGGCC 
TTTGGTTGCT 
ATGCTTTGTG 



11 

1 

CCGAGCCGGG 
TGCGGCTCCA 
CCAAGGAGTG 
GGAGATGOGA 
AGACCTGTGC 
AGTGTGACGG 
AGCAGGTGTG 
CCTCGTGGCG 
CTACCTCACT . 
TTGCAATCAA 
GCCTACAGGG 
ACCTCAAGAT 
CTTGTGGCGA 
ACAAGGGCTA 
ACTGCAAGGC 
AGGATCGACC 
CACTAGATGT 
TCTATAGCGC 
AGCAGTTGCA 
CTGACTCGGG 
TCTTCAGCCG 
TGTATTGGTC 
ACCGGCAAAC 
TGAGCCAGCG 
GTGGAGGCAA 
TAGCTGTGTT 
CAAATGGGCT 
ACATTGTCAT 
TCCAGCCTAA 
CTCCCAAGTA 
GCTACCGAGA 
GGATCATCGT 
GAAACTGGAA 
CAACAGAAGA 
TCTATCCTGC 
ACCCCCTTCG 
GGATGAATGG 
TTTAAATTTA 
GATGTGAGAG 
TGAGGAATTC 
AATGGCACAG 
GGGGGGCTTT 
GCTATCCATC 



21 
I 

CCCTCTCCGG 
GCATCTTGCG 
CGAAAAGGAC 
CGAGGACGAT 
AGACAGTGAC 
CGAGGAGGAG 
TCCTGCAGAG 
CTGCGACGGG 
GGGCACCTGC 
GCACTGCAAC 
GCTGAACGAG 
TGGCTTTGAA 
CATTGATGAG 
TTTTAAGTGT 
TGCTGCTGGC 
TGTGAAGCGG 
GGAAGTTGCC 
CTACATGGAC 
CTCTCCAGAG 
CAATAAGACC 
TAACCTCAGT 
TGACTGGGGG 
ACTGGTGTCA 
CTTGTACTGG 
CAGAAAGACG 
TGAGGACAAG 
CAATGGCCTG 
CTTCCATGAG 
TGGAGGCTGT 
CACATGTGCC 
TGCAAATGAA 
GCCCATAGTG 
GCGGAAGAAC 
AGAAGATGAA 
ACGAGTGGCA 
TGCCTCATGG 
GTTTCTATAT 
TGTTGCGGAA 
TTTTTCTATG 
GTGGAATGGC 
TACCTTACTC 
TTTAGGTTTT 
AACATAAGT 



GAAATCTCCA 
CTGAAGCAGC 
GAATACCTGT 
TGTCCTGACA 
GACAG7AAGA 
GTGATAGCCC 
ACCAAAAGCA 
GATGAGCTCC 
TTAAGCCTTG 
AATTCAGTCC 
ATGGGTCTGT 
AGGTAACCAC 
TATAATGTTT 
TACTGCTGAC 
ATCATTTAAA 
GGGCATTTGT 



Seq ID NO: 357 Protein sequence 
Protein Accession #: NP_059992



MGLPBPGPLR 
PSVWRCDEDD 
TCTKQVCPAE 
TCVLAIKHCN 
DQKTCGDIDE 
TSAEDRPVKR 
LIDEQLHSPE 
RGFMYWSDWG 
IDPSGGNRKT 
NPHDIVIFHE 
MKRCYRDANE 
YRKTTEEEDE 



11 

I 

XiLALLLLLLL 
DCLDHSDEDD 
KLSCGPTSHK 
QEQDCPDGSD 
CKDPDACSQI 
NYSRLIPMLK 
GLAVDWVHKH 
DQAKIEKSGL 
LISSTDFLSH 
LKQPRAPDAC 
DSKMGSTVTA 
DELHI GRTAQ 



21 
I 

LLLLRLOHLA 
CPKKTCADSD 
CVPASWRCDG 
EAGCLQGLNE 
CVNYKGYFKC 
NWALDVEVA 
IYWTDSGNKT 
NGVDRQTLVS 
PFGIAVFEDK 
ELSVQPNGGC 
AVIGIIVPIV 
IGHVYPARVA 



31 
I 

AAAADPLLGG 
PTCDNGHCIB 
BKDCEGGADE 
CLHNNGGCSH 
ECYPGCEMDL 
TNRIYWCDLS 
ISVATVDGGR 
DNIEWPNGIT 
VFWTDLENEA 
EYLCLPAPQI 
VIALLCMSGY 
LSLEDDGLP 



41 

I 

TGCTGCTGCT 
CTGATCCGCT 
GCCGGAACGA 
ACCACAGCGA 
ACAACGGCCA 
GCTCCGATGA 
GTGGACCCAC 
GCGAGGGTGG 
AGTTCCAGTG 
ACTGTCCAGA 
ACAATGGCGG 
CAGCAGGCTT 
CAGATGCCTG 
CTGGCTGCGA 
CCCTAATCTT 
GCCTCATCCC 
TCTACTGGTG 
ACCCGAAAGA 
TGGACTGGGT 
CCACAGTTGA 
CCATCGCTGT 
AGATTGAGAA 
AATGGCCCAA 
AGCTACACCA 
CCACTGACTT 
CAGACCTGGA 
TCCTGGCTGA 
CAAGAGCTCC 
GCCTTCCTGC 
CAATGTGGCT 
TGGGCTCAAC 
TCCTGTGCAT 
TGAATTTTGA 
ATATAGGGAG 
AAGATGATGG 
CATGCACTAC 
GTGAGTGTAT 
AAAGTTATGA 
TATACACTTT 
TAACATGATG 
AACTATATTT 
TTTTTGTAAA 



41 

I 

QGPAKECEKD 
ERWKCDGEEE 
AGCATSLGTC 
ICTDLKIGPE 
LTKNCKAAAG 
YRKIYSAYMD 
RRTLFSRNLS 
LDLLSQRLYW 
IFSANRLNGL 
SSHSPKYTCA 
LIWRNWKRKN 



51 
I 

GCTGCTGCTG 
GCTCGGCGGC 
GCGCTGCATC 
CGAGGACGAC 
CTGCATCCAC 
GTCCGAGGCC 
CAGCCACAAG 
AGCGGATGAG 
TGGGGATGGG 
TGGGAGTGAT 
CTGCTCACAC 
CCAGCTCCTG 
CAGCCAGATC 
GATGGACCTA 
CACCAACCGC 
CATGCTCAAG 
TGACCTCTCC 
GCGGGAGGTC 
CCACAAGCAC 
TGGTGGCCGC 
TGACCCCCTG 
ATCTGGGCTC 
CGGAATCACC 
ACTGTCCAGC 
CCTGAGCCAC 
GAACGAGGCC 
GAACCTCAAC 
AGATGCCTGT 
TCCTCAGATC 
GGGTCCAGAC 
AGTCACTGCC 
GAGTGGATAC 
CAACCCAGTC 
AACTGCTCAG 
ACTACCCTGA 
ACTCCGGATG 
GTGTGTGTGT 
TGAACTGCAA 
TTAACTGGTT 
CACATAACCA 
ACAGAAGATG 
TAAGATGATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ID NO: 358 DNA sequence
Nucleic Acid Accession #: M27826
Coding sequence: <1-503



1 
I 

AGCCCAAGAA 
GACGCTGCCC 
CTCTCACAGT 
CGTTGCCTTC 
GCCAAGCTTC 
TTATGCACTC 
TAACCAAATT 
TTCTTCCCAA 
CACAAGACCT 
CACTTGAAGC 
TTCGCCGCTC 



11 
I 

ACATCTCACC 
GATCGCCTCG 
GGAGGGTAAG 
TTTTCAAGGG 
AAAACCCCTG 
TTTTTTAGTT 
ATCTGCTTCC 
TCCAAAGCCT 
CCCTTCAGCT 
AGCCCTGAGA 
CAACACTTCA 



21 
I 

AATTTCAAAT 
GAAGTCCCCT 
TCCATCCCCT 
CCTGTTTCCC 
AAAACTCCCC 
ATCCCCACCT 
CTGACTATTC 
CCTTTGTGTC 
TAATCTCTCC 
AACATCGCCC 
ACACTATTTT 



31 

I 

CTGATCTATT 
GGAC CATCAC 
GTTTAATCGA 
TTGCCCCCAT 
CACTCTGGTG 
GCCCACTTCC 
CTGGAGTACA 
CTCTAACATC 
CACTCTAGGT 
ATTCTCTCTC 
GTTTTATTTG 



41 

I 

CGGCTTAGCG 
AGAAGCCGAG 
TACGGGGGCT 
AACTGTTGTG 
CCAACTTGGA 
CTTATTAGGC 
GCTACATCTC 
CCCACAATAT 
TCCCACGCCG 
CATACCACCC 
TCTTATTAAT 



51 

I 

QFQCRNERCI 

CPDGSDESEA 

RGDEFQCGDG 

CTCPAGFQLL 

KSPSLIPTOR 

KASDPKEREV 

EPRAIAVDPL* 

VDSKLHQLSS 

EISILAENLN 

CPDTMWLGPD 

TKSMNFDNPV . 



51 

I 

ACTGAAGATT 
CTTCGGGTAA 
ACCCACTCCA 
GGTATTGACG 
CAACACTCTT 
CGAAATATTT 
ATTGCTGCCC 
CAGCCCTTAC 
CCCCTAATCC 
CCCAAAAATT 
ATCAGAAGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 






CCCCTGTGAC 


TTGCACGTAT 


720 


AGTAAAAACA 


GCCTTAACTG 


780 


ACTGATCAAT 


GTACTTTGTA 


840 


ACCCTTGAGA 


ATGTACTTTG 


900 


TTGTAATTTT 


TTATTACCTT 


960 


CTGACTCTCT 


TTTOGGACTC 


1020 


AC 








AGGAATGTCA GGCCTCTGAG CCCAGGCCAG GCCATCGCAT 
ACATCCAGAT GGCCTGAAGT AACTGAAGAT CCACAAAAGA 
ATGACATTCC ACCATTGTGA TTTGTTCCTG CCCCACCCTA 
ATCTCCCCCA CCCTTAAGAA GGTTCTTTGT AATTCTCCCC 
TGAGATCCAC CCCTGCCCAC CAGAGAACAA CCCCCTTTGA 
CCCAAATCCT ATAAAACAGC CCCACCCCTA TCTTCCTTCA 
AGCCACCGGC ACCCAGGTGA AATAAACAGC TTTATTGCTC 

Seq ID NO: 359 Protein sequence 
Protein Accession #: AAA65999 



1 11 21 31 41 51 

I I I I I I

PKKHLTNFKS DLFGLATEDW RCPIASEVPW TITEAELRVT LTVEGKSIPC LIDTGATHST 
LPSFQGPVSL APITWGIDG QASKPLKTPP LWCQLGQHSF MHSFLVIPTC PLPLLGRNIL 
TKLSASLTIP GVQLHLIAAL LPNPKPPLCP LTSPQYQPLP QDLPSA 

Seq ID NO: 360 DNA sequence 
Nucleic Acid Accession #: NM_001854
Coding sequence: 162-5582 






i 
I 

AACCATCAAA 
TGTGTAGCAA 
TTOGTGGGTT 
GGTGGAAAAC 
TCTTCCAAGC 
TTCACAATTC 
CTAAAGGCTC 
AACAGTTATT 
CAAAAAAAGG 

CCCCAGAAGA 
TAGCAATCAG 
CGAAACCACT 
GAACAAGGAT 
GTGATCCCAA 
CACCCAAGGC 
TCGAATATGA 
GACCCACTGT 
AAGAATACAA 
GGACAAATGA 
ATTATGATTC 
GCAGGGATTC 
AAGAATATGA 
CAGAAACTGA 
AGAAAGGAGA 
CAGGACCTGC 
ACCCTGGCGA 
GTCCTCCTGG 
CAACCATCTC 
TGAGAGGCCC 
GTTCATCTGG 
AGGGTCCCCC 
GAAGAGGAAT 
GTCTGCCAGG 
CTGGTGATGA 
AAGCTGGCCC 
GTATGGCAGG 
CTGGGCCTCC 
CAATTGGTCC 
GTGCTGATGG 
TGGGTCCCCC 
CAGATGGTGT 
GATTCAAAGG 
GAGGGNAAGA 
CTTCAGGTCA 
GAAGACAAGG 
AAGGTGCACG 
CTCGAGGTTC 
GTGGCGATGG 
TTGGATTCCC 
ACCCTGGGCA 
GAGTGGTTGG 
CTGGTCCTCC 
CAAAGGGTGA 
GTTTCCCAGG 
AAGGTCCCCA 
CAGCTGGCOC 
GAGAGAAAGG 
AAGGTCCTGT 
ACAAGGGTGA 
GCCCTCCCGG 
ATGGTGAACC 
CCAGAGGCTT 
GTGAAAAAGG 



11 
I 

TTTAGAAGAA 

ACATCCCTGG 

GAGTTCACAG 

GAAACGGTGG 

TAGAGAGGTC 

TCCAGAGGGA 

AGATACTGCT 

TCCAGGTGGA 

AATTCAGTCT 

GGTTGGGAGA 

CTATCCCCTC 

CGTGGAGAAG 

TGATAGAAGT 

TTTGGATGAA 

GGCAGCATAT 

TGCTCAAGCT 

CTATGAGTAT 

AACTGAGGAG 

CTATGGAACA 

GCCAAATCCA 

CCAGAGGAAA 

TGATCTTCTG 

AGATAAACCA 

TATTACAGAA , 

ACCAGCAGTG i 

AGGTATTATG i 

TAGGGGCCCC 

TACTATGTTG 

TGCTCAGGAA 

ACCTGGCCCA 

GGCCAAAGGT 

TGGTCCAACG 

GCCAGGAGAA 

TGACAAAGGT 

TGGAATGAGG 

ACGAGGTTTG 

TGTAGATGGC 

AGGTCAACAA 

TCCTGGTGAA 

GCCTCCTGGT 

TGGTCCACAA i 

CAGAGGTCTC 

TGACATGGGT 

TGGCCCTGAA 

AGCAGGAGAA 

TCCAAAGGGT 

GGGAGTAGCT 

AAGAGGTGCA 

CCCTCCTGGC 

TGGACCAAAA i 

ACGTGGGGAG 

ACCACAGGGA 

TGGCCCTCCT 

TCCAGGTCCT 

GGAAAGAGGT 

GGGCC CACC A 

AATTGGTTTA 

TGCTCCTGGA 

TGGTCTCCCA i 

AATTGGTGAG 

TCCCCCAGGT 

AGGTCCTAGA 

CCCTGGACCT 

TGAAAATGGG 



GCCCTCGGGG 
CTGGGAAACC 
AAAGAGGTCC 
GACCACCAGG 
AAGGCAAGAC 
AGACTGGTCC 
GTCTTCCTGG 
CAGGGAAAGA 
CTCAGGGTGC 
GCTCACCAGG 
CGGGACCTCA 
CCCAAGGGCC 
GTCCTGCCGG 
AAGGCAGCAA 
CAGTTGGTGC 
GGATGTTTGG 
TAGGTCTTCA 
CATGGGGGCC 



41 
I 

CCCTCTCCCT 
AGTTGGTCTG 
GATGGAGCCG 
AACCCTCGCA 

tgtactaaAa 
ttgcacaaac 

ACAACTCAGT 
AATACTATTT 
TGAGCATGGT 
AGACCACACT 
TGACGGGAAG 
TGATTGTAAG 
CAATGGAATC 
TCAGCAGTTT 
TCCAGACTGT 
GTATGCACCA 
GGCTGAAAGT 
AAACATCGTT 
AGCTCCTAGG 
AGAATATCTA 
TGAAAACAAA 
ATATGATTTT 
ATTTGGTCCA 
TGCATATGGA 
CGAAGGACCA 
CCCCACTGGA 
AGGGGCTGAT 
TGGTGATGGT 
TCAGCAGGCT 
AGGTCCTGTG 
TCAGGGCCCT 
TCGTCCAGGT 
AGGGTTTGAT 
TCAAGGTCCT 
ACCAAGAGGT 
AGGAGCTCCA 
CATGGGTCCC 
TCTTCCTGGT 
AGGACTTGCT 
GTCTGGAGAA 
CCCCCGGGGA 
GGGTGAAGAT 
AGTTGGTCAA 
CCCAACTGGA 
AGGATTACCA 
TCCAGGTGCC 
TCAGCGTGGT 
TGGGCCAAAG 
TCAAGGACCT 
AAGGATGGGC 
CGGCCCTCCT 
AATAGGGGAA 
TGCTGCAGGA 
TGGACCAGCA 
ACCTGGACTG 
AGAACGTGGG 
GGGTCCTCCT 
TGCAGGGAGA 
CTCCCCTGGG 
GGGTGGCAAG 
CCCTGGAATT 
GCAAAAAGGT 
GGGTCTGCCA 
ACCTGGTCCT 



60 
120 



51 

I 

CCCCAATGGC 
CAGTCGCAAT 
TGGTCCTCTA 
TTGACCTTCC 
GCACTAGATT 
AGAAAGAATT 
GCCCCAACAA 
ACAGTAAAAC 
ATTCAGCAAA 
GGAAAACCTG 
TGGCATCGGG 
AAGAAAACCA 
ACGGTTTTTG 
TTGATCACAG 
GACTCTTCAG 
GAGGATATAA 
GTAACAGAGG 
GATGATTTTC 
CATGTTTCTG 
ACGGGAGAGG 
GAAATAGACG 
TATGAATATA 
GGTGTACCAG 
GAGAAAGGAC 
CCAGGACCAG 
CCCCCTGGTG 
GGTCTACCTG 
TCCAAAGGAC 
CGGATTGCTC 
GGGGGGCCTG 
CGAGGCGTCC 
GCAGATGGAG 
GGACTTCCGG 
CCAGGTCCTC 
CTTCCAGGTG 
GGGCAGCCTG 
CAAGGGGAGC 
CCACAAGGTC 
GGACTTCCTG 
AAGGGGGCTC 
GTAAAGGGAG 
GGTTTTCCAG 
ATTGGCCCAA 
GACCCAGGTC 
GGATATCCAG 
AATGGAGAGA 
CCAACGGGTC 
GGCACTTCAG 
CAGGGTCCAG 
TGCCCAGGAC 
GGGCCAGGGG 
CGTGGGTATC 
AAAGAAGGTG 
GGATTACGTG 
AAAGGAGGGG 
TCAGCAGGTA 
GGTCCAGCTG 
GATGGAGTTC 

GGAGAAAATG 
GCTGGAGGTG 
GATGAGGGTG 
GGCCCACCTG 
CCAGGCCCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 




GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900

CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 

GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 

CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 

ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 

GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 4200

CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 

GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380

AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 

AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560

AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 

ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 

GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 

GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 

CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 

ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 

AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 

GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 

ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 

CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 

AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 

AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 

GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 

GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 

ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 

TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 

ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCTTGGCT 5580 

AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 

TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700

TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AGCTTTGAAA 5760 

ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 

TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 

AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 

TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGGAT GCCGAACTCT 6060 

GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT

Seq ID NO: 361 Protein sequence 
Protein Accession #: NP_001845

1 11 21 31 41 51

I I I I I I

MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 

CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDPS ILFTVKPKKG IQSFLLSIVK 120

EHGIQQIGVE VGRSPVFLPE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180 

DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240

PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300

NIVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 

ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 

AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480

GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540 

GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 

GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660

GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 

GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLXGSKGEK 780 

GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPSGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 

GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 

GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 

GPPGPGGVVG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020

GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 

GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1140 

GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 

GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260

GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 

GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380

GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 

GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 

GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 

MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 

EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680

LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740

EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800
PVCFLG 



Seq ID NO: 362 DNA sequence
Nucleic Acid Accession #: NM_003107
Coding sequence: 351-1775

1 11 21 31 41 51

I I I I I I

TTCCCCAGCA TTCGAGAAAC TCCTCTCTAC TTTAGCACGG TCTCCAGACT CAGCCGAGAG 60 

ACAGCAAACT GCAGCGCGGT GAGAGAGCGA GAGAGAGGGA GAGAGAGACT CTCCAGCCTG 120 

GGAACTATAA CTCCTCTGCG AGAGGCGGAG AACTCCTTCC CCAAATCTTT TGGGGACTTT 180 




TCTCTCTTTA CCCACCTCCG 
CGTCTTCCCG TTCGGCGTGT 
CGGCCGCCGC GAGGGTGTGA 
AAACCAACAA TGCCGAGAAC 
CCGGCCTCGA GCTGGGAATC 
GCAAGGCCGA CGACCCGAGC 
AOGCCTTCAT GGTGTGGTCG 
TGCACAACGC CGAGATCTCC 
ACAAGATCCC TTTCATTCGA 
CCGACTACAA GTACCGGCCC 
CGGCCGCCGC CTCCTCCAAG 
GCGGCCATGG GGGCGGCGGC 
CGAGTGGCGG CGGCGCCAAC 
CGGGCGGCGC GGGCGGTGGG 
GCGGCGGCGG GAAAGCAGCG 
CCGCCGCCCT GCTGCCCCTG 
CTCCCAGCGC CTCGGCCTCC 
CGGGCAAGCA CCTGGCGGAG 
CGTCGTCGTC GCCCGTGGGC 
TGTACGAGGA GGAGGGCGCG 
GCGCCGCCTC GTCCCCCGCC 
TGCGCGCCGC CTCGCCCGCC 
CCCACTCCTC CTCTTCCTCC 
ACCTGCTCGA CCTGAACCCC 
CGTCGTCGGC GCTCGACCGG 
TCGAGTTCCC GGACTACTGC 
AGTCCAGCAT CTCCAACCTG, 
CGGGGGGGGT AGGAGAGGAG 
GAGTTTAAAG AGAAAAGGGA 
TCTCGTCGTC GGATCAAGGA 
TCCCGGGCCG GGGACCCACT 
AGGGGCGACC TGTGATTGTT 
TTCGAGTTTG CTCCCCTTTG 
TTGTCTGCAC CCCCAGCAAG 
CTTCCTGCAT CACCACCTTG 
GAACCCAGCG CACCCCTCCC 
GGTGACGAAT TTGGCCGATG 
GCAGGCGAAT TCCCGTTTGG 
AGCCGGAGGA GGAGATGTTG 
ACCCGAGAAC CCCGTTGGAA 
CGAACTGGAA GGGGGTTCAC 
CGGCGGCTGC TGGGCCTCCG 
GACCCCGGAG GCGTGGAGGA 
GGCGAGTGGT TTCGGAAAAA 






CCCCTGCGAG 
GCTTGGCCCG 
GCGCGCGTGG 
ACGGAAGCGC 
GCCTCCTCCC 
TGGTGCAAGA 
CAGATCGAGC 
AAGCGGCTGG 
GAGGCGGAGC 
AGGAAGAAGG 
CCGGGGGAGA 
GGCGGCGGGA 
TCCAAACCGG 
GTTAGCAAAC 
GCTGCCGCCG 
GGCGCCGCCG 
GCCTCCTCGG 
AAGAAGGTGA 
GGCGTGGGCG 
GGCTGCTCGC 
GCCGGCCGCT 
CCGTCCAGCG 
TCCTCGGGCT 
AGCTCAAACT 
GACCTGGATT 
ACGCCCGAGG 
GTTTTCACCT 
AAAAAAAAAG 
AAAAAGAAAG 
GGGCGGCGGC 
CTGCCCAGCC 
GTTATTGATG 
CTTGAAGAGA 
AAGGCGAGTT 



CCCCTTTTTT 
GCAGATGTTT 
GGCCTTTTTT 
AGGGGAGGAG 
GCGCAGCAGC 
GGTCAAACTG 
CCTTCTTTTC 
GAGGAGACTG 
AAAAAAGAAA 



GAGTTGAGGG 
GGGAACCGGG 
GCGCCCGCCG 
TGCTGGCCGG 
CCACGCCCGG 
CCCCGAGTGG 
GGCGCAAGAT 
GCAAACGCTG 
GGCTGCGCCT 
TGAAGTCCGG 
AGGGAGACAA 
GCAGCAACGC 
CGCAGAAAAA 
CGCACGCCAA 
CCGCCTCCTT 
CCGACCACCA 
CAGCCTCGGC 
AGCGCGTCTA 
CGGGAGCCGA 
CCGACGCGCC 
CGCCCGCCGA 
CGCCCTCGCA 
CCTCGTCCTC 
TTGAGAGCAT 
TTAACTTCGA 
TGAGCGAGAT 
ACTGAAGGGC 
TGAAAAAAAG 
AAAAAGTAAG 
GTTTTGGACC 
GGAGGGACGC 
TTGTTGTTGA 
CCCCCTCCCC 
AGTTTTCTAG 
ATTTTGCTTC 
TAAACGCGTG 
TGGGGGAACG 
TCCTCCCTCT 
GCCAGCCAGT 
GGGAGCTAGG 
AAATGGATTT 
TACGTGAAAT 
TTTGATGTGG 
AAAAGGG 



Seq ID NO: 363 Protein sequence 
Protein Accession #: NP_003098 



1 
I 

MVQQTNNAEN 
RPMNAFMVWS 
ADYPDYKYRP 
GGGASGGGAN 
QAGAAALLPL 
GLGTSSSPVG 
YASLRAASPA 
SFSSSSALDR 



11 

I 

TEALLAGESS 
QIERRKIMEQ 
RKKVKSGNAN 
SKPAQKKSCG 
GAAADHHSLY 
GVGAGADPSD 
PSSAPSHASS 
DLDFNFEPGS 



21 

I 

DSGAGLELGI 
SPDMHNAEIS 
SSSSAAASSK 
SKVAGGAGGG 
KARTPSASAS 
PLGLYEEEGA 
SASSHSSSSS 
GSHFEFPDYC 



31 

I 

ASSPTPGSTA 
KRLGKRWKLL 
PGEKGDKVGG 
VSKPHAKLIL 
ASSAASASAA 
GCSPDAPSLS 
SSGSSSSDDE 
TPEVSEMISG 



GCCAGTTCGG 
AGGGCCCGGC 
AGCCGAGGCC 
CGAGAGCTCG 
CTCCACCGCC 
GCACATCAAG 
CATGGAGCAG 
GAAGCTGCTC 
CAAGCACATG 
CAACGCCAAC 
GGTCGGTGGC 
GGGGGGAGGA 
GAGCTGCGGC 
GCTCATCCTG 
CGCCGCCGAA 
CTCGCTGTAC 
CTCCGCAGCG 
CCTGTTCGGC 
CCCCAGCGAC 
CAGCCTGAGC 
CCACCGCGGC 
CGCGTCCTCC 
CGACGACGAG 
GTCCCTGGGC 
GCCCGGCTCC 
GATCTCGGGA 
GCGCAGGCAG 
AAACGAAAAG 
CAGGGCTCGT 
CGCGCTCCCA 
GGAGGAGGAA 
TGGCAAAAAA 
CTTCCAACGA 
AGACTTGAAG 
TTGGTCAAGA 
ATGAAGACAG 
CCGGGACTGA 
TTTCCCCTTG 
GTGACCGGCG 
GGCGGGGGCG 
GCACGTTGGG 
CAGTGAGGTG 
TACAGGGGCA 



41 
I 

STGGKADDPS 
KDSDKIPFIR 
SGGGGHGGGG 
AGGGGGGKAA 
LAAPGKHLAE 
GRSSAASSPA 
FEDDLLDLNP 
DWLESSISKL 



CCGCCGCGCG 
GATCGCGCGG 
ATGGTGCAGC 
GACTCGGGCG 
TCCACGGGCG 
CGACCCATGA 
TCGCCCGACA 
AAAGACAGCG 
GCTGACTACC 
TCCAGCTCCT 
AGTGGCGGGG 
GGCGGCGGTG 
TCCAAAGTGG 
GCAGGCGGCG 
CAGGCGGGGG 
AAGGCGCGGA 
CTCGCGGCCC 
GGCCTGGGCA 
CCCCTGGGCC 
GGCCGCAGCA 
TACGCCAGCC 
TCGGCCTCGT 
TTCGAAGACG 
AGCTTCAGTT 
GGCTCGCACT 
GACTGGCTCG 
GGAGAAGGGC 
GACAGACGAA 
TCGCCCGCGT 
TCCCCCACCT 
GAGGGTAGAC 
AAAAAGCGAC 
GCTTCCGGAC 
GAGTCTCCCC 
AAGGAGGGGA 
AAGGCTCCGG 
GAGACTCCAC 
CCCCCTCTGC 
d'AGGAAATG 
GAGGAGGACA 
GAGCTGGCGG 
AGACTTCCCA 
GTCAGTGGAG 



51 



WCKTPSGHIK 
EAERLRLKHM 



VFTY 



Seq ID NO: 364 DNA sequence 
Nucleic Acid Accession #: U10860 
Coding sequence: 123-2204 



TGCCGGCTGC 
GGCACCCTCC 
CGATGGCTCT 
GCCACCACCA 
TCATAGACCG 
CAGCATTTGC 
CTGTGTATGC 
TTCTTGGAAT 
AAAAAAGTGT 
TCAGGGGCCT 
TAGCTGATGG 
AATCTAAAAA 
AAGTAATACT 
AGAACAGAGA 
TGGTTTTACT 
TGAACCAAGA 
GCCAGTCTGT 
CTCATTCTTT 
GGAAAAGAAT 
TTGGGGATAC 
AGGAGGTTTT 



11 
I 

TCCTCGACCA 
CGCCCCGTCT 
GTGCAACGGA 
CTATGAAGGA 
AAGAGTGAGG 
TATAAAGGAA 
TGAAGATGCT 
TTGCTATGGT 
CAGAGAAGAT 
TCAGAAGGAA 
ATTCAAGGTT 
GTTATATGGA 
GAAGAATTTC 
ACTTGAGTGT 
CAGTGGTGGA 
ACAAGTCATT 
TGAAGAGGCC 
CTACAATGGA 
TAGCAAAACG 
TTTTGTTAAG 
CCTTGCCCAA 



21 

] 

GGCCTCCTTC 
CGTACTGTCG 
GACTCCAAGC 
GCTGTTGTCA 
GAACTGTTCG 
CAAGGATTCC 
CCCTGGTTTG 
ATGCAGATGA 
GGAGTTTTCA 
GAAGTTGTTT 
GTGGCACGTT 
GCACAGTTCC 
CTTTATGATA 
ATTCGAGAGA 
GTAGACTCAA 
GCTGTGCACA 
CTCAAAAAGC 
ACAACAACCC 
TTAAATATGA 
ATTGCCAATG 
GGTACTTTAC 



31 

I 

TCAACCTCAG 
CCGTCACCGC 
TGGAGAATGC 
TTCTGGATGC 
TGCAGTCTGA 
GTGCTATTAT 
ATCCAGCAAT 
TGAATAAGGT 
ACATTAGTGT 
TGCTTACACA 
CTGGAAACAT 
ACCCTGAAGT 
TAGCTGGATG 
TCAAAGAGAG 
CAGTTTGTAC 
TTGATAATGG 
TTGGAATTCA 
TACCAATATC 
CCACAAGTCC 
AAGTAATTGG 
GGCCTGATCT 



41 

I 

CCCGCGGCGC 
CGCGGCTCCG 
TGGAGGAGAC 
TGGTGCTCAG 
AATTTTCCCC 
CATCTCTGGA 
ATTCACTATT 
ATTTGGAGGT 
GGATAATACA 
TGGAGATAGT 
AGTAGCAGGC 
TGGCCTTACA 
CAGTGGAACC 
AGTAGGCACG 
AGCTTTGCTA 
CTTTATGAGA 
GGTCAAAGTG 
AGATGAAGAT 
TGAAGAGAAA 
AGAAATGAAC 
AATTGAAAGT 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



AAAAASFAAE 
KKVKRVYLFG 
AGRSPADHRG 



60 
120 
180 
240 
300 
360 
420 



51 

I 

CGACCCTTCC 
GCCCTGGCCC 
CTTAAGGATG 
TACGGGAAAG 
TTGGAAACAC 
GGACCTAATT 
GGCAAGCCTG 
ACTGTGCACA 
TGTTCATTAT 
GTAGACAAAG 
ATAGCAAATG 
GAAAATGGAA 
TTCACCGTGC 
TCAAAAGTTT 
AATCGTGCTT 
AAACGAGAAA 
ATAAATGCTG 
AGAACCCCAC 
AGAAAAATCA 
TTGAAACCAG 
GCATCCCTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 




TTGCAAGTGG CAAAGCTGAA CTCATCAAAA CCCATCACAA 
AGTTGAGAGA GGAGGGAAAA GTAATAGAAC CTCTGAAAGA 
GAATTTTGGG CAGAGAACTT GGACTTCCAG AAGAGTTAGT 
GTCCTGGCCT GGCAATCAGA GTAATATGTG CTGAAGAACC 
CTGAAACCAA CAATATTTTG AAAATAGTAG CTGATTTTTC 
ATACCCTATT ACAGAGAGTC AAAGCCTGCA CAACAGAAGA 
AAATTACCAG TCTGCATTCA CTGAATGCCT TCTTGCTGCC 
AGGGTGACTG TCGTTCCTAC AGTTACGTGT GTGGAATCTC 
GGGAATCACT TATTTTTCTG GCTAGGCTTA TACCTCGCAT 
TTGTTTATAT ATTTGGCCCA CCAGTTAAAG AACCTCCTAC 
TGACAACAGG GGTGCTCAGT ACTTTACGCC AAGCTGATTT 
GGGAGTCTGG GTATGCTGGG AAAATCAGCC AGATGCCGGT 
TTGATCGGGA CCCACTTCAA AAGCAGCCTT CATGCCAGAG 
TTATTACTAG TGACTTCATG ACTGGTATAC CTGCAACACC 
AGGTGGTATT AAAGATGGTC ACTGAGATTA AGAAGATTCC 
ATGACTTAAC ATCAAAGCCC CCAGGAACTA CTGAGTGGGA 

Seq ID NO: 365 Protein sequence 
Protein Accession #: AAA60331 






MALCNGDSKL 
AFAIKEQGFR 
KSVREDGVFN 
SKKLYGAQFH 
VLLSGGVDST 
HSFYNGTTTL 
EVFLAQGTLR 
ILGRELGLPE 
TLLQRVKACT 
ESLIFLARLI 
ESGYAGKISQ 
VVLKMVTEIK 



11 
I 

ENAGGDLKDG 
AIIISGGPNS 
ISVDNTCSLF 
PEVGLTENGK 
VCTALLNRAL 
PISDEDRTPR 
PDLIESASLV 
ELVSRHPFPG 
TEEDQEKLMQ 
PRMCHNVNRV 
MPVILTPLHF 
KIPGISRIMY 



21 
I 

HHHYEGAVVI 
VYAEDAPWFD 
RGLQKEEVVL 
VILKNFLYDI 
NQEQVIAVHI 
KRISKTLNMT 
ASGKAELIKT 
PGLAIRVICA 
ITSLHSLNAF 
VYIFGPPVKE 
DRDPLQKQPS 
DLTSKPPGTT 



31 
I 

LDAGAQYGKV 
PAIFTIGKPV 
LTHGDSVDKV 
AGCSGTFTVQ 
DNGFMRKRES 
TSPEEKRKII 
HHNDTELIRK 
EEPYICKDFP 
LLPIKTVGVQ 
PPTDVTPTFL 
CQRSVVIRTF 



TGACACAGAG 
TTTTCATAAA 
TTCCAGGCAT 
TTATATTTGT 
TGCAAGTGTT 
GGATCAGGAG 
AATTAAAACT 
CAGTAAAGAT 
GTGTCACAAC 
AGATGTTACT 
TGAGGCCCAT 
GATTTTGACA 
ATCTGTGGTT 
TGGCAATGAG 
TGGTATTTCT 
GTAATAAACT 



41 

I 

IDRRVRELFV 
LGICYGMQMM 
ADGFKVVARS 
NRELECIREI 
QSVEEALKKL 
GDTFVKIANE 
LREEGKVIEP 
ETNNILKIVA 
GDCRSYSYVC 
TTGVLSTLRQ 
ITSDFMTGIP 



CTCATCAGAA 
GATGAAGTGA 
CCATTTCCAG 
AAGGACTTTC 
AAAAAGCCAC 
AAGCTGATGC 
GTAGGTGTGC 
GAACCTGACT 
GTTAACAGAG 
CCCACTTTCT 
AACATTCTCA 
CCATTACATT 
ATTCGAACCT 
ATCCCTGTAG 
CGAATTATGT 
TC 



51 
I 

QSEIFPLETP 
NKVFGGTVHK 
GNIVAGIANE 
KERVGTSKVL 
GIQVKVINAA 
VIGEMNLKPE 
LKDFHKDEVR 
DFSASVKKPH 
GISSKDEPDW 
ADFEAHNILR 
ATPGNEIPVE 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 






Seq ID NO: 366 DNA sequence 
Nucleic Acid Accession #: NM_004219 
Coding sequence: 46-654 



1 
I 

GCGGCCTCAG 
TATGTTGATA 
CTGGGGTCTG 
TTTGGCAAAA 
ACTGTCAACA 
CCAAGCTTTT 
GCCTCAGATG 
GAGAGTTTTG 
CTCATGATCC 
CCTGTGAAGA 
CTGTCGACCC 
TAGTGCTTCA 
AAAAAAAA 



11 
I 

ATGAATGCGG 
AGGAAAATGG 
GACCTTCAAT 
CGTTCGATGC 
GAGCTACAGA 
CTGCCAAAAA 
ATGCCTATCC 
ACCTGCCTGA 
TTGACGAGGA 
TGCCCTCTCC 
TGGATGTTGA 
GAGTTTGTGT 



21 
I 

CTGTTAAGAC 
AGAACCAGGC 
CAAAGCCTTA 
CCCACCAGCC 
AAAGTCTGTA 
GATGACTGAG 
AGAAATAGAA 
AGAGCACCAG 
GAGAGAGCTT 
ACCATGGGAA 
ATTGCCACCT 
GTATTTGTAT 



31 
I 

CTGCAATAAT 
ACCCGTGTGG 
GATGGGAGAT 
TTACCTAAAG 
AAGACCAAGG 
AAGACTGTTA 
AAATTCTTTC 
ATTGCGCACC 
GAAAAGCTGT 
TCCAATCTGT 
GTTTGCTGTG 
TAATAAAGCA 



41 

I 

CCAGAATGGC 
TTGCTAAGGA 
CTCAAGTTTC 
CTACTAGAAA 
GACCCCTCAA 
AAGCAAAAAG 
CCTTCAATCC 
TCCCCTTGAG 
TTCAGCTGGG 
TGCAGTCTCC 
ACATAGATAT 
TTCTTCAACA 



51 

I 

TACTCTGATC 
TGGGCTGAAG 
AACACCACGT 
GGCTTTGGGA 
ACAAAAACAG 
CTCTGTTCCT 
TCTAGACTTT 
TGGAGTGCCT 
CCCCCCTTCA 
TTCAAGCATT 
TTAAATTTCT 
GAAAAAAAAA 



Seq ID NO: 367 Protein sequence 
Protein Accession #: NP_004210 

1 11 21 31 41 51 

I I I I I I 

MATLIYVDKE NGEPGTRVVA KDGLKLGSGP SIKALDGRSQ VSTPRFGKTF DAPPALPKAT 

RKALGTVNRA TEKSVKTKGP LKQKQPSFSA KKMTEKTVKA KSSVPASDDA YPEIEKFFPF 

NPLDFESFDL PEEHQIAHLP LSGVPLMILD EERELEKLFQ LGPPSPVKMP SPPWESNLLQ 
SPSSILSTLD VELPPVCCDI DI 



Seq ID NO: 368 DNA sequence 
Nucleic Acid Accession #: NM_000597 
Coding sequence: 118-1104 



1 
I 

ATTCGGGGCG 
CCTGCCCGCC 
CTGCCGAGAG 
CCGCTGCTGC 
CTGTTCCGCT 
GCGCCGCCCG 
GTCCGGGAGC 
GGCGTCTACA 
CTGCCCCTGC 
TATGGCGCCA 
GTGGAGAACC 
AAGCCCCTCA 
CACCGGCAGA 
CGACCACCCC 



11 

I 

AGGGAGGAGG 
CGCCCGCTCG 
TGGGCTGCCC 
TGCTGCTACT 
GCCCGCCCTG 
CCGCGGTGGC 
CGGGCTGCGG 
CCCCGCGCTG 
AGGCGCTGGT 
GCCCGGAGCA 
ACGTGGACAG 
AGTCGGGTAT 
TGGGCAAGGG 
CTGCCAGGAC 



21 

1 

AAGAAGCGGA 
CTCGCTCGCC 
CGCGCTGCCG 
GGGCGCGAGT 
CACACCCGAG 
CGCAGTGGCC 
CTGCTGCTCG 
CGGCCAGGGG 
CATGGGCGAG 
GGTTGCAGAC 
CACCATGAAC 
GAAGGAGCTG 
TGGCAAGCAT 
TCCCTGCCAA 



31 

I 

GGAGGCGGCT 
CGCCGCGCCG 
CTGCCGCCGC 
GGCGGCGGCG 
CGCCTGGCCG 
GGAGGCGCCC 
GTGTGCGCCC 
CTGCGCTGCT 
GGCACTTGTG 
AATGGCGATG 
ATGTTGGGCG 
GCCGTGTTCC 
CACCTTGGCC 
CAGGAACTGG 



41 
I 

CCCGCTCGCA 
CGCTGCCGAC 
CGCCGCTGCT 
GCGGGGCGCG 
CCTGCGGGCC 
GCATGCCATG 
GGCTGGAGGG 
ATCCCCACCC 
AGAAGCGCCG 
ACCACTCAGA 
GGGGAGGCAG 
GGGAGAAGGT 
TGGAGGAGCC 
ACCAGGTCCT 



51 

I 

GGGCCGTGCA 
CGCCAGCATG 
GCCGCTGCTG 
CGCGGAGGTG 
CCCGCCGGTT 
CGCGGAGCTC 
CGAGGCGTGC 
GGGCTCCGAG 
GGACGCCGAG 
AGGAGGCCTG 
TGCTGGCCGG 
CACTGAGCAG 
CAAGAAGCTG 
GGAGCGGATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 




TCCACCATGC GCCTTCCGGA TGAGCGGGGC CCTCTGGAGC 
CCCAACTGTG ACAAGCATGG CCTGTACAAC CTCAAACAGT 
CAGCGTGGGG AGTGCTGGTG TGTGAACCCC AACACCGGGA 
ACCATCCGGG GGGACCCCGA GTGTCATCTC TTCTACAATG 
GTGCACACCC AGCGGATGCA GTAGACOGCA GCCAGCCGGT 
GCCCCTCTCC AAACACCGGC AGAAAACGGA GAGTGCTTGG 
TTCCAGTTCT GACACACGTA TTTATATTTG GAAAGAGACC 
CCCGGCCTCT CTCTTCCCAG CTGCAGATGC CACACCTGCT 
GAGGAAGGGG GTTGTGGTCG GGGAGCTGGG GTACAGGTTT 
TTTATTTTTG AACCCCTGTG TCCCTTTTGC ATAAGATTAA 

Seq ID NO: 369 Protein sequence 
Protein Accession #: NP_000588 



ACCTCTACTC 
GCAAGATGTC 
AGCTGATCCA 
AGCAGCAGGA 
GCCTGGOGCC 
GTGGTGGGTG 
AGCACCGAGC 
CCTTCTTGCT 
GGGGAGGGGG 
AGGAAGGAAA 



11 



21 31 41 

I I 1 1 t 

MLPRVGCPAL PLPPPPLLPL LPLLLLLLGA SGGGGGARAE VLFRCPPCTP 
VAPPAAVAAV AGGARMPCAE LVREPGCGCC SVCARLEGEA CGVYTPRCGQ 
ELPLQALVMG EGTCEKRRDA EYGASPEQVA DNGDDHSEGG LVENHVDSTM 
RKPLKSGMKE LAVFREKVTE QHRQMGKGGK HHLGLEEPKK LRPPPARTPC 
ISTMRLPDER GPLEHLYSLH IPNCDKHGLY NLKQCKMSLN GQRGECWCVN 
PTIRGDPECH LFYNEQQEAC GVHTQRMQ 

Seq ID NO: 370 DNA sequence 
Nucleic Acid Accession #: NM_004264 
Coding sequence: 6-440 



GGAACATGGC 
TTTGTAATGC 
AGACAGCAAT 
CAGCACTGAT 
AAGAATCTAC 
AAGCTGCTAC 
AAAGCGCACT 
AGTCTCTTCC 
GTGCCATTAA 
TTAAACACTA 
GATAAGCTTA 
GAGTGAAATT 
AATTCTGTTA 
C 



11 
I 

GGATCGGCTC 
CATTGGAGTA 
TAACAAAGAC 
TGCACGAACA 
AGCTGCTTTA 
ATGTGTGGAG 
TGCTGATATT 
AGACTCATAG 
GAATTCTGCA 
TGACACATTA 
TAAATCATGA 
ATTAAGGCAT 
TGACATAATT 



21 
I 

ACGCAGCTTC 
TTGCAGCAAT 
CAGCCAGCTA 
GCAAAAGACA 
CAGGCTGCTA 
GATGTTGTTT 
GCACAGTCAC 
CATCAGTGGA 
TCAGACTTAG 
CCTTTTTAGC 
TTGAATCAGC 
GTAATACATT 
TATGTCTCCA 



31 
I 

AGGACGCTGT 
GTGGTCCTCC 
ACCCTACAGA 
TTGATGTTTT 
GCTTGTATAA 
ATCGAGGAGA 
AGCTGAAGAC 
TACCATGTGG 
ATACAAGCCT 
TATTTTTAAT 
TTTAAAGCAT 
AATGAACATA 
TTTTGTTGTA 



41 
I 

GAATTCGCTT 
TGCCTCTTTC 
AGAGTATGCC 
GATAGATTCC 
GCTAGAAGAA 
CATGCTTCTG 
AAGAAGTGGT 
CTGAGAAAAG 
TACCAACAAT 
AGTCTTCTAT 
CATACCATCA 
ATATAAGGAA 
TTGGCCAGTA 



CCTGCACATC 
TCTGAACGGG 
GGGAGCCCCC 
GGCTTGCGGG 
CCTGCCCCCC 
CTGGAGGATT 
TCGGCACCTC 
TTCCCCGGGG 
AAGAGAAATT 
AGT 



51 

I 

ERLAACGPPP 
GLRCYPHPGS 
NMLGGGGSAG 
QQELDQVLER 
PNTGKLIQGA 



51 
I 

GCAGATCAGT 
AATAATATTC 
CAGCTTTTTG 
TTACCCAGTG 
GAAAACCATG 
GAGAAGATAC 
ACCCATAGCC 
AACTGTTTGA 
TACAGAAACA 
TTTCACTCTT 
TTTTTTAACT 
ACATATGTAA 
CTTTTACAAT 



Seq ID NO: 371 Protein sequence 
Protein Accession #: NP_004255 



11 
1 



31 



41 



51 



1 

i 

ATGGAGAATC 
CTGCGCGTGG 
CATTTCAAAG 
TTCTTAGACC 
ACAGTACAGA 

AGAGCTAAGG 
ACTCTTACAA 
TTCTCCTGGA 
TATGACACAT 
GAAACTATCA 
CTTCTTGGAA 
AAAGCTGTGG 
TTCTACATGC 
CTGTGGATTC 
ATTCCAATAT 
AAAGTTAGAT 
ATAAATTTTC 
CATGCCTGTG 



11 
I 

AGGTGTTGAC 
AGCTGAGTGA 
CTCAAGGACA 
TTGTGAAACC 
AGAAAGTGAG 
CTCCTGACTT 
AAGAAGAGCG 
ACTTAAGGAA 
TCTTTGTCAA 
TCCATACTGT 
ATGCAGCAAT 
GAAATTTTAT 
TTTTCTTTGT 
TGACGTGCAT 
CCTTATATCC 
TCAATGAGAC 
TTTCCTTTTT 
GTCACCTTTA 
ATCCCAGCGC 



21 

I 

GCCGCATGTC 
CGTACAGAAC 
TGGTGCCAAA 
AGAGCCTGTT 
TCAGTGGTGG 
TGATCGTTGG 
CCTAAATAAA 
AGGATACCTG 
CCTGACTGTG 
GGCTGACATG 
TGGAGTCACT 
TTTGTTTATC 
GTTTTATTTG 
TGACATGGAT 
ACTGGGATGT 
CGGACGATTC 
TCTTCAGATT 
TAAACAGCGC 
TTTGGGAGGC 



31 
I 

TACTGGGCTC 
CCTGCCATCA 
GGAGACAATG 
TACAAACTGA 
GAGAGACTCA 
CTGGATGAAT 
CTCCGACTGG 
TTTATGTATA 
CGATTCTGTA 
ATGTATTTCT 
ACGTCACCGG 
ATCTTTGGCA 
TGGAGTGCAA 
TGGAAGGTGC 
TTGGCGGAAG 
AGTTTCACAT 
TATCTTATAA 
AGACTGAAAA 
TGA 



41 

I 

AGCGACACCG 
GCATCACTGA 
TCTATGAATT 
CCCAGAGGCA 
CAAAGCAGGA 
CTGATGCGGA 
AAAGCGAAGG 
ATCTTGTGCA 
TCTTGGGAAA 
GCCAGATGCT 
TGCTGCCTTC 
CCATGGAAGA 
TTGAAATTTT 
TCACATGGCT 
CTGTCTCAGT 
TGCCATATCC 
TGATATTTTT 
TGAGGGCAGG 



Seq ID NO: 373 Protein sequence 
Protein Accession #: CAB69070 



41 



51 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 






MADRLTQLQD AVNSLADQFC NAIGVLQQCG PPASFNNIQT AINKDQPANP TEEYAQLFAA 
LIARTAKDID VLIDSLPSEE STAALQAASL YKLEEENHEA ATCVEDVVYR GDKLLEKIQS 
ALADIAQSQL KTRSGTHSQS LPDS 

Seq ID NO: 372 DNA sequence 
Nucleic Acid Accession #: AJ271091 
Coding sequence: 1-1113 



51 
I 

CGAGCTATAT 
AAACGTGCTG 
TCACCTGGAG 
GGTAAACATT 
AAAGCGACCA 
AATGGAGCTC 
CTCTCCTGAA 
ATTCTTGGGA 
AGAGTCCTTT 
GGCAGTTGTG 
TCTGATCCAG 
AATGCAGAAC 
CAGGTACTCT 
TCGTTACACT 
GATTCAGTCC 
AGTGAAAATC 
AGGTTTATAC 
CGCAGTGGCT 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



1 11 21 31 

I I 1 I I I 

MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGHGAK GDNVYEFHLE 
FLDLVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LFLAPDFDRW LDESDAEMEL 
RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 



60 
120 
180 




YDTFHTVADM MYFCQMLAVV ETINAAIGVT TSPVLPSLIQ LLGRNFILFI IFGTMEEMQN 
KAVVFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LAEAVSVIQS 
IPIFNETGRF SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RLKMKAGAVA 
HACDPSALGG 



Seq ID NO: 374 DNA sequence 
Nucleic Acid Accession #: NM_016395 
Coding sequence: 1-1113 






240 
300 
360 



1 
I 

ATGGAGAATC 
CTGCGCGTGG 
CATTTCAAAG 
TTCTTAGACC 
ACAGTACAGA 
CTGTTTTTGG 
AGAGCTAAGG 
ACTCTTACAA 
TTCTCCTGGA 
TATGACACAT 
GAAACTATCA 
CTTCTTGGAA 
AAAGCTGTGG 
TTCTACATGC 
CTGTGGATTC 
ATTCCAATAT 
AAAGTTAGAT 
ATAAATTTTC 
CATGCCTGTG 



11 
I 

AGGTGTTGAC 
AGCTGAGTGA 
CTCAAGGACA 
TTGTGAAACC 
AGAAAGTGAG 
CTCCTGACTT 
AAGAAGAGCG 
ACTTAAGGAA 
TCTTTGTCAA 
TCCATACTGT 
ATGCAGCAAT 
GAAATTTTAT 
TTTTCTTTGT 
TGACGTGCAT 
CCTTATATCC 
TCAATGAGAC 
TTTCCTTTTT 
GTCACCTTTA 
ATCCCAGCGC 



21 
I 

GCCGCATGTC 
CGTACAGAAC 
TGGTGCCAAA 
AGAGCCTGTT 
TCAGTGGTGG 
TGATCGTTGG 
CCTAAATAAA 
AGGATACCTG 
CCTGACTGTG 
GGCTGACATG 
TGGAGTCACT 
TTTGTTTATC 
GTTTTATTTG 
TGACATGGAT 
ACTGGGATGT 
CGGACGATTC 
TCTTCAGATT 
TAAACAGCGC 
TTTGGGAGGC 



31 

I 

TACTGGGCTC 
CCTGCCATCA 
GGAGACAATG 
TACAAACTGA 
GAGAGACTCA 
CTGGATGAAT 
CTCCGACTGG 
TTTATGTATA 
CGATTCTGTA 
ATGTATTTCT 
ACGTCACCGG 
ATCTTTGGCA 
TGGAGTGCAA 
TGGAAGGTGC 
TTGGCGGAAG 
AGTTTCACAT 
TATCTTATAA 
AGACTGAAAA 
TGA 



Seq ID NO: 375 Protein sequence 
Protein Accession #: NP_057479 



MENQVLTFHV 
FLDLVKPKPV 
RAKEEERLNK 
YDTFHTVADM 
KAWFPVFYL 
IPIFNETGRF 
STKKKDLDGF 



11 

I 

YWAQRHRELY 
YKLTQRQVNI 
LRLESEGSPE 
MYFCQMLAVV 
WSAIEIFRYS 
SFTLPYPVKI 
LPV 



21 

! 

LRVELSDVQN 
TVQKKVSQWW 
TLTNLRKGYL 
ETINAAIGVT 
FYMLTCIDMD 
KVRFSFFLQI 



31 
I 

PAISITENVL 
ERLTKQEKRP 
FMYNLVQFLG 
TSPVLPSLIQ 
WKVLTWLRYT 
YLIMIFLGLY 



41 

I 

AGCGACACCG 
GCATCACTGA 
TCTATGAATT 
CCCAGAGGCA 
CAAAGCAGGA 
CTGATGCGGA 
AAAGCGAAGG 
ATCTTGTGCA 
TCTTGGGAAA 
GCCAGATGCT 
TGCTGCCTTC 
CCATGGAAGA 
TTGAAATTTT 
TCACATGGCT 
CTGTCTCAGT 
TGCCATATCC 
TGATATTTTT 
TGAGGGCAGG 



41 

I 

HFKAQGHGAK 
LFLAPDFDRW 
FSWIFVNLTV 
LLGRNFILFI 
LWIPLYPLGC 
INFRHLYKQR 



51 

I 

CGAGCTATAT 

AAACGTGCTG 

TCACCTGGAG 

GGTAAACATT 

AAAGCGACCA 

AATGGAGCTC 

CTCTCCTGAA 

ATTCTTGGGA 

AGAGTCCTTT 

GGCAGTTGTG 

TCTGATCCAG 

AATGCAGAAC 

CAGGTACTCT 

TCGTTACACT 

GATTCAGTCC 

AGTGAAAATC 

AGGTTTATAC 

CGCAGTGGCT 



51 
I 

GDNVYEFHLE 
LDESDAEMEL 
RFCILGKESF 
IFGTMEEMQN 
LVEAVSVIQS 
RRRYGKKRKR 



ACAGCAGTTA 
CTACCTCGCT 
GTCGCGCTCG 
GAAGGGCCAC 
GGAGTACCTC 
GACGCGAATC 
GCTGCTGGGC 
GCTGCOCAAG 
CACCCAGGCC 
CATGCCACCA 
CTTCAGACTG 
TCGCCGCCCG 
CGGCCTCGGG 
CGGCTTGGGC 



11 
I 

CACTGCGGCG 
AGCATGTCGG 
TCGCGCGCCG 
TACGCCGAGC 
ACCGCTGAGA 
ATCCCCCGCC 
GGCGTGACGA 
AAGACCAGCG 
TCCCAGGAGT 
CAAAGGCCCT 
CGGGGCAAGC 
GCCTCGAGTC 
CCTGCCCTGT 
GCTCTTCGGG 



21 

i 

GGOGTCTGTT 
GCCGCGGCAA 
GCCTCCAGTT 
GCGTTGGCGC 
TCCTGGAGCT 
ACCTGCAGCT 
TCGCCCAGGG 
CCACCGTGGG 
ACTAAGAGGG 
TTTAAGGGCC 
GGGCCGCGGC 
CCCGCCCGCC 
CCGCCGTCCG 
GACCTCCGTG 



31 

I 

CTAGTGTTTG 
GACTGGCGGC 
CCCAGTGGGC 
CGGCGCGCCA 
GGCGGGCAAT 
GGCCATCCGC 
AGGCGTCCTG 
GCCGAAGGCG 
CCCGCGCCGC 
ACCACCGCCC 
TCCCTTCCCC 
CCCGCTCCCG 
CCCTCCGGTA 
GCGCGGAAGA 



41 
I 

AGCCGTCGTG 
AAGGCCCGCG 
CGTGTACACC 
GTGTACCTGG 
GCGGCCCGCG 
AACGACGAGG 
CCCAACATCC 
CCCTCGGGCG 
GGCCGGCCGC 
TCATGGAAAG 
TCCCCTCCCC 
TCCCGCACCG 
GGGTTCGGGC 
CCCGAGCCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 
240 
300 
360 



Seq ID NO: 376 DNA sequence 
Nucleic Acid Accession #: NM_005987 
Coding sequence: 1-270 

1 11 21 31 41 51 

1 I I I I I 

ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA GCAGCAGCAG 
GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC CAAGGAGCCC 
TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG CCAGCCCAAG 
ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT CACTCCAGCA 
CCAGCCCAGC AGAAGACCAA GCAGAAGTAA 

Seq ID NO: 377 Protein sequence 
Protein Accession #: NP_005978 

1 11 21 31 41 51 

I I I I I I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 



Seq ID NO: 378 DNA sequence 
Nucleic Acid Accession #: NM_002105 
Coding sequence: 74-505 



60 
120 
180 
240 



51 

I 

CTTCACCGGT 
CCAAGGCCAA 
GGCTGCTGCG 
CGGCAGTGCT 
ACAACAAGAA 
AGCTCAACAA 
AGGCCGTGCT 
GCAAGAAGGC 
CCCAGCTCCC 
AGCTGAGCCG 
TCGCCCGCCT 
CCTGCCGCGT 
CTTCCGGATG 
CCGGGGGGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 




GCCGGCGGCG CCGCACCTGC CCGCCTOGGC GTTCGTGACT 
GCTAAGGGGC TGOGGQGAGG CCGCAGCACC TTCTGGAAGA 
CAGOQCCGAG GTGGGCAGTC CAGGCCGAGA GCCGGCGGCC 
CGGCAGCTGC AGCCGGGGTG TCTGGTACCC CCCCGGCGTG 
AGACGGCCGC TGGCCGGGAG GCTTTGGTGG GAGAGACGCG 
GCCCCTTCTG CGGCCGGGAC CCAGGCCTTT CACATCAGCT 
AGGTCTGCGC TGGGGCOGGG ACGAAGCACT TGGTAACAGG 
CTGCCTCCTA GGAGGACATT TAGGGGAGGG CAGAGGCCTG 
CTATGTGGAC AGCAAGAGTC GTTTTGCGGA ACGCGACTGG 
CCGACGCCGC CCCATTTCCC TTCCAGCAAA CTCAACTCGG 
CAGCACAAGT CGGTTAATCC CTGTCTGGAC TGAGCCTCCG 
CTGCAGCTAA CCCTTCCACG ACTAGAACCT TAGGCATTGG 
TTTATTAAAG GATTGTTTTT TTTTT 

Seq ID NO: 379 Protein sequence 
Protein Accession #: NP_002096 






CAGCCGCCCC 
CTTGGCCTTC 
CTGAAGGTGA 
GTGCTTAGCC 
ATCGCCGATT 
CTCCCTCCAT 
CACATCTTCC 
CAGTTTGGCT 
CAGCCAGGCC 
CAATCCAAGC 
TTGGCTTCTG 
GGAGTTTTAG 



ATCCCGAGTC 
CGCTCTGACG 
GTGAGGCCCT 
CAGGACTTTC 
TOGGTCTGGC 
CTTCATTCAT 
TCCCGAGTGA 
TCACGGCTGG 
TGTCGGGCCC 
ACCTAGATAC 
AACTGGAATT 
ATGGACTAAT 



1 11 21 31 41 51 

I I I I I i 

MSGRGKTGGK ARAKAKSRSS RAGLQFPVGR VHRLLRKGHY AERVGAGAPV YLAAVLEYLT 
AEILELAGNA ARDNKKTRII PRHLQLAIRN DEELNKLLGG VTIAQGGVLP NIQAVLLPKK 
TSATVGPKAP SGGKKATQAS QEY 



Seq ID NO: 380 DNA sequence 
Nucleic Acid Accession #: AL136942 
Coding sequence: 184-864 



ACGCGTCCGG 
GGGCGAGCGG 
CAGGCGAGGC 
GCGATGAAGA 
CATGTCCGCA 
CTGTTGATTT 
CTGGGAGGTG 
CTTCTCATGA 
TGGATCATCC 
ATCACTGTGC 
TTTCCCTACA 
CTGTTTATTA 
TACCGATACA 
ACTACGGTGC 
CCGCCACCTT 
CTTTGCAGAC 
TTGTTTGTTG 
TCAACATATG 
TGGGGATATA 
GGACCTAGAA 
TCCCTCTCTT 
CCAGCATAGA 
ATTAGGTAAA 
TGACTGAAGT 
TAAGACCATT 
GATCTTGTGT 
GTGTTTGGCG 
GGCCCGCTTT 
TATCAAGTGG 
TCTTTTCCCT 
TAAAATGTAA 
GACTACCTGA 
TGTTGGTTCA 
CCACATCCAA 



11 
I 

CAGAAGCTCG 
GCCGGGAGCC 
GGTCGACGCT 
TGGTOGCGCC 
CCGGCACCAT 
TATTGAGTGC 
ACTTTGAGTT 
TCCTGATATG 
CATTCTTCTG 
TTATTTATCC 
GAGATGATGT 
GCATTATCTT 
TCAATGGTAG 
TGCTACCCCC 
ACGTGTCTGC 
ATCTGAGCAA 
CTGAAATGCT 
CTTTGCTAGA 
ACGGGCTTCA 
GTCTGCTTTT 
TTGAAAATGT 
GAACAAAACC 
TAGAAGTCCT 
TCAATGACAG 
AGAAAGCACC 
CCAGGGACAT 
CTGCATGGGA 
TACTAAGTGT 
AATTGGGATA 
GCAAGCTACA 
ACATTTTCAG 
ATTGCAAGGG 
TTATTGAATG 
AAAAAAAAAA 



21 
I 

GAGCTCTCGG 
GGAGCGGCGG 
CCTGAAAACT 
CTGGACGCGG 
CCTGCTCGGC 
CCTGGCTGAT 
CATGGATGAT 
TGCTATGGCT 
TTACCAGATC 
AAACTCCATT 
CATGTCAGTG 
GACTTTTAAG 
GAACTCCTCT 
GTATGATGAT 
CTAAGCCTTC 
TAGTTCTGTT 
ACTTTTTAAA 
ACACTGTGAT 
CTAACCTTCC 
GTACCTGCTG 
AAAATAAAAC 
TTATGGAAAC 
TATGTATGTG 
TTTGTGTTTG 
AGGCCGTGGG 
GGGGTGACAT 
TCTGGTGCCC 
TCTGCCCTAG 
TATTTGATAT 
TCCTACTGCT 
AAAAATGAGG 
ATTTTTATAT 
TGCTGTAAAT 
AAAAA 



31 
I 

GGTATCGAGG 
AGGAGCCGGC 
TGCGCGCGCG 
TTCTACTCCA 
GTCTGGTATC 
CCGGATCAGT 
GCCAACATGT 
ACTTACGGAG 
TTTGACTTTG 
CAGGAATACA 
AATCCTACCT 
GGTTACTTGA 
GATGTCCTGG 
GCCACTGTGA 
AAGTGGGCGG 
ATTTCACTTT 
ATTTAGATGT 
AGATTAACTG 
CTAGGCATTG 
GGCCCCAAAG 
CAAAAATAGA 
AGGAATGTCA 
TTACAAGAAT 
GTGGTAAAGG 
AGCAGTGACC 
GCCTCGTATG 
CTCTTCTCCT 
ATTGGTTCAA 
ACTTCTGCCT 
TTGAACTTCC 
ATTGCCTTCC 
ATTCATATGT 
TAAGTCGTTT 



41 

I 

AGGCAGGCCC 
AGCAGCGGCG 
CTOGCGCCAC 
ACAGCTGCTG 
TGATCATCAA 
ATAACTTTTC 
GCATTGCCAT 
CGTACAAGCA 
CCCTGAACAT 
TACGGCAACT 
GTTTGGTCCT 
TTAGCTGTGT 
TTTATGTTAC 
ATGGTGCTGC 
AGCTGAGGGC 
TGCCATGAGC 
TAGATTGAAA 
TAGAATTCTT 
AAACTTCCCC 
TTGGGCATTT 
CAACTTTTTC 
ATTGTGTAAT 
TTCCCCCACA 
ATTTTCTCCA 
ATCTACTGAC 
TGTTAGAGGG 
GGATTCACAT 
GGAGGTCATC 
AACAACATGG 
AAGTATGTCT 
TTGTATGCGC 
TACAAAGTCA 
GCAATTAAAA 



Seq ID NO: 381 Protein sequence 
Protein Accession #: CAB66876 



11 



21 
I 



31 



41 



51 



CAGATGCCAG 
CCTTGAGTGC 
TCTGCTCCTG 
CAATGAAAGA 
TGAAAATGAC 
AAACTCCTGG 



11 
I 

AAGAACACTG 
CTGCGTCCGT 
GCTGCAAGAT 
CCTTCTGCTT 
TGGAATGAAA 
AAGGGAGGCC 



21 
I 

TTGCTCTTGG 
GAGAATTCAG 
TGCCACTTGA 
ACATGAGGGA 
AACTCTACCC 
GTGTGCAGGC 



31 
I 

TGGAOGGGCC 
CATGGAATGT 
TGCCGCCAAA 
GCACAATCAA 
AGTGTGGAAG 
GQTCCTGACC 



41 

I 

CAGAGGAATT 
CTCTACTATT 
OGATTTCATG 
TTAAATGGCT 
OGGGGAGACA 
AGTGACTCAC 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



51 

I 

GCGGGCGCAC 

OGGCGGGCTC 

TGCGCCCGGA 

CTTGTGCTGC 

TGCTGTGGTA 

AAGTTCTGAA 

TGCGATTTCT 

ACGCGCAGCC 

GTTGGTTGCA 

GCCTCCTAAT 

TATTATTCTT 

TTGGAACTGC 

CAGCAATGAC 

CAAGGAGCCA 

AGCAGCTTGA 

CTCTCTGAGC 

ACTGTAGTTT 

CCTGTACGAT 

CAAATCTGAT 

TTCTCTCTGT 

TTCAGCCATT 

CATTGTTCTA 

ACATCCTTTA 

TGGCCTGAAT 

TGTTCTTGTG 

TGGAATGGAT 

CCCCACCCAG 

CAACTGACTT 

AAAAGGGTTT 

AGTCACCTTT 

TTTTTACCTT 

GCAACTCTCC 

CAAGGTTTGC 



MKMVAPWTRP YSNSCCLCCH VRTGTILLGV WYLIINAWL LILLSALADP DQYNFSSSEL 
GGDFEFMDDA NMCIAIAISL LMIIjICAMAT YGAYKQRAAW IIPFPCYQIP DFALNMLVAI 
TVLIYPNSIQ EYIRQLPPNF PYRDDVMSVN PTCLVLIILL PISIILTFKG YLISCVWNCY 
RYINGRNSSD VLVYVTSNDT TVLLPPYDDA TVNGAAKEPP PPYVSA 

Seq ID NO: 382 DNA sequence 
Nucleic Acid Accession #: NM_002510 
Coding sequence: 92-1774 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



60 
120 
180 



51 

1 

CAGAGTTAAA 
TCCTGGGATT 
ATGTGCTGGG 
GGTCTTCTGA 
TGAGGTGGAA 
CAGCCCTCGT 



60 
120 
180 
240 
300 
360 




GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420 

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 480 

ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 780 

CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCOGAOGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 900 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 

CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG GAACCTTCAG 1020 

CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 1080 

CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 1200 

AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 1260 

GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTCGTGACC TGCCAAGGGA GCATTCCCAC 1320 

GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA CAGTCTGCAG 1380 

CCCTGTGGAT GTGGATGAGA TGTGTCTGCT GACTGTGAGA CGAACCTTCA ATGGGTCTGG 1440 

GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA CGAGCACCCT 1500 

GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA GTGCCCTGAT 1560 

CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT ACAAAAAACA 1620 

CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA AAGGCCTGAG 1680 

TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA AGGATCCGCT 1740 

ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC TGAAGCTCAC 1800 

TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT TTCCTAAAGA 1860 

TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT TAAATGTCAT 1920 

TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT GAAACTGATA 1980 

AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA AAGTCTTAGG 2040 

TAACTAGTAG GATAGAAACA CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC TATTGATTAG 2100 

AGCCTAACCC AGGTTAACTG CAAGAAGAGG CGGGATACTT TCAGCTTTCC ATGTAACTGT 2160 

ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC TGAATCCCAC 2220 

TTCAATACAC ACTCATGAAC TCCTGATGGA ACAATAACAG GCCCAAGCCT GTGGTATGAT 2280 

GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG AGTATTTTGG 2340 

TGACAACCTA CTTTGCTTGG CTGAGTGAAG GAATGATATT CATATATTCA TTTATTCCAT 2400 

GGACATTTAG TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC TCTTGTGTAT 2460 

ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAT CATATATTAA GACTTTCCAA 2520 

AGATGAGGTC CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC TCTGTTTGTA 2580 

ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC CTGAAAAATA 2640 
AAGTGTGGGA AGAGACAAAA AAAAAAAAA 

Seq ID NO: 383 Protein sequence 
Protein Accession #: NP_002501 

1 11 21 31 41 51 

I I I I ! I 

MECLYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 60 

VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120 

RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY VVTDQIPVFV 240 

TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 360 

RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF VVTCQGSIPT EVCTIISDPT 420 

CEITQNTVCS PVDVDEMCLL TVRRTFNGSG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNVVRSKGLS VFLNRAKAVF 540 
FPGNQEKDPL LKNQEFKGVS 

Seq ID NO: 384 DNA sequence 
Nucleic Acid Accession #: NM_001134 
Coding sequence: 48-1877 

1 11 21 31 41 51 

1 I I I I I 

TCCATATTGT GCTTCCACCA CTGCCAATAA CAAAATAACT AGCAACCATG AAGTGGGTGG 60 

AATCAATTTT TTTAATTTTC CTACTAAATT TTACTGAATC CAGAACACTG CATAGAAATG 120 

AATATGGAAT AGCTTCCATA TTGGATTCTT ACCAATGTAC TGCAGAGATA AGTTTAGCTG 180 

ACCTGGCTAC CATATTTTTT GCCCAGTTTG TTCAAGAAGC CACTTACAAG GAAGTAAGCA 240 

AAATGGTGAA AGATGCATTG ACTGCAATTG AGAAACCCAC TGGAGATGAA CAGTCTTCAG 300 

GGTGTTTAGA AAACCAGCTA CCTGCCTTTC TGGAAGAACT TTGCCATGAG AAAGAAATTT 360 

TGGAGAAGTA CGGACATTCA GACTGCTGCA GCCAAAGTGA AGAGGGAAGA CATAACTGTT 420 

TTCTTGCACA CAAAAAGCCC ACTCCAGCAT CGATCCCACT TTTCCAAGTT CCAGAACCTG 480 

TCACAAGCTG TGAAGCATAT GAAGAAGACA GGGAGACATT CATGAACAAA TTCATTTATG 540 

AGATAGCAAG AAGGCATCCC TTCCTGTATG CACCTACAAT TCTTCTTTGG GCTGCTCGCT 600 

ATGACAAAAT AATTCCATCT TGCTGCAAAG CTGAAAATGC AGTTGAATGC TTCCAAACAA 660 

AGGCAGCAAC AGTTACAAAA GAATTAAGAG AAAGCAGCTT GTTAAATCAA CATGCATGTG 720 

CAGTAATGAA AAATTTTGGG ACCCGAACTT TCCAAGCCAT AACTGTTACT AAACTGAGTC 780 

AGAAGTTTAC CAAAGTTAAT TTTACTGAAA TCCAGAAACT AGTCCTGGAT GTGGCCCATG 840 

TACATGAGCA CTGTTGCAGA GGAGATGTGC TGGATTGTCT GCAGGATGGG GAAAAAATCA 900 

TGTCCTACAT ATGTTCTCAA CAAGACACTC TGTCAAACAA AATAACAGAA TGCTGCAAAC 960 

TGACCACGCT GGAACGTGGT CAATGTATAA TTCATGCAGA AAATGATGAA AAACCTGAAG 1020 

GTCTATCTCC AAATCTAAAC AGGTTTTTAG GAGATAGAGA TTTTAACCAA TTTTCTTCAG 1080 

GGGAAAAAAA TATCTTCTTG GCAAGTTTTG TTCATGAATA TTCAAGAAGA CATCCTCAGC 1140 

TTGCTGTCTC AGTAATTCTA AGAGTTGCTA AAGGATACCA GGAGTTATTG GAGAAGTGTT 1200 

TCCAGACTGA AAACCCTCTT GAATGCCAAG ATAAAGGAGA AGAAGAATTA CAGAAATACA 1260 

TCCAGGAGAG CCAAGCATTG GCAAAGCGAA GCTGCGGCCT CTTCCAGAAA CTAGGAGAAT 1320 

ATTACTTACA AAATGCGTTT CTCGTTGCTT ACACAAAGAA AGCCCCCCAG CTGACCTCGT 1380 

CGGAGCTGAT GGCCATCACC AGAAAAATGG CAGCCACAGC AGCCACTTGT TGCCAACTCA 1440 

GTGAGGACAA ACTATTGGCC TGTGGCGAGG GAGCGGCTGA CATTATTATC GGACACTTAT 1500 




GTATCAGACA TGAAATGACT CCRGTAAACC CTGGTGTR38 
ATGCCAACAG GAGGCCATGC TTCAGCAGCT TGGTGGTGGA 
CATTCTCTGA TGACAAGTTC ATTTTCCATA AGGATCTGTG 
TGCAAACGAT GAAGCAAGA3 TTTCTCATTA ACCTTGTGAA 
AGGAACAACT TGAGGCTGTC ATTGCAGATT TCTCAGGCCT 
GCCAGGAACA GGAAGTCTGC TTTGCTGAAG AGGGACAAAA 
CTGCTTTGGG AGTTTAAATT ACTTCAGGGG AAGAGAAGAC 
TGTGAACTTT TCTCTTTAAT TTTAACTGAT TTAACACTTT 
AAAGACTTTT ATGTGAGATT TCCTTATCAC AGAAATAAAA 






Seq ID NO: 385 Protein sequence 
Protein Accession #: NP_001125 



15 | 



MKWVESIFLI 
KEVSKMVKDA 
RHNCFLAHKK 
WAARYDKIIP 
TKLSQKFTKV 
ECCKLTTLER 
RHPQLAVSVI 
KLGEYYLQNA 
IGHLCIRHEM 
QGVALQTMKQ 
SKTRAALGV 



11 
I 

FLLNFTESRT 
LTAIEKPTGD 
PTPASIPLFQ 
SCCKAENAVE 
NFTEIQKLVL 
GQCIIHAEND 
LRVAKGYQEL 
FLVAYTKKAP 
TPVNPGVGQC 
EFLINLVKQK 



21 

I 

LHRNEYGIAS 
EQSSGCLENQ 
VPEPVTSCEA 
CFQTKAATVT 
DVAHVHEHCC 
EKPEGLSPNL 
LEKCFQTENP 
QLTSSELMAI 
CTSSYANRRP 
PQITEEQLEA 



31 
I 

ILDSYQCTAE 
LPAFLEELCH 
YEEDRETFMN 
KELRESSLLN 
RGDVLDCLQD 
NRFLGDRDFN 
LECQDKGEEE 
TRKMAATAAT 
CFSSLVVDET 
VIADFSGLLE 



Seq ID NO: 386 DNA sequence 

Nucleic Acid Accession #: NM_002205.1 

Coding sequence: 1..3149 



ATGGGGAGCC 
CGCCGACCCC 
GGCTTCAACT 
GGATTCTCAG 
CCCAAGGCTA 
TGGGGTGCCA 
CTGGAGTCCT 
TGGTTCGGGG 
AGCTGGCGCA 
GATAACTTCA 
GGACAGGGTT 
TTAGGTGGAC 
ATTGCAGAAT 
CGCCAGGCCA 
TTCAGTGGTG 
GGCTATGTCA 
CAGATGGCCT 
GATGACTTGC 
GAGGTGGGCA 
CTTACCCTCA 
GACCTGGACC 
CAGCAGGGAG 
CAGGTTCTGC 
CGAGGAGGCC 
GTGGACAAGG 
ATCTTCCCCG 
GCCTGCATCA 
GGTTTCACAG 
CTGTTCCTGG 
CGAGAGGATT 
CTCTCGCCGA 
CACGGCCTCA 
ATCTTGCTGG 
GGGGAGCAGA 
CAGAATGTGG 
GCTGAGTACT 
TTTGCCGTGA 
GCCAGTCTGT 
ATCCAGTTTG 
TCCTTTCGGC 
GAGGCAGTGC 
GACCTGGGAC 
AGCCAGGGTG 
GTGACCAGAG 
GAGTTGGATC 
TCTGCTTCCT 
TGTGAGCTCG 
TGGGCCAAGA 
TACAAAGCCC 
CAGGTGGCCA 
ATCATCATCC 
TACAAGCTTG 
CTCAAGCCTC 



11 

I 

GGACGCCAGA 
CGCTSSTGCC 
TAGACGCGGA 
TGGAGTTTTA 
ATACCAGCCA 
GCCCCACACA 
CACTGTCCAG 
CAACAGTTCG 
CAGAGAAGGA 
CCCGAATTCT 
ACTGCCAAGG 
CAGGAAGCTA 
CTTATTACCC 
GTTCCATCTA 
ATGACACAGA 
CCATCCTTAA 
CXTTACTTTGG 
TGGTGGGGGC 
GGGTCTACGT 
CTGGCCATGA 
AGGATGGCTA 
TAGTGTTTGT 
AGCCCCTGTG 
GAGACCTGGA 
CTGTGGTATA 
CCATGTTCAA 
ACCTTAGCTT 
TGGAACTTCA 
CCTCCAGGCA 
GCAGAGAGAT 
TTCACATCGC 
GGCCAGCCCT 
ACTGTGGAGA 
ACCATGTGTA 
GTGAGGGTGG 
CAGGACTCGT 
ACCAGAGCCG 
GGGGTGGCCT 
ACTTCCAGAT 
TCTCOGTGGA 
TATTCCCAGT 
CTGCTGTCCA 
TGCTGGAACT 
TTAOGGGACT 
CCGAGGGTTC 
CGGGACCTCA 
GGCCCCTGCA 
CTTTCTTGCA 
TGAAGATGCC 
CAGCTGTGCA 
TAGCCATCCT 
GATTCTTCAA 
CAGCCACCTC 



21 

I 

GTCCCCTCTC 
GCTGCTGTTG 
GGCCCCAGCA 
CCGGCCGGGA 
GCCAGGAGTG 
GTGCACCCCC 
CTCAGAGGGA 
AGCCCATGGC 
GCCACTGAGC 
GGAGTATGCA 
AGGCTTCAGT 
TTTCTGGCAA 
CGAGTACCTG 
TGATGACAGC 
AGACTTTGTT 
TGGCTCAGAC 
CTATGCAGTG 
ACCCCTGCTC 
CTACCTGCAG 
TGAGTTTGGC 
CAATGATGTG 
ATTTCCTGGG 
GGCAGCCAGC 
TGGCAATGGA 
CAGGGGCCGC 
CCCAGAGGAG 
CTGCCTCAAT 
GCTGGACTGG 
GGCAACCCTG 
GAAGATCTAC 
TCTCAACTTC 
ACATTATCAG 
AGACAACATC 
CCTGGGTGAC 
CGCCTATGAG 
CAGACACCCA 
CCTGCTGGTG 
TCGGTTTACA 
CCTCAGCAAG 
GGCTCAGGCC 
AAGCGACTGG 
CCATGTCTAT 
CAGCTGTCCC 
CAACTGCACC 
CCTGCACCAC 
GATCCTGAAA 
CCAACAAGAG 
GCGGGAGCAC 
CTACCGAATC 
ATGGACCAAG 
GTTTGGCCTC 
ACGCTCCCTC 
TGATGCCTGA 



31 
I 

CACGCCGTGC 
CTGCTSSTGC 
GTACTCTCGG 
ACAGACGGGG 
CTGCAGGGTG 
ATTGAATTTG 
GAGGAGCCTG 
TCCTCCATCT 
GACCCCGTGG 
CCCTGCCGCT 
GCCGAGTTCA 
GGCCAGATCC 
ATCAACCTGG 
TACCTAGGAT 
GCTGGTGTGC 
ATTCGATCCC 
GCCGCCACAG 
ATGGATCGGA 
CACCCAGCCG 
CGATTTGGCA 
GCCATCGGGG 
GGCCCAGGAG 
CACACCCCAG 
TATCCTGATC 
CCCATCGTGT 
OGGAGCTGCA 
GCTTCTGGAA 
CAGAAGCAGA 
ACCCAGACCC 
CTCAGGAACG 
TCCTTGGACC 
AGCAAGAGCC 
TGTGTGCCTG 
AAGAATGCCC 
GCTGAGCTTC 
GGGAACTTCT 
TGTGACCTGG 
GTCCCTCATC 
AATCTCAACA 
CAGGTCACCC 
CATCCCCGAG 
GAGCTCATCA 
CAGGCTCTGG 
ACCAATCACC 
CAGCAAAAAC 
TGCCCGGAGG 
AGCCAAAGTC 
CAGCCATTTA 
CTGCCTCGGC 
GCAGAAGGCA 
CTGCTCCTAG 
CCATATGGCA 



CCAGTGCTGC 
TGAAACATAT 
CCAAGCTCAG 
GCAAAAGCCA 
GTTGGAGAAA 
ACTGATTTCA 
AAAACGAGTC 
TTGTGAATTA 
TATCTCCAAA 



41 
I 

ISLADLATIF 
EKEILEKYGH 
KFIYEIARRH 
QHACAVMKNF 
GEKIMSYICS 
QFSSGEKNIF 
LQKYIQESQA 
CCQLSEDKLL 
YVPPAFSDDK 
KCCQGQEQEV 



41 
I 

AGCTGCGCTG 
CGCCGCCACC 
GGCCCCCGGG 
TCAGTGTGCT 
GTGCTGTCTA 
ACAGCAAAGG 
TGGAGTACAA 
TGGCATGOGC 
GCACCTGCTA 
CAGATTTCAG 
CCAAGACTGG 
TGTCTGCCAC 
TTCAGGGGCA 
ACTCTGTGGC 
CCAAAGGGAA 
TCTACAACTT 
ACGTCAATGG 
CCCCTGACGG 
GCATAGAGCC 
GCTCCTTGAC 
CTCCCTTTGG 
GGCTGGGCTC 
ACTTCTTTGG 
TGATTGTGGG 
CCGCTAGTGC 
GCTTAGAGGG 
AACACGTTGC 
AGGGAGGGGT 
TGCTCATCCA 
AGTCAGAATT 
CCCAAGCCCC 
GGATAGAGGA 
ACCTGCAGCT 
TGAACCTCAC 
GGGTCACCGC 
CCAGCCTGAG 
GCAACCCCAT 
TCCGGGACAC 
ACTCGCAAAG 
TGAACGGTGT 
ACCAGCCTCA 
ACCAAGGCCC 
AAGGTCAGCA 
CCATTAACCC 
GGGAAGCTCC 
CTGAGTGTTT 
TGCAGTTGCA 
GCCTGCAGTG 
AGCTGCCCCA 
GCTATGGCGT 
GTCTACTCAT 
CCGCCATGGA 



ACTTCTTCAT 
GTCCCTCCTG 
GGTGTAGCGC 
CAAATAACAG 
TGCTGCCAAG 
AAAACTCGTG 
TTTCATTCGG 
ATGAAATGAT 
TG 



51 
I 

FAQFVQEATY 
SDCCSQSEEG 
PFLYAPTILL 
GTRTFQAITV 
QQDTLSNKIT 
LASFVHEYSR 
LAKRSCGLFQ 
ACGEGAADII 
FIFHKDLCQA 
CFAEEGQKLI 



51 
1 

GGGCCCCCGG 
CAGGGTCGGG 
CTCCTTCTTC 
GGTGGGAGCA 
CCTCTGTCCT 
CTCTCGQCTC 
GTCCTTGCAG 
TCCACTGTAC 
CCTCTCCACA 
CTGGGCAGCA 
CCGTGTGGTT 
TCAGGAGCAG 
GCTGCAGACT 
TGTTGGTGAA 
CCTCACTTAC 
CTCAGGGGAA 
GGACGGGCTG 
GCGGCCTCAG 
CACGCCCACC 
CCCCCTGGGG 
TGGGGAGACC 
TAAGCCTTCC 
CTCTGCCCTT 
GTCCTTTGGT 
CTCCCTCACC 
GAACCCTGTG 
TGACTCCATT 
ACGGCGGGCA 
GAATGGGGCT 
TCGAGACAAA 
AGTGGACAGC 
CAAGGCTCAG 
GGAAGTGTTT 
TTTCCATGCC 
CCCTCCAGAG 
CTGTGACTAC 
GAAGGCAGGA 
TAAGAAAACC 
CGACGTGGTT 
CTCCAAGCCT 
GAAGGAGGAG 
CAGCTCCATT 
GCTCCTATAT 
AAAGGGCCTG 
AAGCCGCAGC 
CAGGCTGOGC 
TTTCCGAGTC 
TGAGGCTGTG 
AAAAGAGCGT 
CCCACTGTGG 
CTACATCCTC 
AAAAGCTCAG 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 






WO 02/086443 






Seq ID NO: 387 Protein sequence
Protein Accession #: NP_002196.1






1 
I 

MGSRTPESPL 
GFSVEFYRPG 
LESSLSSSEG 
DNFTRILEYA 
IAESYYPEYL 
GYVTILNGSD 
EVGRVYVYLQ 
QQGWFVFPG 
VDKAWYRGR 
GFTVBLQLDW 
LSPIHIALNF 
GEQNHVYLGD 
FAVNQSRLLV 
SFRLSVEAQA 
SQGVLELSCP 
5ASSGPQXLK 
YKALKMPYRI 
YKLGPPKRSL 



11
I 

HAVQLRWGPR 
TDGVSVLVGA 
ESPVEYKSLQ 
PCRSDPSWAA 
INLVQGQLQT
IRSLYNFSGE 
HPAGIEPTPT 
GPGGLGSKPS 
PIVSASASLT 
QKQKGGVRRA 
SLDPQAPVDS 
KNALNLTFHA 
CDLGNPMKAG 
QVTLNGVSKP
QALEGQQLLY 
CPEAECFRLR 
LPRQLPQKER 
PYGTAMEKAQ 



21 

I 

RRPPLLPLLL 
PKANTSQPGV 
WFGATVRAHG 
GQGYCQGGFS 
RQASSIYDDS 
QMASYFGYAV 
LTLTGHDEFG 
QVLQPLWAAS 
IFPAMFKPEE 
LFLASRQATI* 
HGLRPALHYQ 
QNVGEGGAYE 
ASLWGGLRPT 
EAVLFPVSDW
VTRVTGLNCT 
CELGPLHQQE 
QVATAVQWTK 
LKPPATSDA 



31 
I 

LLLPPPPRVG 
LQGGAVYLCP
SSILACAPLY 
AEFTKTGRW 
YLGYSVAVGE 
AATDVNGDGL 
RFGSSLTPLG 
HTPDFFGSAL 
RSCSLEGNPV 
TQTLLIQNGA 
SKSRIEDKAQ 
AELRVTAPPE 
VPHLRDTKKT 
HPRDQPQKEE 
TNHPINPKGL 
SQSLQLHFRV 
AEGSYGVPLW 



41 

I 

GFNLDAEAPA 
WGASPTQCTP 
SWRTEKEPLS 
LGGPGSYFWQ 
FSGDDTEDFV 
DDLLVGAPLL 
DLDQDGYNDV 
RGGRDLDGNG 
ACINLSFCLN 
REDCREMKIY 
ILLDCGEDNI 
AEYSGLVRHP 
IQFDFQILSK 
DLGPAVHHVY 



WAKTFLQREH 
IIILAILFGXi 



51 

I 

VLSGPPGSFF 
IEFDSKGSRL 
DPVGTCYLST 
GQILSATQEQ 
AGVPKGNLTY 
MDRTPDGRPQ 
AXGAPFGGET 
YPDLIVGSFG 
ASGKHVADSI 
LRNESEFRDK 
CVPDLQLEVF 
GNFSSLSCDY 
NLNNSQSDW 
ELXNQGPSSI 
QQKREAPSRS 
QPFSLQCEAV 
LLLGlAlYIL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 388 DNA sequence

Nucleic Acid Accession #: NM_002425

Coding sequence: 26..1453



AAAGAAGGTA 
AGTCTGCTCT 
TGCCCAGCAA 
AAAGGACAGT 
GGTGACAGGG 
TCCTGACGTT 
TACATACAGG 
TGAGAAAGCT 
AGGAGAGGCT 
TGATGGCCCA 
TATTCACTTT 
CGTTGCTGCT 
TTTGATGTAC 
TGATGTGAAT 
GGTGCCCACA 
GTCCTTCGAT 
TTGGCGAAGA 
CTCTCTTCCA 
TTTTAAAGGA 
AGGCATCCAT 
CAAGGAAAAG 
TAGCCAGTCC 
GCCTAAGGTT 
ACAGTTTGAG 
GTTACATTGC 
ATTATTCATC 
GAAGAAGATG 
ACTTGCTTTT 
ATGTATTTTC 
CTT 



11 
I 

AGGGCAGTGA 
GCCTATCCTC 
TACCTAGAAA 
AATCTCATTG 
AAGCTAGACA 
GGTCACTTCA 
ATTGTGAATT 
CTGAAAGTCT 
GATATAATGA 
GGACACAGTT 
GATGATGATG 
CATGAACTTG 
CCACTCTACA 
GGCATTCAGT 
AAATCTGTTC 
GCCATCAGCA 
TCCCACTGGA 
TCATATTTGG 
AATGAGTTCT 
ACCCTGGGTT 
AAGAAAACAT 
ATGGAGCAAG 
GATGCTGTAT 
TTTGACCCCA 
TAGGCGAGAT 
TAATGTATTA 
AGCCTTGCAG 
GAATTGCACT 
ATAGATGTGT 



21 

! 

GAATGATGCA 
TGAGTGGGGC 
AGTACTACAA 
TTAAAAAAAT 
CTGACACTCT 
GCTCCTTTCC 
ATACACCAGA 
GGGAAGAGGT 
TCTCTTTCGC 
TGGCTCATGC 
AAAAATGGAC 
GCCACTCCCT 
ACTCATTCAC 
CTCTCTAOGG 
CTTOGGGATC 
CTCTGAGGGG. 
ACCCTGAACC 
ATGCTGCATA 
GGGCCATCAG 
TTCCTCCAAC 
ACTTCTTTGC 
GCTTCCCTAG 
TACAGGCATT 
ATGCCAGGAT 
AGGGGGAAGA 
TGAGCCAAAA 
ATATCTGCAT 
GAACAGAATT 
TATTACTTCC 



31 
I 

TCTTGCATTC 
AGGAAAAGAG 
CCTCGAAAAG 
CCAAGGAATG 
GGAGGTGATG 
TGGCATGCCG 
TTTGCCAAGA 
GACTCCACTC 
AGTTAAAGAA 
CTACCCACCT 
AGAAGATGCA 
GGGGCTCTTT 
AGAGCTCGCC 
ACCTCCCCCT 
TGAGATGCCA 
AGAATATCTG 
TGAATTTCAT 
TGAAGTTAAC 
AGGAAATGAG 
CATAAGGAAA 
AGCGGACAAA 
ACTAATAGCT 
TGGATTTTTC 
GGTGACACAC 
CAGATATGGG 
TGGTTAATTT 
GTGTCATGAA 
AAGAAATACT 
TCAATAAAAA 



Seq ID NO: 389 Protein sequence 
Protein Accession #: NP_002416 



MHLAFLVLLC 
KIQGMQKFLG 
PDLPRDAVDS 
HAYPPGPGLY 
PTELAQFRLS 
RGEYLFFKDR 
IRGNEVQAGY 
PRXiXADDFPG 



11 
I 

LPVCSAYPLS 
LEVTGKLDTD 
AIEKALKVWE 
GDIHFDDDEK 
QDDVNGIQSL 
YFWRRSHWNP 
PRGIHTLGFP
VEPKVDAVLQ 



TLEVMRKPRC 
EVTPLTFSRL
WTEDASGTNL
YGPPPASTEE 
EPEFHLISAF 
PT IRK ID AAV 
AFGFFYFFSG 



31 
I 

DLAQQYLEKY 
GVPDVGHFSS 
YEGEADIMIS 
FLVAAHELGH 
PLVPTKSVPS 
WPSLPSYLDA 
SDKEKKKTYF 
SSQFEFDPNA 



GAGGACTCCA 
GATGTGAAAC 
CAGAAGTTCC 
CGCAAGCCCA 
AAGTGGAGGA 
GATGCTGTTQ 
ACATTCTCCA 
CATGGAGACT 
GGACCTGGGC 
TCAGGCACCA 
CACTCAGCCA 
CAGTTCCGCC 
GCCTCTACTG 
GCCAAGTGTG 
TTCTTTAAAG 
TTGATTTCTG 
AGCAGGGACA 
GTACAAGCAG 
ATTGATGCAG 
TACTGGAGAT 
GATGACTTTC 
TACTTCTTCA 
ATATTAAAGA 
TGTTTTTAAT 
TTCCTGCATG 
GAATGTTTCT 
CATGTGCAAT 
GTTTTATTTT 



41 

I 

YNLEKDVKQF 
PPGMPKWRKT 
FAVKEHGDFY 
SLGLFHSANT 
GSEMPAKCDP 
AYEVNSRDTV 
FAADKYWRFD 
RMVTHILKSN 



ACAAGGATCT 
AGTTTAGAAG 
TTGGGTTGGA 
GGTGTGGAGT 
AAACCCACCT 
ATTCTGCCAT 
GGCTGTATGA 
TTTACTCTTT 
TTTATGGAGA 
ATTTATTCCT 
ACACTGAAGC 
TTTCGCAAGA 
AGGAACCCCT 
ATCCTGCTTT 
ACAGATATTT 
CATTTTGGCC 
CCGTTTTTAT 
GTTATCCAAG 
CTGTTTCTGA 
TTGATGAAAA 
CAGGAGTTGA 
GTGGATCATC 
GTAACAGCTG 
AAATCTAATA 
TTCTGTGACT 
GGAATTCTTC 
AGGTGAGAGA 
GGGCCTGTTC 



51 
I 

RRKDSNLIVK 
HLTYRIVNYT 
SFDGPGHSLA 
EALMYPLYNS 
ALSFDAISTL 
FIFKGNEFWA 
ENSQSMEQGF 
SWLHC 



Seq ID NO: 390 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



1 11 21 

I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG 
CCAGCGACTC TAGAAACACA AGAGCAAGAT 
TACTACAACC TGAAGAATGA TGGGAGGCAA 
GTTGAAAAAT TGAAGCAAAT GCAGGAATTC 
GCTGAAACCC TGAAGGTGAT GAAGCAGCCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 
300 
360 
420 



31 41 51 

I I I 

CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 
GTGGACTTAG TCCAGAAATA CCTGGAAAAA 
GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 
TTTGGGCTGA AAGTGACTGG GAAACCAGAT 
AGATGTGGAG TGCCTGATGT GGCTCAGTTT 



60 
120 
180 
240 
300 




GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC 
TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA 
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG 
ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT 
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC 
GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG 
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG 
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA
AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG 
TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC 
TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT 
CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC 
CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG 
CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG 
GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA 
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG 
GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NO: 391 Protein sequence 
Protein Accession #: NP_002412.1






TGACCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GTGTTGCGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAGC 
TGATGTTCTT 
TCAATTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAGCATAT 
CTAACAAATA 
TGATAGCACA 
GATTTTTCTA 
TTTTGACTCT 



GATTGAAAAT 
CTTCCAACTC 
AGACATCATG 
TGGAGGAAAT 
TGATGAAGAT 
TCATGAACTC 
CCCTAGCTAC 
AGCCATATAT 
ATGTGACAGT 
TAAAGACAGA 
TTCTGTTTTC 
AGATGAAGTC 
ACACGGATAC 
CGATGCTGCT 
CTGGAGGTAT 
TGACTTTCCT 
TTTCTTTCAT 
CCAGAAAGCT 



1 

1 

MHSFPPLLLL 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 

I 

LFWGVVSHSF
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 
I 

VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHEL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



Seq ID NO: 392 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



ATGCACAGCT 
CCAGCGACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 
GTCCTCACTG 
TACACGCCAG 
TGGAGTAATG 
ATATCTTTTG 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGACGTTCCC 
AAGCTAACCT 
TTCTACATGC 
TGGCCACAAC 
CGGTTTTTCA 
CCCAAGGACA 
CTTTCTGAGG 
GATGAATATA 
GGAATTGGCC 
GGAACAAGAC 
AATAGCTGGT 



11 

I 

TTCCTCCACT 
TAGAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 
AGGGGAACCC 
ATTTGCCAAG 
TCACACCTCT 
TCAGGGGAGA 
CTTTTCAACC 
CCAACAATTT 
TTGGACTCTC 
GTGATGTTCA 
AAAATCCTGT 
TTGATGCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTGG 
AACGATCTAT 
ACAAAGTTGA 
AATACAAATT 
TCAACTGCAG 



21 
1 

GCTGCTGCTG 
AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCCC 
TCGCTGGGAG 
AGCAGATGTG 
GACATTCACC 
TCATCGGGAC 
AGGCCCAGGT 
CAGAGAGTAC 
CCATTCTACT 
GCTAGCTCAG 
CCAGCCCATC 
AACTACGATT 
CTTCTACCCG 
GCTTGAAGCT 
GTACTGGGCT 

AAAAACCTAC 
GGATCCAGGT 
TGCAGTTTTC 
TGATCCTAAA 
GAAAAATTAG 



31 

I 

CTGTTCTGGG 
GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 
CAAACACATC 
GACCATGCCA 
AAGGTCTCTG 
AACTCTCCTT 
ATTGGAGGGG 
AACTTACATC 
GATATCGGGG 
GATGACATTG 
GGCCCACAAA 
CGGGGAGAAG 
GAAGTTGAGC 
GCTTACGAAT 
GTTCAGGGAC 
CCTAGAACTG 
TTCTTTGTTG 
TATCCCAAAA 
ATGAAAGATG 



41 

I 

GTGTGGTGTC 
TCCAGAAATA 
GGAGAAATAG 
AAGTGACTGG 
TGCCTGATGT 
TGACCTACAG 
TTGAGAAAGC 
AGGGTCAAGC 
TTGATGGACC 
ATGCTCATTT 
GTGTTGCGGC 
CTTTGATGTA 
ATGGCATCCA 
CCCCAAAAGC 
TGATGTTCTT 
TCAATTTCAT 
TTGCCGACAG 
AGAATGTGCT 
TGAAGCATAT 
CTAACAAATA 
TGATAGCACA 
GATTTTTCTA 
TTTTGACTCT 



51 
I 

ACACAGCTTC 
CCTGGAAAAA 
TGGCCCAGTG 
GAAACCAGAT 
GGCTCAGTTT 
GATTGAAAAT 
CTTCCAACTC 
AGACATCATG 
TGGAGGAAAT 
TGATGAAGAT 
TCATGCCCTC 
CCCTAGCTAC 
AGCCATATAT 
ATGTGACAGT 
TAAAGACAGA 
TTCTGTTTTC 
AGATGAAGTC 
ACACGGATAC 
CGATGCTGCT 
CTGGAGGTAT 
TGACTTTCCT 
TTTCTTTCAT 
CCAGAAAGCT 



Seq ID NO: 393 Protein sequence 
Protein Accession #: NP_002412.1



MHSFPPLLLL 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 

! 

LFWGVVSHSF
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 
I 

VDLVQKYLEK
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHAL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



41 

I

YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGDHRD 
GHSLGLSHST 
KLTFDAITTI 
RFFKGNKYWA 
DEYKRSMDPG 
NSWFNCRKN 



51 
I 

VEKRRNSGPV 
QTHLTYRIEN 
NSPFDGPGGN 
DIGALMYPSY 
RGEVMFFKDR 
VQGQNVLHGY 
YPKMIAHDFP 



Seq ID NO: 394 DNA sequence 

Nucleic Acid Accession #: NM_014331.2

Coding sequence: 1..1506



360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



41 


51 




1 

YYNLKNDGRQ 


1 

VEKRRNSGPV 


60 


VLTEGNPRWE 


QTHLTYRIEN 


120 


ISFVRGDHRD 


NSPFDGPGGN 


180 


GHSLGLSHST 


DIGALMYPSY 


240 


KLTFDAITTI 


RGEVMFFKDR 


300 


RFFKGNKYWA 


VQGQNVLHGY 


360 


DEYKRSMDPG 


YPKMIAHDFP 


420 


NSWFNCRKN 







60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 
360 
420 



11 
I 



21 

I 



31 
I 



41 
I 



51 




ATGGTCAGAA AGCCTGTTGT 
AACGGGAGGC TGCCTTCCCT
AAGAGGAAAG TCACTTTACT 
GGAATCTTCA TCTCTCCTAA 
ACCATCTGGA CGGTGTGTGG 
GGAACAACTA TAAAGAAATC 
TTACCAGCTT TTGTACGAGT 
GTGATATCCC TGGCATTTGG 
CCTGAACTTG CGATCAAGCT 
AGCATGAGTG TCAGCTGGAG 
GCAATTCTGA TAATTATAGT 
TTTAAAGACG CGTTTTCAGG
TATGGAATGT ATGCATATGC 
AACCCTGAAA AAACCATTCC 
TATGTGCTGA CAAATGTGGC 
AATGCAGTGG CAGTGACCTT 
ATCTTTGTTG CCCTCTCCTG 
TTATTCTATG TTGCGTCTCG 
CGCAAGCACA CTCCTCTACC 
TTCTCTGGAG ACCTOGACAG 
GGGCTGGCAG TTGCTGGGCT 
TTCAAGGTGC CACTGTTCAT 
CTTTCCCTCT ATTCGGACCC 
GTCCCTGCGT ATTATCTCTT
TCAGAGAAAA TAACCAGAAC 
TTATGAACTA ATGGACTTGA 
TTTTTACTTC ATTTTCTGAA 
CAGTTATTTT TATTCATATA 
AACTCTATGT AGTTATAGAA 
GTCTCTGATA CCTACCTATT 
TCTCTACAAC ATATGTTAGC 
TATATATGGG TTTTGTAAAG 
TTTTCAATTC TGAAAAAAAG
ATTTTACATT GACATTGCAT 
GCTTTAATGG ATTATACCCA 
TTAAAGAAGA GTTTCTAGGG 
AAAAATCCTT GAGAATTTAT 
TTATCTGTCA TTTTTTTTTT 
AGCAAGAGTT AGTTTGGTAT 
TACCCCTGAT GAGTCTATCT 
TGAGAGAAAT AACCAACAAA 
CTACATGCAA TGTTAGTAAT 
TGAATTTTGA CAGTTTGTGC 
CTTCAGATGA AACTGTCCAG 
AAGAAATGTC GCTGTAAATA 
ATCTAGGCTT TGTCAGTAAT 
CTGATAAGAA GAAAATTGAA 
GTTTTGCCAG TATTAGAAAA 
GCACTTTGGG AGGCTGAGGG 
CAACATGGAG AAACCCCATC 
GCTGGTAATC TCAGCTATTG 
GAGGTTGCAG TGAGCCAAGA 
CCATCTCCAA AAAAAAAAAA 






GTCCACCATC 
GGGCAACAAO 
GAGGGGAGTC 
GGGCGTGCTC 
GGTCCTGTCA 
TGGAGGTCAT 
CTGGGTGGAA 
ACGCTACATT 
CATTACAGCT 
CGCCCGGATC 
CCCTGGAGTT 
AAGAGATTCA 
TGGCTGGTTT 
CCTTGCAATA 
CTACTTTACG 
TTCTGAGCGQ 
CTTTGGCTCC 
AGAGGGTCAC 
AGCTGTTATT 
TCTTTTGAAT 
GATTTATCTT 
CCCAGCTTTG 
ATTTAGTACA 
TATTATATGG 
ATTACAAATA 
GATCTTGGCA 
AGTCTAGAGA 
TTTTAGCATA 
AGTGAATATG 
GGGGTTAGGA 
ACGGCAAAGA 
ATGGTTTtAC 
CATACATCAT 
TGCTTCCCCT 
GAGCACTTTG 
GCTACTGTTT 
TATGTCAGAT 
TCACATCAGT 
TAAATCCTCA 
AAACATATGC 
GAAGATGTTC 
TCTGAAGTTT 
ATTTTCTTTA 
ATTAATTAGG 
AGATTTACAA 
TTCCACACCT 
ATGAGAATCT 
TACTGTGAGC 
GGTGGATCAC 
TCTACTAAAA 
AGGAGGCTGA 
TTGCACCACT 



TCCAAAGGAG 
GAGCCACCTG 
TCCATTATCA 
CAGAACACGG 
CTATTTGGAG 
TACACATATA 
CTCCTCATAA 
CTGGAACCAT 
GTGGGCATAA 
CAGATTTTCT 
ATGCAGCTAA 
AGTATTAOGC 
TACCTCAACT 
TGTATATCCA 
ACCATTAATG 
CTACTGGGAA 
ATGAACGGTG 
CTTCCAGAAA 
GTTTTGCACC 
TTCCTCAGTT 
CGATACAAAT 
TTTTCCTTCA 
GGGATTGGCT 
GACAAGAAAC 
ATACTGGAAG 
ATCTGCCCAA 
ATTACAACTT 
TTOGAACTAA 
CAGTTATTCT 
GAAAAGACTA 
ACCTTCAAAT 
ACACTACAGA 
GATTATGGCA 
TAGATACCAA 
AACAAAGGTC 
ATGAGACACA 
GTTTTTTCAT 
TTGATCAGGA 
TTAGAACAAC 
AT TTTAAGCC 
AAAATAATAG 
TTTAAATTTA 
TACATTTTAT 
AAAAGGCATA 
CTGATGTTTC 
TAATTATCAT 
GTGGATAAGT 
CGGGCATGGT 
CTGAGGTCGG 
ATACAAAATT 
GGCAGGAGAA 
GTACTCCAGC 



Seq ID NO: 395 Protein sequence 
Protein Accession #: NP_055146.1



1 

I 

MVRKPVVSTI
GIFISPKGVL 
LPAFVRVWVE 
SMSVSWSARI 
YGMYAYAGWP 
NAVAVTFSER
RKHTPLPAVI 
FKVPLFIPAL
SEKITRTLQI 



11 

I 

SKGGYLQGNV 
QNTGSVGMSL 
LLIIRPAATA 
QIPLTFCKLT 
YLNFVTEEVE 
LLGNFSLAVP 
VLHPLTMIML 
FSFTCLFMVA 
ILEVVPEEDK



21 
I 

NGRLPSLGNK 
TIWTVCGVLS 
VISLAFGRYI
AILIIIVPGV 
NPEKTIPLAI 
IFVALSCFGS 
FSGDLDSLLN 
LSLYSDPFST 
L



31 
I 

EPPGQEKVQL 
LFGALSYAEL 
LEPFFIQCEI 
MQLIKGQTQN 
CISMAITIGV 
MNGGVFAVSR 
FLSFARWLFI 
GIGFVITLTG 



GTTACCTGCA 
GGCAGGAGAA 
TTGGCACCAT 
GCAGCGTGGG 
CTTTGTCTTA 
TTTTGGAAGT 
TACGCCCTGC 
TTTTTATTCA 
CTGTAGTGAT 
TAACCTTTTG 
TTAAAGGTCA 
GGTTGCCACT 
TTGTTACTGA 
TGGCCATTGT 
CTGAGGAGCT 
ATTTCTCATT 
GTGTGTTTGC 
TCCTCTCCAT 
CTTTGACAAT 
TTGCCAGGTG 
GCCCAGATAT 
CATGCCTCTT 
TCGTCATCAC 
CCAGGTGGTT 
TTGTACCAGA 
GGGGAGACAC 
TGGTGATAAA 
TTTCTAAGAA 
ATGAGTCGCA 
GACAATTACT 
TGAAGACTGA 
TGTCTATACT 
AAGAGGAGAG 
TTTAGATAAC 
AGTGGGGATT 
TCCAGGAGTT 
TCATTATCAG 
AAGTGTATAA 
CACCTGTTTC 
TTCAAATTAC 
TCCCATATCT 
TGGCTATTTT 
ATTCTTCTGT 
TATTAACATA 
TAGAAAATTT 
TCAACTTGCA 
GTTTGTGTTC 
GGCTTACATC 
GAGTTCTAGA 
AGCTGGGCAT 
TTGCTTGAAC 
CTGGGTGACA 



41 

I 

KRKVTLLRGV 
GTTIKKSGGH 
PELAIKLITA 
PKDAFSGRDS 
YVLTNVAYFT 
LFYVASREGH 
GLAVAGLIYL 
VPAYYLFIIW 



GGGAAATGTT 
AGTGCAGCTG 
CATTGGAGCA 
CATGTCTCTG 
TGCTGAATTG 
CTTTGGTCCA 
AGCTACTGCT 
ATGTGAAATC 
GGTCCTAAAT 
CAAGCTCACA 
AACGCAGAAC 
GGCTTTTTAT 
AGAAGTAGAA 
CACCATTGGC 
GCTGCTTTCA 
AGCAGTTCCG 
TGTCTCCAGG 
GATTCATGTC 
GATAATGCTC 
GCTTTTTATT 
GCATCGTCCT 
CATGGTTGCC 
TCTGACTGGA 
TAGAATAATG 
AGAAGATAAG 
AAAATAGGGA 
CAAAAGGAGT 
ATTTAGTTAT 
CAATTCTTGA 
ATGTGGTCAT 
GATTTTTCTG 
GTGAAAAGTG 
AAAGAAATTT 
AAACACTCAT 
GTTGAATACA 
ATGTTTAAGT 
GAAGTTTTAG 
CACATCTTAG 
ACTAATAACT 
ATTATCAACA 
GTAATCATAT 
TACACGATGA 
TAAAATATCT 
AAAATTGCAA 
CCACTTCTAT 
AAAGAGACAA 
AGAAGATGTT 
TGTAATCCCA 
CCAGCCTGAC 
GGTGGCACAT 
CCGGGAGGCG 
AAGTCAGACT 



51 

I 

SIIIGTIIGA
YTYILEVFGP 
VGITVVMVLN
SITRLPLAFY 
TINAEELLLS 
LPEILSMIHV 
RYKCPDMHRP 
DKKPRWFRIM 



Seq ID NO: 396 DNA sequence 
Nucleic Acid Accession #: NM_006528
Coding sequence: 57..764



GCCGCCAGCG 
ACCCCGCTCG 
GCGATGCTGC 
ACGGACCCTG 
GCCAGTTCCT 
GCGACGATGC 
TGGACGACCA 
GTGAAAAATT 
AAGCTACTTG 
AAGATGAGGG 
CCTGTGATGC 



11 
I 

GCTTTCTCGG 
CCCCCTGGGG 
TCAGGAGCCA 
CCGGGCCCTA 
GTACGGGGGC 
TTGCTGGAGG 
GTGTGAGGGG 
CTTTTCCGGT 
TATGGGCTTC 
ACTGTGCTCT 
TTTCACCTAT 



21 
I 

ACGCCTTGCC 
CTGTCGATTC 
ACAGGAAATA 
CTTCTCCGTT 
TGCGAGGGCA 
ATAGAAAAAG 
TCCACAGAAA 
GGGTGTCACC 
TGCGCACCAA 
GCCAATGTGA 
ACTGGCTGTG 



31 

I 

CAGCGGGCCG 
TGCTGCTTTT 
ACGCGGAGAT 
ACTACTACGA 
ACGCCAACAA 
TTCCCAAAGT 
AGTATTTCTT 
GGAACCGGAT 
AGAAAATTCC 
CTCGCTATTA 
GAGGGAATGA 



41 
I 

OCCGACCCCC 
CCTGACGGAG 
CTGTCTCCTG 
CAGGTACACG 
TTTCTACACC 
TTGCCGGCTG 
TAATCTAAGT 
TGAGAACAGG 
ATCATTTTGC 
TTTTAATCCA 
CAATAACTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



60 
120 
180 
240 
300 
360 
420 
480 



51 

I 

TGCACCATGG 
GCTGCACTGG 
CCCCTAGACT 
CAGAGCTGCC 
TGGGAGGCTT 
CAAGTGAGTG 
TCCATGACAT 
TTTCCAGATG 
TACAGTCCAA 
AGATACAGAA 
GTTAGCAGGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 




AGGATTGCAA ACGTGCATGT 
GCTTTGCCAG TAGAATCCGG 
ATCTTGTTTG TCTTTATGGC 
GCATGAGGAA ACAAATCATT 
TTCAAAAATT TGGATTTTTT 
TTTAATTTAT GGTTCAACTG 
AAATATGACT CACTCATTTC 
AAACAACATA AGACAATATA 
CC 



Seq ID NO: 397 Protein sequence 
Protein Accession #: NP_006519 



GCAAAAGCTT 
AAAATTCGGA 
TTATTTGCCT 
GGTGATTTAT 
TATATATAAC 
TTTGTGAGAC 
TTGGGGTCGT 
ATCATGTGCT 



TGAAAAAGAA 
AGAAGCAATT 
TTATGGTTGT 
TCACCAGTTT 
TAGCTGCTAT 
GAATTCTTGC 
ATTCCTGATT 
TTTAACATAT 



AAAGAAGATG 
TTAAACATTC 
ATCTGAAGAA 
TTATTAATAC 
TCAAATGTGA 
AATGCATAAG 
TCAGAAGAGG 
TTGAGAATAA 



CCAAAGCTTC 
TTAATATGTC 
TAATATGACA 
AAGTCACTTT 
GTCTACCATT 
ATATAAAAGC 
ATCATAACTG 
AAAGGACTAG 



1          11         21         31         41         51
|          |          |          |          |          |

MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 
CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCBGST EKYFFNLSSM 
TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDBGLCSAN VTRYYFNPRY 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 
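
The coding-sequence annotation given for Seq ID NO: 396 (57..764) can be checked against the length of the protein listed as Seq ID NO: 397. The following is a minimal sketch and not part of the original listing. It assumes the range is 1-based and inclusive and that it includes the stop codon; the function names `cds_length` and `extract_cds` are hypothetical helpers introduced only for illustration.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range such as 57..764."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def extract_cds(sequence: str, start: int, end: int) -> str:
    """Slice a 1-based, inclusive range out of a nucleotide string.

    Whitespace (the 10-base grouping used in the listing) is removed first.
    """
    cleaned = "".join(sequence.split()).upper()
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    if end > len(cleaned):
        raise ValueError("range extends past the end of the sequence")
    return cleaned[start - 1:end]


# Assumed interpretation of "Coding sequence: 57..764".
length = cds_length(57, 764)
codons, remainder = divmod(length, 3)
print(length, codons, remainder)
```

Under these assumptions the range spans 708 nucleotides, that is 236 codons with no remainder, which agrees with the 235 residues listed for Seq ID NO: 397 plus one stop codon.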

Seq ID NO: 398 DNA sequence 

Nucleic Acid Accession #: NM_001508.1

Coding sequence: 1..1361 



1 
I 

ATGGCTTCAC 
CCCGAGTTTG 
TTCGTGATGG 
AAAGGATACT 
TTGGTGTTCC 
ACGTCCAGCT 
GCTACGCTGC 
TTCAGGTACA 
GTCACCTCCG 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCAGCA 
ATGTGCTGGA 
ACGCGGCCTC 
ACCATCATCT 
ATTCGGAGGA 
GCGTACATGA 
CCGCTCCTGT 
TGCCGCCTGT 
ACCACCGACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 



11 

I 

CCAGCCTCCC 
AGGTGGCCAC 
GCCTTCTGGG 
TGCAGAAGGA 
TCATCGGCAT 
ACACCCTGTC 
TGCACGTGCT 
AGGCTGTGTC 
CCCTGGTGGC 
CCAGCCACCG 
CCTCCAATAT 
TCTTCGGCGC 
ACATGATGCA 
CGCAGCTGAG 
TCCTGAGGCT 
TCATGGCTGC 
TCCTCCTCCC 
ACACGGTGTC 
CGCTGCAGCA 
GCGCCCGCTT 
GAACTGAGAA 
AGTCATTGAG 
CAGAGAATGG 



21 

i 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGTC 
GGTGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACGCTCAGC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
GAAGTCCGAG 
GATTGTTGTG 
GGCCAAACCC 
CTTCTCGGAG 
CTCGCAGCAG 
CGCCAACCAC 
TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTTCAGGAG 



31 
I 

TGCTCCCAAA 
ATCACCCTTA 
ACCATTCGGG 
CACATGGTGA 
TTCTACAGCA 
CACACTTTCC 
TTTGAGCGCT 
CAGGTGAAGC 
CTGTTTGCCA 
TGCAACCGCT
ACCAACCTCT 
TACCTCGTGG 
AAAAGCCAGA 
AGCGAAGAGA 
ACATTGGCCG 
AAGCACGACT 
ACGTTTTTCT 
TTTCGGCGGG 
GAGAAGCGCC 
CCGTTGCTCT 
AGCACTTTTC 
CTAGAGCCCA 
CATGAAGTTT 



41 

I 

TCATTGATCA 
TTCTGGTGTA 
TCACCCAGGT 
GTTTGGCTTG 
TCATCTGGAA 
TCTTCGAGGC 
ACATCGCCAT 
TGCTGATTGG 
TGGGTACTGA 
CCAGCACCCG 
CCAGCCGCTG 
TCCTGCTCTC 
AGGGCTCGCT 
GCAGGACCGC 
TATGCTGGAT 
GGACGAGGTC 
ACCTCAGCTC 
TGTTCGTGCA 
TGCGCGTACA 
TCGCGTCCCG 
AGAGCGAGGC 
ACTCAGGCGC 
GA 



Seq ID NO: 399 Protein sequence
Protein Accession #: NP_001499.1



1 
I 

MASPSLPGSD 
KGYLQKEVTD
ATLLHVLTLS 
VNVPSHRGLT 
MCWNMMQVLM 
IRRIMAAAKP 
CRLSLQHANH 



11 
I 

CSQIIDHSHV 
HMVSLACSDI 
FERYIAICHP 
CNRSSTRHHE
KSQKGSLAGG 
KHDWTRSYFR 
EKRLRVHAKS 
LEPNSGAKPA 



21 
I 

PEFEVATWIK 
LVPLIGMPME 
FRYKAVSGPC 
QPETSNMSIC 
TRPPQLRKSE 
AYMILLPFSE 
TTDSARFVQR 
NSAAENGFQE 



31 

1 

ITLILVYLII 
PYSIIWNPLT 
QVKLLIGFVW 
TNLSSRWTVF 
SEESRTARRQ 
TPFYLSSVIN 
PLLFASRRQS 
HEV 



41 
I 

FVMGLLGNSV 
TSSYTLSCKL 
VTSALVALPL 
QSSIFGAFVV
TIIFLRLIVV
PLLYTVSSQQ 
SARRTEKIFL 



Seq ID NO: 400 DNA sequence 

Nucleic Acid Accession #: NM_006475.1

Coding sequence: 28..2538



1 
I 

AACAGAACTG 
TTGCTGCTTA 
AGTCGTATCA 
ACCAAAAAGA 
AAAACGACTG 
TGCCCAGCAG 
ACAACGCAGC 
TTCACTTACT 
GGTTTGGAGA 
AAGAGAATGT 
TTGGGGCTTT 
ATCCATGGGA 
CAAATTGGTA 
GCAGCTGCCA 
TTTGCTCCCA 
GGAGACAAAG 
TCTGAGTCTA 



11 

I 

CAACGGAGAG 
TTGTTAACCC 
GGGGTCGGGA 
AATACTTCAG 
TTTTATATGA 
TTTTGCCCAT 
GCTATTCTGA 
TTGCACCGAG 
GCAACGTGAA 
TGACCAAGGA 
TCATTAACCA 
ACCAGATTGC 
CCTCAATTCA 
TCACATCGGA 
CCAATGAGGC 
TGGCTTCCGA 
TTATGGGAGG 



21 
I 

ACTCAAGATG 
TATAAACGCC 
CCAAGGCCCA 
CACTTGTAAG 
ATGTTGCCCT 
TGACCATGTT 
CGCCTCAAAA 
TAATGAGGCT 
TGTTGAATTA 
CTTAAAAAAT 
TTATCCTAAT 
AACAAATGGT 
AGACTTCATT 
CATATTGGAG 
TTTTGAGAAA 
AGCTCTTATG 
AGCAGTCTTT 



31 

i 

ATTCCCTTTT 
AACAATCATT 
AATGTCTGTG 
AACTGGTATA 
GGTTATATGA 
TATGGCACTC 
CTGAGGGAGG 
TGGGACAACT 
CTGAATGCTT 
GGCATGATTA 

GTTGTCCATG 
GAAGCAGAAG 
GCCCTTGGAA 
CTTCCACGAG 
AAGTACCACA 
GAGACGCTGG 



41 

r 

TACCCATGTT 
ATGACAAGAT 
CCCTTCAACA 
AAAAGTCCAT 
GAATGGAAGG 
TGGGCATCGT 
AGATCGAGGG 
TGGATTCTGA 
TACATAGTCA 
TTCCTTCAAT 
CTGTTAATTG 
TCATTGACCG 
ATGACCTTTC 
GAGACGGTCA 
GTGTCCTAGA 
TCTTAAATAC 
AAGGAAATAC 



720 
780 
840 
900 
960 
1020 
1080 
1140 






60 
120 
180 



51 
I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



51 
I 

TIRVTQVLQK 
HTFLFEACSY 
LFAMGTEYPL 
YLVVLLSVAF
TIAVCWMPNQ 
FRRVFVQVLC
STFQSEAEPQ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
180 
240 
300 
360 
420 



51 
I 

TTCTCTACTA 
CTTGGCTCAT 
GATTTTGGGC 
CTGTGGACAG 
AATGAAAGGC 
GGGAGCCACC 
AAAGGGATCC 
TATCCGTAGA 
CATGATTAAT 
GTATAACAAT 
TGCTCGAATC 
TGTGCTTACA 
ATCTTTTAGA 
CTTCACACTC 
AAGGTTCATG 
TCTCCAGTGT 
AATTGAGATA 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 




GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC 
GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC 
TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA 
GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA 
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG 
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG 
CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA 
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 






Seq ID NO: 401 Protein sequence 
Protein Accession #: NP_006466.1



MIPFLPMFSL 
KNWYKKSICG 
KLREEIEGKG 
NGMIIPSMYN 
IEAEDDLSSP 
MKYHILNTLQ 
IDQVLIPDSA 
VQRLLKLILQ 
RNGAIHIFRE 
FKGMTSEEKE 
TLLVNELKSK 
PKEIPVTVYT 
TEVIHGEPII 
QEEVTKVTKF 



11 
I 

LLLLIVNPIN 
QKTTVLYECC 
SPTYFAPSNE 
NLGLFINHYP 
RAAAITSDIL 
CSESIMGGAV 
KQVIELAGKQ
NHILKVKVGL
IIKPAEKSLH
ILIRDKNALQ
ESDIMTTNGV
TKIITKVVEP
KKYTKIIDGV
IEGGDGHLFE



21 
1 

ANNHYDKILA 
PGYMRMEGMK 
AWDNLDSDIR 
NGVVTVNCAR
EALGRDGHFT 
FETLEGNTIE 
QTTFTDLVAQ 
NELYNGQILE 



NIILYHLTPG 
IHVVDKLLYP
KIKVIEGSLQ 
PVEITEKETR 
DEEIKRLLQG 



31 
[ 

HSRIRGRDQG 
GCPAVLPIDH 
RGLESNVNVE 
IIHGNQIATN
LFAPTNEAFE 
IGCDGDSITV 
LGLASALRPD 
TIGGKQLRVF 
TFLSLLEAAD 
VFIGKGFEPG
ADTPVGNDQL 
PIIKTEGPTL 
EERIITGPEI 
DTPVRKLQAN 



41 

I 

PNVCALQQIL 
VYGTLGIVGA 
LLNALHSHMI 
GVVHVIDRVL
KLPRGVLERF
NGIKMVNKKD
GEYTLLAPVN 
VYRTAVCIEN 
LKELLTQPGD 
VTNILKTTQG 
LEILNKLIKY 
TKVKIEGEPE 
KYTRISTGGG 
KKVQGSRRRL 



AAAGGATATT 
TTCTGCCAAA 
GGCCCAATTA 
TGTGAATAAT 
TCTGCAGAAT 
ACTGGAAACC 
TGAAAATTCA 
CCGCGAGATC 
CTTTAGCACC 
TGGAGACTGG 
AAAAGAAATT 
ACCAGGAGTT 
ACAAGGAAGC 
ATCAAAAGAA 
CTATCCAGCA 
CAAATACATC 
CTATACAACT 
TCTTCAGCCT 
ACCTGAATTC 
AATTATTAAA 
GACACGAGAA 
AGGTGGAGAA 
CAAATTCATT 
TCAGGGAGAC 
ACGATTAAGG 
ACCCTAAGTC 
AAACATCAGC 
TTCTGAATGA 
GAAAATATAA 
AATCCATTAG 
AACTGTTATC 
CGCAAGCCAT 
ACTTTTTATA 
TTATTTTTTA 
TATTTTTTAA 
AAATTGAGTA 



51 
I 

GTKKKYFSTC 
TTTQRYSDAS 
NKRMLTKDLK 
TQIGTSIQDP 
MGDKVASEAL 
IVTNNGVIHL 
NAFSDDTLSM



WTLFVPTNDA 
SKIFLKEVND
IQIKFVRGST 
FRLIKEGETI
ETEETLKKLL 



Seq ID NO: 402 DNA sequence

Nucleic Acid Accession #: NM_002416

Coding sequence: 40..417



1 
I 

ATCCAATACA 
TTCCTCTTGG 
AAGGGTCGCT 
GACCTTAAAC 
AAGAATGGAG 
AAGTGGGAGA 
AAGAAAGTTC 
ACCACTTCAC 
TTCCAAAGGA 
ATTACTCTGA 
TTGTTAAAGG 
TTAAGGCCAT 
CTCACAACAG 
TATCTGAGGC 
TGAAATTGAG 
CTACAGGCCT 
ACTGGAGATC 
TTCCATCTTG 
AAGTCAGCTC 
CATATGCTCT 
AGTGCTTTCT 
AAATAAACCT 
CACATGGGTG 



11 
I 

GGAGTGACTT 
GCATCATCTT 
GTTCCTGCAT 
AATTTGCCCC 
TTCAAACATG 
AACAGGTCAG 
TGAAAGTTCG 
CAATAAGTAT 
GGATGGCATA 
AATTGTAACT 
CTATGATTGT 
GATTTTAGCA 
CTGCCTGGAA 
ACATGTCAGC 
CTGGACCTCA 
CACACACAAT 
ACCAGTGTGT 
CCCGCTCAGG 
TTCTCCATCC 
GATTTATCTG 
TCTCCCAATT 
TTTTGGACAC 
AACACTCAAT 



21 
I 

GGAACTCCAT 
GCTGGTTCTG 
CAGCACCAAC 
AAGCCCTTCC 
TCTAAACCCA 
CCAAAAGAAA 
AAAATCTCAA 
TCTGTGTTAA 
TAATACAAAG 
AAAGTTAGAA 
CTTTGTTCTT 
ATACCCATGT 
GAGCAGCCCT 
AAGTCCTAAG 
CCAAGCTGCT 
GTGTCTGAGA 
GGCTTTCAGA 
CTGACCACTT 
TACCACAATG 
AGTCAACTCC 
CATCCTCACT 
ACAAATTATC 
GGTTAACTAA 



31 
I 

TCTATCACTA 
ATTGGAGTGC 
CAAGGGACTA 
TGCGAGAAAA 
GATTCAGCAG 
AAGCAAAAGA 
CGTTCTCGTC 
AAATGTTCTA 
GCTTATTAAT 
AGTTGATTTT 
CTACCACCCA 
CTACACAGAT 
AGGCTTCCAC 
CCTGTTAGCA 
GTGGCCATCA 
GATTCATGCT 
GCCTCCTTTC 
TATTTCTTTT 
CAGTGCCTTT 
TTTCTCATCT 
CAGTCCAGCT 
TTAAAACTCC 
TTCTTGGGTG 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760
2820 
2880 
2940 
3000 
3060 
3120 
3180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 





AGATTGTCAG CTCCTTGAGG GCAAGAGCCA 
CTAATAATAC TGTGGAACTA GGTTTTAATA 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT 
CTAGCCTCTG GTAACCTCTT ACTTATTATC 
GATGCAACAT CCTTGTCTTT TTATGACAGG 
GCACGTGGTA AAACACTTGC GGATATTCTG 
AAAATCATAT AATCTTACAA TGAAAAGGAC 
CCAACCATAC AAAAATTCCT TTTCCCGAAG 
TCTAAGATCT AACAAGATAG CCACCGAGAT 
AGTTTTATTG TCCGTTTACT TGTTTCAGAG 
TCTCCCATGA AGAAAGGGAA CGGTGAAGTA 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT 
CTTTCCCAAA TTGAATCACT GCTCACACTG 
TCCCACCCGA AOGTCTTATC TAATCATGAA 
AAAAATCTAA GTGTTTCATA AATTTGAGAG 
GTAGACAGTA TATAACTAAC AACCAAAGAC 
TCATTTATCA TATATATACA TACATGCATA 
AAACAGTATT GACTTGTATA CCTTGTAATT 
TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 403 Protein sequence 
Protein Accession #: NP_002407

21 






CAGTATATTT 
ATTTTTTAAT 
GCTGGCTCTT 
TTCAGGACAC 
ATGTTTGCTC 
GACTGTTTTT 
TTTATAGATC 
GAAAAGGGCT 
CCTTATCGAA 
TTTGTATTGT 
CTAAGCGCTA 
CTCTGCAGGA 
GTCTGTGGCC 
CTGATGATTT 
ACTCCCTAGT 
TCTGTGACCC 
TACATATTGT 
CACTCTCAAA 
TGAAATATTT 



CCCTGTTTCT 
TGATGTTGTT 
TCCTGGCTAC 
TCACTACAGG 
AGCTTCTCCA 
AAAAAATATA 
AGCCAGTGAC 
TTCTCAATAA 
ACTCATTTTA 
GATTATCAAT 
GAGGAAGCAG 
TGTGGAAACC 
AGAATTTAAA 
AGAGTGCTGT 
TCCTTCATGT 
ACTTACCTTG 
CACTGACACA 
GCAAATAATT 
TCTTTGTTAA 



TCCACAGTGC 
ATGGGCAGGA 
TCCATGTTGG 
GACCAGGGAT 
ACAATAAGAA 
CAGTTTACCG 
CAACCTTTTC 
GCCTCAGCTT 
GGCAAATATG 
TACCACACCA 
CCAAGTCGGT 
TCCTTCCAGG 
CCTATACTCA 
CCGGTGGAGA 
AACTTCCCTG 
CATCTCACAG 
CACGTTATAA 
TTTCACTTCA 
AATAGAATGG 



1          11         21         31         41         51
|          |          |          |          |          |

MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QPAPSPSCEK
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR
QKKTT 

Seq ID NO: 404 DNA sequence 

Nucleic Acid Accession #: NM_006670 

Coding sequence: 85..1347 



CCACTGGCCG 
AGTCCCCTTG 
CGGAGCTTCG 
CGCCGCTTGG 
CTGCCCAGCC 
TCCTTCCGCA 
CTTCACAATG 
AACAATCCCT 
GAGGTAGTGC 
GTCCTCTTGG 
CAAACCTCTT 
GTTTTGTATT 
AGGGATCACA 
AACCTCAGTT 
CATGAGATGT 
TAGATACAAC 
TTTCTCGGTG 
TTCTTTTTCT 
TGGGCTTCTT 
ACAGATAGCA 
TATCAGTTTT 
CTGCAGACGT 
AGAGCATGCT 
TTCTTTGACA 
TTTTAATAAA 
ATTCTTAAAA 



11 
I 

CCCTCCGGGC 
AAACGCGAGC 
TGCGGCTGGC 
CCTCGGCATC 
CCCCGCTGCC 
AGTGCGTTAA 
TCTTCCTTAC 
CGCTGGCGGA 
GCGCCTTCGA 
ACCTCAGTCC 
TGGAACTGAT 
AGGGCATGGT 
AGCTGGCCAG 
TCAGGCACCT 
ACCTGACACA 
GCACCCTGGC 
GGGTCTGCGA 
AGGGCAAAGA 
AACTCAACAG 
ATGTCTTCCT 
TGAACCGCAA 
TGGAAGGGTA 
CTAACTCGGA 
AGACTTAAGC 
GGACTTTGAC 
TGTTCTGTTA 
TGGAACTCCT 
GCTGTCTGTC 
TTCAACAAAA 
ATTCTCATGT 
TAGCAGGCTC 
TACATTTTAC 
AAGTAAATTA 
CTGCATCGAG 
GAA 



21 
I 

CCAGCCTCCC 
CGCGATGCCT 
GCGACTAGCG 
CTCCTTCTCC 
GGACCAGTGC 
CCGCAATCTG 
CGGCAACCAG 
GCTGGCCGCG 
GCATCTGCCC 
CTTCGCTTTC 
CCTGAACCAC 
GGTGGCGGCC 
CAACCACTTC 
GGACTTAAGT 
TCTAGAAAGC 
TGAGTTGCAA 
CTGCCACATG 
CCGGCTCACC 
TGCTGACCTG 
GGGTATTGTT 
GGGGATAAAA 
TCATTACAGA 
TGTCTGAGAA 
TTTATCCCTA 
TAAAAGCAGT 
ATGTAAGACG 
CAACACGTAT 
TCTCTCTCAG 
GCTGCCTCAA 
ACCTAAGTTG 
TTCAAAATAA 
TGTTCTGCAT 
CTTTTTTGAT 
ATCCAACCGA 



Seq ID NO: 405 Protein sequence 
Protein Accession #: NP_006661 



1 

I 

MPGGCSRGPA 
QCPALCECSE 
AALNLSGSRL 
NHIVPPEDER 
LSNNSLVSLT 
HMADMVTWLK 
IVLALIGAIF 



11 
I 

AGDGRLRLAR 
AARTVKCVNR 
DEVRAGAFEH 
QNRSFEGMVV 
YVSFRNLTHL 
ETEVVQGKDR 
LLVLYLNRKG 



21 

I 

LALVLLGWVS 
NLTEVPTDLP 
LPSLRQLDLS 
AALLAGRALQ 
ESLHLEDNAL 
LTCAYPEKMR 
IKKWMHNIRD 



31 

I 

GAGCCTTCGG 
GGGGGGTGCT 
CTGGTACTCC 
TCCTCGGCGC 
CCCGCGCTGT 
ACCGAGGTGC 
CTGGCCGTGC 
CTCAACCTCA 
AGCCTGCGCC 
TCGGGCAGCA 
ATCGTGCCCC 
CTGCTGGCGG 
CTTTACCTGC 
AATAATTCGC 
CTCCACCTGG 
GGTCTACCCC 
GCAGACATGG 
TGTGCATATC 
GACTGTGACC 
TTAGCCCTGA 
AAGTGGATGC 
TATGAAATCA 
ATATTAGAGG 
CTAGGCTTGC 
GAAGGGGATT 
ATGAACAGTT 
GGAGGGATTT 
TACAGTTCAA 
CTTTTTCGAG 
TGGAGAAAAT 
CTCCATGGTG 
ATTACAAAAA 
TGCAGTTTAT 
CTGAATTGTT 



31 

I 

SSSPTSSASS 
AYVRNLFLTG 
HNPLADLSPF 
GLRRLELASN 
KVLHNGTLAE 
NRVLLELNSA 
ACRDHMEGYH 



41 
I 

AGCGGGOGCC 
CCCGGGGCCC 
TGGGCTGGGT 
CGTTCCTGGC 
GCGAGTGCTC 
CCACGGACCT 
TCCCTGCCGG 
GCGGCAGCCG 
AGCTCGACCT 
ATGCCAGCGT 
CTGAAGATGA 
GCCGTGCACT 
CGCGGGATGT 
TGGTGAGCCT 
AGGACAATGC 
ACATTAGGGT 
TGACCTGGCT 
CGGAAAAAAT 
CGATTCTTCC 
TAGGCGCTAT 
ATAACATCAG 
ATGCGGACCC 
ACAGACCAAG 
TCCACTTTCA 
TGCTTCCTTG 
GTGTATAGTG 
TTCAGGTTTC 
GGTGTAGCAA 
AAAAATACTT 
AATTGCATCC 
CACAGGAGCA 
ATAACTTGCA 
ATGAAAATGT 
AAAAAAAAAA 



51 

I 

GTCCCAGCCC 
CGCCGCCGGG 
CTCCTCGTCT 
TTCCGCCGTG 
CGAGGCAGCG 
GCCCGCCTAC 
CGCCTTCGCC 
CCTGGACGAG 
CAGCCACAAC 
CTCGGCCCCC 
GCGGCAGAAC 
GCAGGGGCTC 
GCTGGCCCAA 
GACCTACGTG 
CCTCAAGGTC 
TTTCCTGGAC 
CAAGGAAACA 
GAGGAATCGG 
CCCATCCCTG 
TTTCCTCCTG 
AGATGCCTGC 
CAGATTAACA 
GACAACTCTG 
TCCTCCACTA 
TTATGTAAAG 
TTTTACCCTC 
AGCATGAACA 
GTGTACCCAC 
TATTCATAAA 
TATAAACTGC 
CCTGCATCCA 
ACTTCATAAC 
ACTGATTTTT 
AAAAATAAAG 



41 
I 

FSSSAPFLAS 
NQLAVLPAGA 
APSGSNASVS 
HFLYLPRDVL 
LQGLPHIRVP 
DLDCDPILPP 
YRYEINADPR 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



51 

1 

AVSAQPPLPD 
FARRPPLAEL 
APSPLVELIL 
AQLPSLRHLD 
LDNNPWVCDC 
SLQTSYVFLG 
LTNLSSNSDV 



60 
120 
180 
240 
300 
360 



Seq ID NO: 406 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..927

1          11         21         31         41         51
|          |          |          |          |          |
ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900
AGTTCTAACT CGGATGTCCT CGAGTGA

Seq ID NO: 407 Protein sequence
Protein Accession #: Eos sequence

1          11         21         31         41         51
|          |          |          |          |          |
MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240
SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300
SSNSDVLE

Seq ID NO: 408 DNA sequence
Nucleic Acid Accession #: NM_000095.1
Coding sequence: 26..2299

1          11         21         31         41         51
|          |          |          |          |          |
CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60
CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120
GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420
CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720
CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGOG ACACTGACCT 840
AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900
CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020
GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080
GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260
GAAGAGGAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320
CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500
GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560
GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620
GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800
GGATGACGAC TATGGGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CGCTTCCGTG CTGTGGCCGA 1920
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040
AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100
GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCC3CC GGATGACAGC CACCCTCACC 2340
GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG



Seq ID NO: 409 Protein sequence
Protein Accession #: NP_000086.1

1          11         21         31         41         51
|          |          |          |          |          |
MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN VVLDTTMRGG 720
RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA

Seq ID NO: 410 DNA sequence
Nucleic Acid Accession #: NM_001565.1
Coding sequence: 67..363

1          11         21         31         41         51
|          |          |          |          |          |
GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120
ATTCAAGGAG TACCTCTCTC TAGAACOGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840
CATACAATTC CAAACACATA CAGGAAGG7A GAAATATCTG AAAATGTATG TGTAAGTATT 900
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG

Seq ID NO: 411 Protein sequence
Protein Accession #: NP_001556.1

1          11         21         31         41         51
|          |          |          |          |          |
MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP

Seq ID NO: 412 DNA sequence
Nucleic Acid Accession #: XM_057014
Coding sequence: 143..874

1          11         21         31         41         51
|          |          |          |          |          |
GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260
CAACCTTAAA AAAAAAAAAA AAAA

Seq ID NO: 413 Protein sequence
Protein Accession #: XP_057014

1          11         21         31         41         51
|          |          |          |          |          |
MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240
LPK




Seq ID NO: 414 DNA sequence 
Nucleic Acid Accession #: XM_084007 
Coding sequence: 138..2405 






CTCGTGCCGA 
CCAGTGGGCC 
GCGGAGACGA 
TCTCTGTCAC 
AAATTAGTCC 
ATCATCTACA 
TCAGAAAATT 
ACCACGACCA 
AGCATCACTC 
CTGCTTCTGG 
GTAAAGATCC 
GAAGOAATGT 
TCTCTGAAGG 
CCAAAGATGT 
TGGCTGGTAG 
GAAACACAAA 
GCATGGGCAT 
TCAACCAAAT 
CTCCAAAGAC 
TCAGTTTCCT 
AATTTCTCCT 
TACACCTTCT 
CAATGGAAAT 
GTGCCTATTT 
TTCTTGTTGA 
AGAAGAAACC 
CTCAACTTTC 
GAGCAGACTC 
AAGAGGTCAT 
GGTGCAAGAA 
TTCACCACCA 
CTCACAGTCA 
TGGCCTGGAT 
GTGCTGCTTT 
ATGAGTTGCC 
AGCAGGCTGT 
GAATTTTCAT 
GCTTATTCAT 
GTGACCATGG 
GTTTTGGAAT 
TCTAGTTAAG 
AGGGAGATGA 
TTGTATTGAA 
TATTCTATCT 
TAAACAAGAG 
TTTTCAAGAA 
TGTTTAGGAA 
AGCAAAGAAA 
AAAAATCACA 
CAGAATTAGT 
GTAGTGAGCA 
AAATATATTT 
TTCGTGCGGG 
TATTGCCAAG 
CAAAATTATC 
TCATTTGATT 
GAGCAATTGT 
GATGTTTCTT 



11 
I 

ATTCGGCACG 
CGTGTGGAAC 
AGGCGCAATG 
AAATCCCCTT 
GAATTGGGAA 
ACAGCTTTTC 
ACTTCAAAAT 
TCACTCAGAC 
AGACCACGAG 
TAAAAATAAG 
TAGAAACAGC 
CAAGGACAGT 
AACTCACTTT 
AAGCAGCTCC 
GAAAACAAAT 
TGAAAATCCT 
CCAGGTTCCG 
TGATGCTAGA 
CTATTCATTA 
GTCTCTGCTG 
GAGTTTCCTT 
TCCACATTCT 
GAAAAGAGGA 
TGATTCCACG 
ACATGTCCTC 
TGAAAATGAT 
AACAAATGAG 
ACAAGAGCCC 
GATAGCTCAT 
TAAATGCCAT 
TCATGACTAC 
CAGCCAGCGC 
GGTGATAATG 
TACTGAAGGC 
TCATGAATTA 
CCTTTATAAT 
TGGTCATTAT 
GTATGTTGCT 
ATGTAGCCGC 
TATGTTACTT 
GTTTAAATGC 
GTTTGTATGC 
TATTGCTGTC 
TGGAGATAAA 
ATTTGGCATG 
CTAACACAGT 
TAAGAATGTG 
TAAAGGAGAA 
AAATTTGTTG 
ATAGAGTACA 
CTCTCATATA 
AATGAATTCA 
TTATATACCA 
TTATATATCA 
AGAGTAGTAA 
CGATTCAGAA 
CTTTATATAC 
TTTTACACAA 



21 
1 

AGACCGCGTG 
CAAACCTGCG 
GCGAGGAAGT 
CATGAACTAA 
TCTGGCATTA 
TACCGCTATG 
ATAGGCATAG 
CACGAGCATC 
CATCACTCTG 
CGAAAAGCrC 
CAGGGGAAAG 
GTTAGTGCTA 
CTAGAGACAA 
ACTCCACCCA 
GAATCTGTGA 
CAGGAGTGTT 
CTGAATGCAA 
TCTTGTCTGA 
CAAATAGCCT 
GGGGTTATCT 
GTGGCACTGG 
CATGCAAGTC 
CCACTTTTCA 
TGGAAGGGTC 
ACATTGATCA 
GATGATGTGG 
GAGAAAGTAG 
TCCCACTTTG 
GCTCATCCAC 
TCACATTTCC 
CATCATATTC 
TACTCTCGGG 
GGTGATGGCC 
TTATCAAGTG 
GGTGACTTTG 
GCATTGTCAG 
GCTGAAAATG 
CTGGTTGATA 
TGGGGGTATT 
ATTTCCATAT 
TAGAGTAGCT 
TGTACTATGC 
TGTTACAAAG 
ATCTGTATGT 
ACATGTTCTG 
TATTCCTATA 
CATGAAGCCT 
AAGAGAAGAA 
TAAATTAGAG 
TTCATTAAAC 
CTAATTAGTG 
AGCAATATAC 
GATGAGTACA 
CCAAAAGCTG 
AACTTTGATA 
AGTACTTTGA 
GGTACTGTAG 
TAAATTCCTT 



31 
I 

TTCGCGCCTG 
OGCGTGGCCG 
TATCTGTAAT 
AAGCAGCTGC 
ATGTTGACTT 
GAGAAAATAA 
ATAAGATTAA 
ACTCAGACCA 
ACCATGATCA 
TTTGCCCAGA 
GAGCTCACCG 
GTGAAGTGAC 
TAGAGACTCC 
GTGTCACATC 
GTGAGCCCOG 
TCAATGCATC 
CAGAGTTCAA 
TTCATACAAG 
GGGTTGGTGG 
TAGTGCCTCT 
CCGTTGGGAC 
ACCACCATAG 
GTCATCTGTC 
TAACAGCTCT 
AACAATTTAA 
AGATTAAGAA 
ATACAGATGA 
ATTCTCAGCA 
AGGAAGTCTA 
ACGATACACT 
TCCATCATCA 
AGGAGCTGAA 
TGCACAATTT 
GTTTAAGTAC 
CTGTTCTACT 
CCATGCTGGC 
TTTCTATGTG 
TGGTACCTGA 
TCTTTTTACA 
TTGAACATAA 
TAAAAAGTTG 
AGCGTTTAAA 
TCAGTTAAAG 
GCAATTCACC 
TATGTTTCAG 
CTGGATTTTA 
AAAATACCAA 
TCTGAGAATT 
GGGAGAAATT 
ATTTTTGTCA 
TACATTTAAC 
ACTTGACCAA 
GTGAGTAGTT 
TATGACTGGA 
TATATGAGGA 
TATCTCTCAG 
CCATACTAGG 
ATATCAGCTT 



41 

I 

GTAGAGATTT 
GGCCGTGGGA 
CTTGATCCTG 
TTTCCCCCAG 
GGCAATTTCC 
TTCTTTGTCA 
AAGAATCCAT 
TGAGCGTCAC 
TCACTCCCAC 
CCATGACTCA 
ACCAGAACAT 
CTCAACTGTG 
AAGACCTGGA 
AAAGAGCCGG 
AAAAGGCTTT 
AAAGCTACTG 
CTATCTCTGT 
TGAAAAGAAG 
TTTTATAGCC 
CATGAATCGG 
TTTGAGTGGT 
TCATAGCCAT 
TTCTCAAAAC 
AGGAGGCCTG 
AGATAAGAAG 
GCAGTTGTCC 
TCGAACTGAA 
GCCTGCAGTC 
CAATGAATAT 
CGGCCAGTCA 
CCACCACCAA 
AGATGCCGGC 
CAGCGATGGC 
TTCTGTTGCT 
AAAGGCTGGC 
GTATCTTGGA 
GATATTTGCA 
AATGCTGCAC 
GAATGCTGGG 
AATCGTGTTT 
TCATAGTTTC 
GTTAGTGGGT 
GTACGTTTTA 
GGTATTACCA 
GGAAAAATGT 
GGTCTCTGAA 
GAAAGCTTAT 
GGGGAGGCAT 
TAGAATTAAG 
GGATTATTTC 
TTTGTATAAT 
GAAATTGGAA 
TATGTATCAC 
TGTTCTGGTT 
TATTAAAACT 
TGCTTCAGTG 
CCTGTCTGTG 
G 



51 
I 




1 

CTCGAAGACA 


60 


CAACGAGGCC 


120 


ACCTTTGCCC 


180 


ACCACTGAGA 


240 


ACACGGCAAT 


300 


GTTGAAGGGT 


360 


ATACACCATG 


420 


TCAGACCATG 


480 


CATAATCATG 


540 


GATAGTTCAG 


600 


GCCAGTGGTA 


660 


TACAACACTG 


720 


AAACTCTTCC 


780 


GTGAGCCGGC 


840 


ATGTATTCCA 


900 


ACATCTCATG 


960 


CCAGCCATCA 


1020 


GCTGAAATCC 


1080 


ATTTCCATCA 


1140 


GTGTTTTTCA 


1200 


Cj A *T*GOTT TTT 


1260 


GAAGAACCAG 


1320 


ATAGAAGAAA 


1380 




1440 


AAAAAGAATC 


1500 


AAGTATGAAT 


1560 


VVV* X r\ X i. 4. rVVv 


1620 


TTGGAAGAAG 


1680 


GTACCCAGAG 


1740 




1800 


AACCACCATC 


1860 


GTCGCCACTT 


1920 


CTAGCAATTG 


1980 


UlOi A w A w£ w 


2040 


ATf^APfYSTTA 


2100 


ATGGCAACAG 


2160 


PTT A CFfSfTCJ 

1 4»/%W£VwX\7 


2220 


AATGATGCTA 


2280 


ATflfTTTTfiG 

IX A. v?v 1X11 wVJ 


2340 


rY5TATAAATT 
v^uininnni x 


2400 


AGTAGGTCAT 


2460 


TTTGTGATTT 


2520 


ATATTTAAGT 


2580 


GTTTATTATG 


2640 


CXTTAATGCT 


2700 


GAACTGCTGG 


2760 


ACTGAATTTA 


2820 


AGATTCTTAT 


2880 


TATAAAAAGG 


2940 


CCGTAAAAAC 


3000 


ACAGAAATCT 


3060 


TTTCAAAATG 


3120 


CAGACTGGGT 


3180 


ACCTGGTTTA 


3240 


ACACTAAGTA 


3300 


CTATCATTGT 


3360 


GCATTCTCTA 


3420 



Seq ID NO: 415 Protein sequence
Protein Accession #: XP_084007

1          11         21         31         41         51
|          |          |          |          |          |
MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF









Seq ID NO: 416 DNA sequence 

Nucleic Acid Accession #: NM_015419.1 

Coding sequence: 1..8487 






1 11 21 31 41 51 

| | | | | | 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCOGTGGTGC TGATCCTGCT TTGGGGCCAT 60 

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 120 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCOGATGG AGCTTTAAGA 300 

GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 360 

CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGATCGAG 420 

TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 

AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 540 

AGACTCTCCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660 

TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 720 

TGTAAAAAGG ACAAAGCTTA TGAAGGOGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780 

AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 900 

GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 1020 

GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAO ATATTGACAT AAATGCAACA 1080 

GTTGCCTTGG ACTTTGAGTG TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 1200 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA CTACACAGGT 1260 

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 1320 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 1380 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG 1440 

CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 1500 

TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 1560 

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 1620 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680 

GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 1800 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 1920 

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 1980 

CATTTTACGG TGGGAATCAC ACTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 2280 

GGTCGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCOG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 2460 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 2580 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2880 

GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3000 

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3060 

TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 

GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 

AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 

CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 

AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 

CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 

GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 5400 

ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 

TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 

CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 

CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 

CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 

AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATAOGGAA GGTTCAAGTA 5760 

CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC AOGGCCTGGA CAGGATGGTG 5820 

GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 

ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 

CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 

GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGOGG ACAGCCTGGC CATCCGCCTG 6120 

CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 

CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240 

TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 

GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360 

GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 

CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGAOGGA CGTCAGGTAC 6480 

GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 

AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 

TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 

AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 

CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 

GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 6900 

TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 

GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 

GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 

GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 

TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 

GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGOG GCAACTACAC CTGCCTGGTC 7260 

AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 

AAGATCAACG GTAACCCCAA CCCCATCACC ACOGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 

CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 

CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 

GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACOCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 

GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

GGCCCCCAAA CCCTGGGAOG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA CGGCCCTTOG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TGAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

TTTCTTCTTT TGCAAATGCC ACTCGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 8940 

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 

Seq ID NO: 417 Protein sequence
Protein Accession #: NP_056234.1

1 11 21 31 41 51

| | | | | |

MPKRAHWGAL SWLILLWGH PRVAIACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60 

NLGFNSIQAL SETSPAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVPKF SYNKLRVITG 120

QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTPTFLDYP 180 

RLSTIRHLYL AENMVRTLPA SMLRNMPIiLE KLYLQGNPWT CDCEMRWFLE WDAKSRGILK 240 
CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI
DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM 
VALDFECPMT RENYEKLWKL IAYYSEVPVK LHRELMLSKD 
VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS 
PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI 
RIKSKEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA 
ALAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK 
HPTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG 
EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE 
ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLBVTPP 
SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT 
TBGDLK6TAA PTLISEPYEP SPTLHTLDTV YEKPTKEETA 
EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTPAH 
STIGEPGVPG QSHLQGIiTDN IHLVKSSLST QDTIiLIKKGM 
RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL 
TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETP 
VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK 
PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR 
SDGKEIKDDV ATNVDKKKSD ILVTGESITN AIPTSRSLVS 
NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS 
TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN 
MSLGQTTTTK PALPSPRISQ ASRDSKENVF LNYVGNPETE 
PSSDRDAFNL STKLELEKQV PGSRSLPRGP DSQRQDGRVH 
LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK 
PSIPSKPTDR RTDQFNGYSK VFGNNNIPEA RNPVGKPPSP 
PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL 
TNLQNIPMVS STQSSISPIT SSVQSSGSPH QSSSKPPAGG 
QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT 
QDRGQYMCTA SNIiHGLDRMV VLLSVTVQQP QILASHYQDV 
QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR 
HVAALPPVIH QEKLENISLP PGLSZHIKCT AKAAPLPSVR 
VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ 
GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV 
CVARNKVGDD YWLKVDWM KPAKIEHKEE NDHKVFYGGD 
DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNEVGMREEG 
WTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW 
GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP 
RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGNRITVHGN 
RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI 
GTDLQSGQQL QRFYHKADGM LHISGLSSVD AGAYRCVARN 
KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE 
ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT 
PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA 
KTTYIHVF 

Seq ID NO: 418 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..5001 
DVYKIHLNQT
PRVSYQYRQD 
QTISTKDTRQ 
LKAPMDDPDS 
BKDTVTIGKN 
VQVSDSGYYR 
GSGMGDEENT 
GRRVFESRRR 
FPAVSPPSAS 
STPLEEWDD 
TEGWSAADVG 
LTPTPTIWVN 
KEMSQTLQGG 
LDKDTTTVTT 
STQPTQAPDI 
HRYTPSTVSS 
KIYSSYPKVQ 
TMGEFKEESS 
EFLSSLTVST 
HTPTAARMKE 
ATPVNNEGTQ 
ASHQLTRVPA 
QFTTPRIiSST 
RIPHYSNGRL 
DFGPPAPPLL 
PPASKFWSLG 
RIQRFEVLKN 
TVYLGDTIAM 
GVYKCVASNA 
WVLGDGTQIR 
RAAANARITG 
PANGTLWKS 
LKVDCVATGL 
DYTCFAENQV 
LSPTNKVIPT 
KINGNPNPIT 
GSLDIRSLRK 
SLNCSAAGTP 
AAGHTERLVS 
GPQTLGRVSL 
PVIYTRPGNT 
TQRDAGFYKC 



DPPDIDINAT 
ADEEALYYTG 
ARGRSWVMIE 
KFSIbSSGWL 
PGESVTLPCN 
CVAVNQQGAD 
SRRLLHPKDQ 
INMANKQINP 
PVQTVTSAEE 
LSEKTEEITS 
SSPEPTSSEY 
DSSTSQLFED 
NMLEGDPTHS 
TPRQKVAPSS 
KIS%QVESSL 
RASGSKPSPS 
ETLPVTYKPT 
PVGFPOTPTW 
PFHQEEAGSS 
PASSSPSTIL 
HMSGPNELST 
KPILPTATVR 
TIPLPLHMSK 
PFPTNKTLSP 
HTPQTTGSPS 
EKPQILTKSP 
GTIiVIRKVQV 
ECLAKGTPAP 
AGADSLAIRL 
PSQFLHGNLF 
TSPRRTDVRY 
VTDKDAGDYL 
PNPEISWSLP 
GKDEMRVRVK 
SSEKYQIYQD 
TVREIAAGGS 
SDSVQLVCMA 
TPSLVWVLPN 
LKVGLKPEAN 
LDNGTLTVRE 
VKLNCMAMGI 
MAKNILGSDS 



300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 

2580

2640 
2700 
2760 
2820 



1 
I 

ATGCCAGGCA 
ACCTCTCAAG 
CAGTCTGTGC 
TCAAGACAGT 
CAGATCGCTA 
GCAGTCCGTA 
ACACCAGAAT 
AAACCTACAG 
GTCTGTCTGC 
TTTCAGAATA 
CCTATCCTGG 
TTTTCAAAAT 
TCGCTATGCC 
ATCGATATCC 
CCATGTTTTC 
ATGTGCTATG 
GCCAGTAAGG 
CCTTCCTCAC 
CCCCAAGGGA 
6GTGGGGCGC 
TCGACAGAAA 
TCAGACACCC 
GTTGCTCCCG 
GTAGATAAGC 
TCGGCCTCTC 
GCCAGCTTGA 
CTCCACCCCA 
CCGTCCAGCG 
GCGCGGAGGA 
TCAAGACTTT 
AGCTCTCCAC 
GCGCCATCAC 
GTCTCTGAGG 
GGCGGAAGGC 
TTCCATTTGC 
ATTGGGCGGG 



11 
I 

CAAAACTAAC 
AGGACGAATT 
TTGTGTCCTG 
ACACCGTGCG 
ACAGGCGTGT 
TTTCACAGGG 
CTGCCCCTAC 
TTGTCGCTGC 
TGGACACAGG 
CATTCTTTCA 
AGACACTACT 
CCGGGCCACA 
AACAAGAATG 
AAACCAAACA 
TTTTCTACTT 
AAGACCCANN 
CGGATGTTCA 
CTTCTCCCAG 
GAAATGCCAA 
CCCGAAAACC 
TCACTGGGGA 
AAGACCAGAA 
GCAGGACTGC 
CTGGCTTTTC 
CTGCCCACCA 
ATGACAACGA 
AGGGCGCCTT 
TTCTCCGCGA 
CCCCCCATTC 
CTCCACCCCA 
TTTCCAAGGG 
GGTCCACCAT 
GAGCGGAGGC 
AGGCGGAGGC 
TCAGACACAA 
GACCTCGGCT 



21 
I 

CCGAACAGGC 
GGATGTACCT 
GGTGGATCCT 
CTATCGAGAG 
GCTGATTGAG 
TGAAAGAGAT 
CACAGCTCCT 
ATCTTGGGAT 
ACTGTTTTCA 
TACGCCCCGG 
TCTGCCCTGG 
AACAGGAGAG 
CAGCTGTACC 
AGTTAATAAA 
CCTCACATTT 
TGTTTCTTCT 
GCAGAACACG 
AGCTCCAGCT 
GGACCTTCTT 
CCAGCTTCGC 
GGAGGAGCTG 
ACGGACCCTG 
AGTGAGGGCC 
CCTGGCCACG 
CGCGTCCACC 
CTTGGTGGAC 
CGCCCAGCCC 
CAGAAGCTCT 
AGGGGCCGCA 
TGGGGGATCA 
CGGGAAGGAT 
GTCCTCCTCC 
TTCTGATGGT 
CACGGCCCAG 
ACCCTTTGCT 
GCAGCCCTCC 



31 

I

GCCCCAGCAG 
GACGACATCA 
GTTCTGGAAA 
AAGGGGGAAT 
AACCTGATTC 
GGCAAATGGA 
GAAAACTTGA 
GCGCTACCAG 
GTTTCCTCCT 
CTCTCAAACC 
TGGATGGTCT 
GCCTGGGACC 
CAGAAGGATT 
GATCCACAAC 
ATGCTGGATA 
TTGACAGGCA 
GAGGACAATG 
TCCTCCCAAC 
CTTGACTTGA 
GCCAAGAAGG 
GGTTCCCGGG 
AGGCCGCCAA 
CGGATGCCAG 
CAGCCCCGCC 
CAGGGCACCT 
TCAGACGAAG 
CGGCCAGCCC 
GTGCACCCCG 
GAGGAAGATT 
TCTCGGCTGC 
GGTGAGGACG 
GTCTCTTCTC 
GAAAGCCACG 
ACGCTGCGGG 
GCCAACGGGA 
AGCTCCCCAC 



41 

I 

ACTACAGAGT 
GCGTCCGGGT 
AACAGAAGAA 
TGGCCAGGTG 
CAGACACTGT 
GTACGTCAGT 
ACGTCTGGCC 
AGACTGAGGG 
TCCAACCATC 
ATTTGGAGCA 
GCAGCCTGGG 
TCACTCCAAA 
TCAGTTGTCT 
TGGAAGGGAG 
TTGGCGGCTT 
ATTCTTTAAA 
GGAAACCCGA 
ACCCCTCTGT 
AGAACAAAAT 
CAGAGGAGCT 
AGGACTCGCC 
GTAGACACGG 
CGCTGCCCCG 
CAGGGGCGCC 
CTCATCGTCC 
ATGAGCGCGC 
TGTCCCCCAG 
GCGCAAAGCC 
CCAGTGCCTC 
TGCCCACCCA 
CCCCAGCCAC 
ATCTCTCGTC 
GTGACGGCGA 
CCCGGCCTGC 
GGTCTCCAAG 
AGTCGACTGT 



51 
I 

GATATTGAAG 
TATGTCATCT 
AGTTGTTGCA 
GGATTATAAG 
GTATGAATTT 
CTTCCAAAGA 
AGTCAATGGC 
GAAAGTGAAA 
TGCCAAATCA 
AAGTCCCTCA 
GAACGCTATC 
GCCTTCATTG 
AGCTTACCTG 
TGTTTTTGGA 
TTCCTTCATT 
ATCTGTTGCA 
AAAACCTGAG 

ATTGGCTAAT 
GGATCTTCAG 
CATGTCACCC 
CCACTCGGTG 
AAGGGAAGGC 
CCCCTCGGCT 
TTCCCTGCCT 
TGTGGGCTCC 
CCGCCAGTCC 
AGCCTCGCCG 
AGCCCCACCC 
GCCACACCTG 
CAACTCCAAT 
CAGGACGCAG 
TAGGGAAGAC 
CTCTGGACAC 
CAGGTTCAGC 
GCCCTCCCGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA
GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG 
TCCTCCAGGC AGCCCATCTC CCGGQQCTGG GAGGACTTAA 
GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA 
CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC 
GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG 
TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC 
ACACCCGGCC GGOCCCCAGA ACAGCAGCCC CCTCCTCCCG 
CCGGGACCCC AGAGCAGAGA OGCGGGTCGG TCACCTTCCC 
CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC 
ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA 
GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG 
GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC 
ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG 
AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG 
GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA 
GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG 
TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA 
GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC 
CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA 
GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA 
CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT 
TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC 
AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG 
GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG 
GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG 
CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC 
GGGACCCCCG TOGTGAGTCC TGACGGCCTC CCACTCTTTG 
CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG 
TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA 
ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA 
ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC 
CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA 
ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC 
GACGATGATG GCAACCTGAT AATGAGCTCC AATG6GATCC 
GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA 
TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA 
ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA 
GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT 
GACCCATCAG CCCCGTOCTC TCTGACTGAT GCACTGGATC 
GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC 
AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG 
GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA 
ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC 
AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA 
ATCAGCCCTT OGGTCTCATT TGTCACCGAA TCAGATAATC 
CCAGGCGGTG AGCTATCTGG ATCCCATTOG CTTTCAAACA 
GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA 
GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT 
TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT 
GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC 
AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT 
TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG 
CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA 
CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA 
CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT 
TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG 
CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT 
AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC 
TGCTACA7GC TTTCTGTTTT TCTCATTTTO GATTTCTCCA 
TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA 
ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC 
TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA 
TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT 
AAAAAAA 

Seq ID NO: 419 Protein sequence
Protein Accession #: Eos sequence 



PCT/US02/12476 



MPGTKLTRTG 
SRQYTVRYRB 
TPESAPTTAP 
FQNTFFHTPR 
SLCQQBCSCT 
MCYEDPVSSL 
QGRNAKDLLL 
DTQDQKRTLR 
ASPAHHASTQ 
SSVLRORSSV 
SPLSKGGKDG 
GRQAEATAQT 
HPRVPSHSDS 
SLHRKEPIPE 
PARPPAARSQ 
AGRPRPTSQG 



11 
I 

APADYRVILK 
KGELARWDYK 
ENLNVWPVNG 
LSNHLEQSPS 
QKDPSCLAYL 
TGNSLKSVAA 
DLKNKIIANG 
PPSRHGHSW 
GTSHRPSIiPA 
HPGAKPASPA 
EDAPATNSNA 
LRARPASGHF 



NPKSTGADTH 
QHPSVPRRMT 



21 

I

TSQEDELDVP 
QIANRRVLIE 
KPTWAASWD 
PILETLLLPW 
IDIQTKQVNK 
SKADVQQNTE 
GAPRKPQLRA 
APGRTAVRAR 
SLNDNDLVDS 
RRTPHSGAAE 
PSRSTMSSSV 
HLLRHKPFAA 
DEEDEKPLPA 
PQGKYSSLAS 
PGRAPEQQPP 
ASSRGMLPTA 



31 



DDISVRVMSS 
NLIPDTVYEP 
ALPETEGKVK 
WMVCSLGNAI 
DPQLEGSVFG 
DNGKPEKPEP 
KKAEELDXjQS 
MPALPRRBGV 
DEDERAVGSL 
EDSSASAPPS 
SSHLSSRTQV 
NGR5PSRPSI 
TWNDHVPSS 
KAQDVQQSTD 
PPVATSQHHP 
LQNQDEDAQG 



AGCTTAGCTC 
TCAATGACCA 
GGAGAAGCCC 
AATCCACAGG 
AGGATGTTCA 
GGTCCACAGA 
CCAGTGTTCC 
TCGCCACGTC 
AGCCCAGGCT 
ACTCCTCCTC 
ACCAGGACGA 
ATGTGCGGGC 
ACCAGCAGGT 
GACATGCGGC 
TCCCCAGCAG 
AGTCCCAGCA 
GCGGGAAAGA 
GGGGCGGCAA 
TTGCCCCTCG 
GCTCCCCCAG 
GCCCCGTCGC 
TCTCCACCAC 
CTCTCTCCCG 
TAGAAGGGAA 
GCCCTCAAGG 
GAAGGTACCT 
GAACCATTGT 
GGCAGGGGCG 
GAGGAAAGCC 
CCATGCAGCC 
CTGCCACCAC 
CCACCACTGC 
CTACGCGGAC 
CTGGGACCTT 
CAGAGTGCTA 
CGGAAGAGGC 
CCACCACTGA 
TCAGTTCCTT 
ACGTGACGTA 
ACTTCCAAGT 
CTCCCCAGCA 
TCATTGTGGA 
GTGCATCCTA 
ACTTGCCCAT 
ATCCTCATGG 
CTCTOCTTOT 
TGATCCCAGC 
GTTCGTGGGA 
GAAAGATACA 
TGTGGATTCA 
TATTCAAGGC 
CGGAACCCCC 
GTAATCACAG 
CTAGGGGCTG 
GGTCACAGAT 
GGCCTGGACA 
TTTGTTTGTA 
CCCAGACTTT 
GCTGTTCTTG 
AAACTAACTG 
CACCACCAAA 
TAGAATAAAT 
TGCATATACA 
TGTTCACCAA 



41 
I 

QSVLVSWVDP 
AVRISQGERD 
VCLLDTGLPS 
PSKSGPQTGE 
PCPLPYPLTF 
SSPSPRAPAS 
TEITGEEELG 
DKPGPSLATQ 
HPKGAFAQPR 
RLSPPHGGSS 



AGGTATCCAT 
CGTGCCTTCC 
GCAGAGAGGG 
GGCAGATACA 
ACAGAGCACA 
CCGCCACGCG 
CAGAAGGATG 
CCAGCACCAC 
CTCACTGACC 
GGACCCTTAC 
GGATGCCCAG 
CCCOGCGCAC 
GGAGTCTCCC 
CTCCCCCGCC 
GGCAGCGCCG 
GTCGGTCTCA 
AGACCTTCTG 
AGACGCCGAT 
CGGAGGGAGC 
GGCCTCCCAC 
GGGCACCCAC 
CCCGATGCTG 
ACAGCCTGCC 
AGTCCTTCCT 
AACAAAGTGG 
CCAAGATTCA 
AGATCTGGAA 
ACATGGCACA 
GCTGGTQGGC 
CACCACTACT 
CATGCAGCCC 
CACCACCCGC 
AACCACCACC 
GGAACGGCAC 
CGCTGAAGAA 
CTACGTTATA 
GCCTTCGACC 
TCCTGAAGAA 
CCTAAATAAA 
GGACAGCCTG 
TGCTCCCCGC 
TTGGGACAAA 
TGAAGATTTC 
TGAGAACCTA 
CTACGGACCT 
TGTGAGGCCC 
TACACGGACT 
GTTGTTCTTT 
TTCTACAGCA 
CACCTTGATG 
TACTATCGCC 
TACTACTATG 
GACCGTCATG 
TGAGCAAAGA 
GGACACTGGC 
ATGAACAGGA 
ATAGCACATC 
TTAGGCATGA 
CTTCATGGAA 
AATTTAAGCT 
GAAAATAAAT 
GTATCAAATC 
TATCCACACT 
AAAAAAAAAA 



51
I 

VLEKQKKWA 
GKWSTSVFQR 
VSSFQPSAKS 
AHDLTPKPSL 
MLDIGGPSFI 
SQHPSVPASP 



2220 
2280 
2340 
2400 
2460 
2520 
2580
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820
5880
5940 
6000 



GRGPRLQPSS 
SRQPZSRGWE 
ADTEGHSPKA 
GPQSRDAGRS 
SYDDDSTEVE 



PRPGAPPSAS 
PALSPSRQSP 
RLXiPTQPHLS 
SHGDGDREDG 
SPQSTVPSRA 
DIiRRSPQRGA 
QPGSTDRHAS 
PSQPRLSLTQ 
AQOVRAPAEA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 




WO 02/086443 






ARAKEAAASL 
KSEPPSKRPL 
SLAKEEREPA 
WPRYTTRAPP 
SNGKPNGQRI 
TPWSPDGLP 
TPLPTTTTPR 
TPKPTTPIPT 
DEDYEFETSR 
PSAPCSLTDA 
TPGDLVTGYL 
SPSVSFVTES 



PKHQQVBSPT 
SSKSQQSVSA 
IALAPRGGSL 
GHFSTTPMLS 
INGPQGTKWV 
LFGQGRKGTP 
PTTATTMQPT 
CPPGTLERHD 
PPTTTEPSTT 
LDHFQVDSLD 
VYSASYBDPI 
DNPLLWRPP 



EDEEEEDAGF 
APVKRPLPPP 
LRQRMMHARF 
VDLDRGLVLN 
LANAQDKPIL 
TTTTPLPTTT 
DDGNLIMSSN 
ATTPRVIPEE 
EIIPNDLKKS 
RNKFSTQASS 
GGELSGSHSL 



QRGHAASPAR 
FKGGKEDLLS 
PGSSPRASHV 
RNPLSRQPAR 
AEGRYLQDSH 
SLGGKPLVGL 
PRPTTATTRR 
GIPECYAEED 
GAISSFPEEE 
DLPPQHAPRN 
VTHLPIENLK 
SNMIPATRTA 



SSVPKWPSSS 
PSRPPPRSAA 
PSYRQGYNGR 
GNPLRIKLGG 
EVIKKTTHPP 
TTTRRPTTTV 
EFSGLETPTA 
FDLAGRKRFV 
ITWAVEGCH 
PNTRYYFKVQ 
MDGNM 



ARVPSRAAPG 

TPRGGKDADG 

TVSPVAGTHP 

PNVEGKVLPG

DGRTIVDLEG 

TTTMQPTTTT 

RTTTRTTTTT 

VPTEEAYVIY 

APYVTYLNKD 

SFVIVDWDKA 

AQNPHGYGPI 



Seq ID NO: 420 DNA sequence 
Nucleic Acid Accession #: NM_022743 
Coding sequence: 128..1237



1 
I 

GTGGATTTTA 
TCTCTGCTGC 
AAAGCTGATG 
AAAAGCTTGG 
TCCTCCAGAC 
TTCAGAATCA 
TGAAGATAAG 
AGAAATACAG 
AGTGATCTGC 
ATATCCCAGT 
TGGGCCCCAC 
CTGCTACCTG 
CTGCTTTGAA 
TGGTGATGAG 
GGCACACTGG 
TGAACGGCTT 
CTGCATCAAC 
ATACAGGATT 
CAAACTGCAG 
TGATATTATG 
TTTAGAAGAA 
CGGCGTGTGT 
GTGAACCTCT 
GGTTTGCAAA 
ATTTGGTTGA 



11 
I 

GAGATACCTC 
AACACCCCTA 
CGATGCTCTC 
CCAGACCACA 
TCCGTTCGAC 
GAGAAGCTTT 
AAAGAGGGCC 
GATGCCTCTC 
AACTCTTTCA 
ATCTCTTTGC 
CTCTTACTGC 
GATATGCTGA 
TGTGACTGTT 
CAAGTATGGA 
AAGTGGGAGC 
CCCGATATCA 
CTCGGCCTGT 
TTTTTCCCAG 
CTACATCAAG 
AGAGTGACAC 
TGCGACGCCA 
CTTTGTTGAA 
CTTATTGGAA 
CCACAAGAAT 
GGATGCCAAA 



21 
I 

CCCTCCTTCT 
CTGTTCTCCG 
AGTGCCGCGT 
AGCGGGAATG 
TTCTTGGCAG 
ACTCATTTTA 
TCAGGCAACT 
AGCTGCCACC 

CCATCTGTAA
TCAATCACAG
GAGCAGTCCG
TGACCAGTGA

TCCGTTGCCA 
AGGAAGTTCA 
AGGTTCTGGC 
ACATCTACCA 
TGGAGGAAGC 
GAAGCCATCC 
GCATGTTTCC 
ATGGCAGAGA 
ACATCAGAGC 
TGCCTTATTG 
ATTCTGTTCC 
CATTAGTTGT 
AAAAAAAAAA 



31 
I 

GCTCAGCTGC 
TGTATTGGCT 
CGCCAAATAC 
CAAATGCCTT 
AGTTGTCTTC 
TGATCTGGAG 
CGTAATGACA 
TGCCTTTGAC 
TGCGGAGATG 
CTGTGACCCC 
AGACATCGAG 
GGAGCGCCGG 
AACCCAGGAC 
AGAATCCCTG 
CATGTGCCAG 
GCTGAAGGTG 
CTTGTTCTAT 
CGTCAGAGGG 
CCAAGCAATG 
ACACAGCCTG 
ATCCTAAGGG 
AGGTCACACA 
GTGTTTGTGT 
AGAGAAGCAC 



Seq ID NO: 421 Protein sequence 
Protein Accession #: NP_073580



MRCSQCRVAK 
SEKLYSPYDL 
QJSFTICNAE 
LDMLMTSEER 
WKWEQVLAMC 
IPPPGSHPVR 
ECDANIRAS 



11 

I

YCSAKOQKKA 
ESNINKLTED 
MQEVGVGLYP 
RKQLRDQYCF 
QAIISSNSER 
GVQVMKVGKL 



21 

I 

WPDHKRECKC 
KKEGLRQLVM 
SISLLNHSCD 
ECDCPRCQTQ 
LPDINIYQLK 
QLHQGMFPQA 



31 
I 

LKSCKPRYPP 
TFQHFMREEI 
PNCSIVFNGP 
DKDADMLTGD 
VLDCAMDACI 
MKNLRLAFDI 



41 

I 

CTTGCAGTAA 
TTTCTGGGCA 
TGTAGTGCTA 
AAAAGCTGCA 
AAACTTATGG 
TCAAATATTA 
TTTCAACATT 
CTTTTTGAAG 
CAGGAAGTTG 
AACTGTTCGA 
GTGGGAGAGG 
AAGCAGCTGA 
AAGGATGCTG 
AAAAAAATTG 
GCGATCATAA 
CTCGACTGCG 
GGTACTCGGA 
GTTCAAGTGA 
AAGAATCTGA 
ATTGAAGATT 
AACGCAGTCA 
CTCTATGCTT 
AGGTAAATAA 
GATTATAATA 



41 
I 

DSVRLLGRW 
QDASQLPPAF 
HLLLRAVRDI 
EQVWKEVQES 
NLGLLEEALF 
MRVTKGREHS 



51 
I 

TTAAACTCTT 
GCAGGAAGGA 
AGTGTCAGAA 
AACCCAGATA 
ATGGAGCACC 
ACAAACTGAC 
TCATGAGAGA 
CCTTTGCAAA 
GTGTTGGCCT 
TTGTGTTCAA 
AGCTCACCAT 
GGGACCAGTA 
ATATGCTAAC 
AAGAACTGAA 
GCAGGAATTC 
CCATGGATGC 
CCATGGAGCC 
TGAAAGTTGG 
GACTGGCTTT 
TGATTCTACT 
GAGGGAAATA 
TGTTAGCTGT 
AGGCAGACAT 
AATTCAAAAC 



51 
I 

FKLMDGAPSE 
DLFEAFAKVI 
EVGEELTICY 
LKKIEELKAH 
YGTRTMEPYR 
LIEDLILLLE 



Seq ID NO: 422 DNA sequence 

Nucleic Acid Accession #: NM_003014.2

Coding sequence: 238..648



1 
I 

GGCGGGTTCG 
CGGAGCTCCG 
AAACTCTCCT 
GGCAGGAAGA 
TTOCTCTCCA 
GCGCCCTGCG 
ATGCCCAACC 
GAGGAGCTGG 
GCGCCCATTT 
CAACGCGCGC 
AGCCTGGCCT 
ATCGTCACGG 
CAGGAAAGGC 
AAGGTGAAGC 
AAAATAAAAG 
GAGATCTTCA 
TCTTGCCAGT 
CGTTCAAGGA 
AAAAGATCCA 
AAGAAAACAG 
GCTCCCAAAC 
AACCCGAAAA 
GATGAGGCTG 



11 
1 

CGCCCCGAAG 
CGGCCGGACC 
GCGCCCCAGA 
GAAGGCGCTT 
TCCTAGTGGC 
AGGCGGTGCG 
ACCTGCACCA 
TGGACGTGAA 
GCACCCTGGA 
GCGACGACTG 
GCGACGAGCT 
ACCTCCCGGA 
CTCTTGATGT 
CAACTTTGGC 
CTGTGCAGAG 
AGTCCTCATC 
GTCCACACAT 
TGATGCTTCT 
TACAGTGGGA 
CCGGGCGCAC 
CAGCCAGTCC 
GAGTGTGAGC 
GGCATTGCCT 



21 
1 

GCTGAGAGCT 
CCGCGGCCCC 
AGATTTCTTC 
TCTGTCTGCC 
GCTGTGCCTG 
CATCCCTATG 
CAGCACGCAG 
CTGCAGCGCC 
GTTCCTGCAC 
CGAGCCCCTC 
GCCTGTCTAT 
GGATGTTAAG 
TGACTGTAAA 
AACGTATCTC 
GAGTGGCTGC 
ACCCATCCCT 
CCTGCCCCAT 
TGAAAATTGC 
AGAGAGGCTG 
CAGTCGTAGT 
CAAGAAGAAC 
TAACTAGTTT 
GGGACAGCCT 



31 
I 

GGOGCTGCTC 
GCTTTGCTGC 
CTCGGCGAAG 
GGGGTCGCAG 
TGGCTGCACC 
TOCCGGCACA 
GAGAACGCCA 
GTGCTGCGCT 
GACCCTATCA 
ATGAAGATGT 
GACCGTGGCG 
TGGATAGACA 
CGCCTAAGCC 
AGCAAAAACT 
AATGAGGTCA 
CGAACTCAAG 
CAAGATGTTC 
TTAGTTGAAA 
CAGGAACAGC 
AATCCCCCCA 
ATTAAAACTA 
CCAAAGCGGA 
ATGTAAGGCC 



41 

I 

GTGCCCTGTG 
CGACTGGAGT 
GGACAGCGAA 
CGCGAGAGGG 
TGGCGCTGGG 
TGCCCTGGAA 
TCCTGGCCAT 
TCTTCTTCTO 
AGCCGTGCAA 
ACAACCACAG 
TGTGCATTTC 
TCACACCAGA 
CCGATCGGTG 
ACAGCTATGT 
CAACGGTGGT 
TCCCGCTCAT 
TCATCATGTG 
AATGGAGAGA 
GGAGAACAGT 
AACCAAAGGG 
GGAGTGCCCA 
GACTTCCGAC 
ATGTGCCCCT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



60 
120 
180 
240 
300 
360 



51 
I 

TGCCAGACGG 
TTGGGGGAAG 
AGATGAGGGT 
CAGTGCCATG 
CGTGCGCGGC 
CATCACGCGG 
OGAGCAGTAC 
TGCCATGTAC 
GTCGGTGTGC 
CTGGCCCGAA 
GCCTGAAGCC 
CATGATGGTA 
CAAGTGTAAA 
TATTCATGCC 
GGATGTAAAA 
TACAAATTCT 
TTAOGAGTGG 
TCAGCTTAGT 
TCAGGACAAG 
AAAGCCTCCT 
GAAGAGAACA 
TTCCTTACAG 
TGCCCTAACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 




ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT 
GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT 
GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA 
CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC 
AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG 
TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT 
AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT 
TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG 
TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT 
AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT 
GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC 
ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT 
ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT 
AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC 
TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA 
AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG 
AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT 
ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA 
AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT 
ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA

GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT 
CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC 
TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA 

Seq ID NO: 423 Protein sequence 
Protein Accession #: NP_003005.1



1 11 21 31 41

I I I I I 

MFLSILVALC LWLHLALGVR GAPCBAVRIP MCRHMPWNIT RMPNHLHHST 
YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP 
ESLACDELPV YDRGVCISPE AIVTDLPEDV KWIDITPDMM VQERPLDVDC 
KKVKPTLATY LSKNYSYVIB AKIKAVQRSG CNEVTTWDV KEIFKSSSPI 
SSCQCPHILP HODVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRSIQWEER 
KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 




TTTCTTAAGG 
TGOCCTTTGG 
GAGTAACCTG 
TAATATGTGC 
TATGTTTTAT 
GAAAATATAA 
GTTTATGGTC 
TAGCATATGG 
AGAAACTTAA 
GTCTGGATTC 
CCTCTTAAGC 
TAGTTGGCTA 
CCCAGGACAT 
CAAATTTTGT 
AATATAAATG 
TTGGGATACT 
AATTGTGGAC 
CAGTAAGCAT 
CCTATCTAAT 
TTCTCCAGGT 
GTATACATGT 
TGTCAAGAAA 
TACTCAACAA 
TAAACATCTC 



CTATGCTTCA 
TACAGAAGGT 
TGTGCATACT 
ATTGTAAAAT 
TACCTTTTGA 
TGTTTTTAAG 
TGCAGAAGGA 
AAAATTATAA 
AAACAAAAAT 
CTGTTTTTTG 
AGCACCAGAA 
ATGCTCAAGT 
CCACCCTGAG 
TTTTCTTCAT 
TACAGAGAGG 
TTAATCAGAA 
AATTGGAGGC 
GTATTTTATA 
CCTACTCTCC 
GTTTGCTTAT 
GTTTCATAAC 
GCAGAAACCA 
ACTGTTGTGC 
ACOGGAATTC 



51 

I 

QENAILAIBQ 
LMKMYNHSWP 
KRLSPDRCKC 
PRTQVPLITN 
LQEQRRTVQD 



Seq ID NO: 424 DNA sequence
Nucleic Acid Accession #: BC010423
Coding sequence: 248..1780



CACAGCGTGG 
AGCTACGGCT 
CAAGTGCGAG 
TCTGCAGCCG 
TTCAACCATG 
GCTGCTACTG 
CGTGGTAACT 
CGGCGAGCAA 
ACTAGCGCTA 
GGAGCAGCCG 
GCAGGCGGAT 
GGCGCGGCTG 
ACTAGAAGAG 
CCCCAGCGTG 
CTCCCGCTCT 
GCAGCCACTG 
CATCCTCCAC 
GTGGCACATT 
CTCATACAAC 
CACTTTGGGC 
CAATGAGTTC 
CTCTX3GGAAG 
ACTCTTGTTC 
GGCCCAGCAG 
CCGGAGGCTG 
GAGAGCCGAG 
AGAGCCCGAG 
TGAACTGCTG 
CAAACAGGCC 
CAATGGCATC 
CTAGGCCTGG 
ACACCCCCAT
AACCCTTCTG 
CACTGTGTGT 
TGACTGTCCG 
AAGTGAACTG 
GTTTGGCGTG 
CAGACCCCAG 
CAGACCCAGG 
TCTCCTACCA 
GAGGCTTGAA 
ACATATTTTC 
ACTTTTAATT 
TTTTATTTTT 



11 
I 

GAAGCAGCTC 
GGGTGTGTAG 
AGGCAAGAAC 
GCTCCCAGGG 
CCCCTGTCCC 
CTGGCATCAT 
GTGGTGCTGG 
GTGGGGCAAG 
CTGCACTCCA 
CCGCCCCCAC 
GAGGGCGAGT 
CGGCTCCGAG 
GGCCAGGGCC 
ACCTGGGACA 
GCTGCCGTCA 
ACTTGTGTGG 
GTGTCCTTCC 
GGCAGAGAAG 
TGGACACGGC 
TTTCCCCCAC 
TCCTCAAGGG 
CAGGTGGACC 
TGCCTTCTGG 
ATGACCCAGA 
CATTCCCATC 
GGCCACCCTG 
GGCCGCAGTT 
TCTCCAGGCT 
ATGAACCATT 
TACATCAATG 
CTCCTTCTOT 
TTCTTGCGGA 
TTCATCGGGA 
GTGCATGTGT 
TGGAGGGGTG 
TGGTGTATGT 
TGTGTCATGT 
AGCAGTATTA 
TGTGCGGGCA 
CTTCGGAGCC 
CTGTTACAGA 
TGTAAATATA 
TTTTTCTTTT 
ATTTTTTTTT 



21 
I 

TGGGGGAGCT 
AACGGGGCCG 
TCTGCAGCTT 
AGATCTCGGT 
TGGGAGCCGA 
TTACAGGCCG 
GCCAGGACGC 
TGGCATGGGC 
AATACGGGCT 
GCAACCCCCT 
ACGAGTGCCG 
TGCTGGTGCC 
TGACCCTGGC 
CGGAGGTCAA 
CCTCAGAGTT 
TGTCCCATCC 
TTGCTGAGGC 
GAGCTATGCT 
TGGATGGGCC 
TGACCACTGA 
ATTCTCAGGT 
TAGTGTCAGC 
TGGTGGTGGT 
AATATGAGGA 
ACACGGACCC 
ATAGTCTCAA 
ACTCCACGCT 
CTGGGCGGGC 
TTGTTCAGGA 
GGCGGGGACA 
TGACATGGGA 
AGATGCTCCC 
GGGCTCCACC 
GCCTGTGTGA 
ACTGTGTCCG 
GCCACGGGAT 
GGCTGTGTGT 
ATGATGCAGA 
TAGCTGGAGC 
ATGGGGGCAA 
AGCCCTCTGC 
CATGCGCCGG 
TTTTTTCTTG 
AGAGTTTGAG 



31 
I 

CGGAGCTCCC 
GGGCTGGGGC 
CCTGCCTTCT 
GGAACTTCAG 
GATGTGGGGG 
GTGCCCCGCG 
AAAACTGCCC 
TCGGGTGGAC 
TCATGTGAGC 
GGACGGCTCA 
GGTCAGCACC 
TCCCCTGCCC 
AGCCTCCTGC 
AGGCACAACG 
CCACTTGGTG 
TGGCCTGCTC 
CTCTGTGAGG 
CAAGTGCCTG 
TCTGCCCAGT 
GCACAGCGGC 
CACTGTGGAT 
CTOGGTGGTG 
GGTGCTCATG 
GGAGCTGACC 
CAGGAGCCAG 
GGACAACAGT 
GACCACGGTG 
CGAGGAGGAG 
GAATGGGACC 
CCTGGTCTGA 
GATTTTAGCT 
CATCCCACTG 
AATTGAGTCT 
GTGTTGACTG 
TGGTGTGTAT 
TTGAGTGGTT 
GACCTCTGCC 
GGTTGGAGGA 
TGGAATCTGC 
GTGTGAAGCA 
CCTCTGGTGG 
GAGCTTCTTG 
CCCTTTCCAT 
TCCAGCCTGG 



41 

I 

GATCACGGCT 
TGGGTCCCCT 
GGGTCAGTTC 
AAACGCTGGG 
CCTGAGGCCT 
GGTGAGCTGG 
TGCTTCTACC 
GCGGGCGAAG 
CCGGCTTACG 
GTGCTCCTGC 
TTCCCCGCCG 
TCACTGAATC 
ACAGCTGAGG 
TCCAGCCGTT 
CCTAGCCGCA 
CAGGACCAAA 
GGCCTTGAAG 
AGTGAAGGGC 
GGGGTACGAG 
ATCTACGTCT 
GTTCTTGACC 
GTGGTGGGTG 
TCCCGATACC 
CTGACCAGGG 
CCGGAGGAGA 
AGCTGCTCTG 
AGGGAGATAG 
GAAGATCAGG 
CTACGGGCCA 
CCCAGGCCTG 
CATCTTGGGG 
ACTGCTTGAC 
CTCCCACCAT 
ACTGTGTGTG 
TATGCTGTCA 
GCGTGGGCAA 
TGAAAAAGCA 
GAGAGGTGGA 
CTCCGGTGTG 
GCCAGTCCCT 
CCTCTGGGCC 
CAGGAATACT 
TAGTTGTATT 
ACGATATAGC 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



51 

I 

TCTTGGGGGT 

AGTGGAGACC 

CTTATTCAAG 

CAGTCTGCCT 

GGCTGCTGCT 

AGACCTCAGA 

GAGGGGACTC 

GCGCCCAGGA 

AGGGCCGCGT 

GCAACGCAGT 

GCAGCTTCCA 

CTGGTCCAGC 

GCAGCCCAGC 

CCTTCAAGCA 

GCATGAATGG 

GGATCACCCA 

ACCAAAATCT 

AGCCCCCTCC 

TGGATGGGGA 

GCCATGTCAG 

CCCAGGAAGA 

TGATCGCCGC 

ATCGGCGCAA 

AGAACTCCAT 

GTGTAGGGCT 

TGATGAGTGA 

AAACACAGAC 

ATGAAGGCAT

AGCCCACGGG

CCTCCCTTCC 

GCCTCCTTAA 

CTTTACCTCC 

GCATGCAGGT 

TGTGGAGGGG 

TATCAGAGTC 

CACTGTCAGG 

GGTATTTTCT 

GACTGTGGCT 

AGGGAACCTG 

GGGTCAGCCA 

TGCTGCATGT 

GCTCCGAATC 

TTTTATTTAT 

CAGACCCTGT 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1260
1320 
1380 
1440 
1500 
1560 
1620
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



MPLSLGAEMW 
QVGQVAWARV 
DEGBYECRVS 
VTWDTEVKGT 
HVSPLAEASV 
GPPPLTTEHS 
PCLLVWWL 
EGHPDSLKDN 
AMNHFVQENG 



11 
I 

GPEAWLLLLL 
DAGEGAQELA 
TFPAGSFQAR 
TSSRSFKHSR 
RGIiEDQNLWH 
GIYVCHVSNE 
MSRYHRRKAQ 
SSCSVMSEEP 
TLRAKPTGNG 



21 
I 

LLASFTGRCP 
LLHSKYGLHV 
LRLRVLVPPL 
SAAVTSEFHL 
IGREGAMLKC 
PSSRDSQVTV 
QMTQKYEEBL 
EGRSYSTLTT 
IYINGRGHLV 



31 
I 

AGELETSDW 
SPAYEGRVEQ 
PSLNPGPALE 
VPSRSMNGQP 
LSEGQPPPSY 
DVLDPQEDSG 
TLTRENSIRR 
VREIETGTEL 



41 
I 

TWLGQDAKL 
PPPPRNPLDG 
5GQGLTLAAS 
LTCWSHPGL 
NWTRLDGPLP 
KQVDLVSASV 
LHSHHTDPRS 
IiSPGSGRAEE 



Seq ID NO: 426 DNA sequence 

Nucleic Acid Accession #: NM_003474.2

Coding sequence: 37..3036



CACTAACGCT 
TCAAGGCTGG 
CTTTTTTAAA 
CCGGAGCTGA 
GCCCGCGTGG 
GCGACGATGG 
GCCGGTGCTC 
GCTGATGAAG 
TTCGACTCCA 
CTGATCATAA 
TATCTGCAAG 
TGTTACTACC 
TCTGGTCTCA 
AGTGCAACCA 
TGTGGATCAC 
CAGACATGGG 
GTGATCGTGG 
CAGCGATTAA 
ATCGTGTTGG 
CCATTCACCA 
TCCCATGACA 
GCCCCAATCA 
GACAATCCCC 
AATCATGACA 
ATCATGAACG 
GACTTGGAGA 
AGGGAGTCTT 
GkCTGTGGGG 
AAGCCGGACG 
GGAACAGCGT 
AGCCCTCACT 
GGCTACTGCT 
CCAGGTGCTA 
TATGGCAACT 
AAATGTGGAA 
GTTTCCATAG 
CAOGTGTACT 
GCAGATGGAA 
GAGTGTGCAA 
GAGGCCCACT 
GGCCCCATCC 
TGTCTTCTTG 
TTTACAAATA 
CGTGGCTTCC 
CCGCCAGATT 
GACATCAGCA 
CTTCCTCCCC 
AAGCCTGCAC 
CCTGCAGATC 
CAATGGGAGA 
GTGCCCAGAT 
TGAAGACAGA 
GGATTTTTTT 
GCTGTGCTGT 
GCAGAATGTT 
CCATGGCAGG 
ATGGGATTCT 
CCAACTACCC 
CTCAGTTGAT 
TGTGTTTGGC 
ACACCTGGGA 
AGGAATCTTA 
TGACCCTGAG 
GGTGCTTGAT 



11 

I 

CTTCCTAGTC 
CTTGTGCCAG 
AATGAAAGGC 
CTCGCCGAGG 
GATGGTGCAG 
CAGCGCGCCC 
TGCTCGCGCC 
TTGTCAGTGC 
AGAATCATCC 
ATCTGGAAAG 
ACGGTACTGA 
ATGGACATGT 
GGGGACTTAT 
ACAGATACAA 
ATCACAACAC 
CAAGAAGGCA 
CAGACAACCG 
TAGAGATTGC 
TAGGOGTGGA 
GCCTCCATGA 
ATGCGCAGCT 
TGAGCATGTG 
TTGGTGCAGC 
CACTGGACAG 
CTTCCACCGG 
CCAGCCTGGA 
TGGGGGGCCA 
AGCCAGAGGA 
CTGTGTGCGC 
GCAGGGACTC 
GCCCAGCCAA 
ACAATGGCAT 
AACCTGCCCC 
GTGGCAAAGT 
AAATCCAGTG 
AAACAAACAT 
TGGGCGATGA 
AAATCTGCCT 
TGCAGTGCCA 
GGGCACCTCC 
GGCAAGCAGA 
CTGCOGGATT 
AGAAGACCAC 
AACCCTGTCA 
CCTACCCACC 
GACCCCTCAA 
TCCACCGGGC 
TTAGGCAGGC 
CTCTGGCCAG 
CTGGGCTCCG 
CCACCCACAC
AGTTTGCACT 
TAATGTTTAA 
GCTATGGTGC 
GATTACAGTG 
AAGGCTTGTT 
GGACAGGATG 
CCAGCTGTGC 
TTTCTGGATT 
TTTCAGGGAG 
GAAATCTGGC 
AGGTGTAGCC 
CTGACCAGCC 
AGAAATGCCA 



21 
I 

CCCGGGCCAA 
AAOGGCGCGC 
TAGAAGAGCT 
CAGGAAATCC 
CGCTCGCCGC 
GCTGCCCGTG 
CTGCGAGGCC 
CTCTGTTCGG 
AGAAGTGCTG 
AAATGAAGGT 
TGTCTCCCTC 
ACGGGGATAT 
TGTGTTTGAA 
ACTCTTCCCA 
ACCAAACCTC 
TAAAAGAGAG 
AGAGTTTGAG 
TAATCAGGTT 
AGTGTGGAAT 
ATTTCTGGAC 
TGTCAGTGGG 
CACGGCAGAC 
CGTGACCCTG 
GGGCTGTAGC 
GTACCCATTT 
GAAAGGAATG 
GAAGTGTGGG 
ATGTATGAAT 
ACATGGGCTG 
CAGCAACTCC 
CGTGTACCTG 
CTGCCAGACT 
TGGGATCTGC 
CTCGAAGAGT 
TCAAGGAGGT 
CCCCCTGCAG 
CATGCCGGAC 
GAATCGTCAA 
CGGCAGAGGG 
CTTCTGTGAC 
TAACCAAGGT 
TGTGGTTTAT 
CATTGAAAAA 
GGCTCACCTC 
GAAGGACAAT 
CGGCCTGAAT 
CCCACGTGCA 
CCAGGGGACC 
AACAACTCGG 
CCTGGCACCC 
CGCCTATATT 
ATCTTTCAGC 
AACATCATTA 
TCTGTCTACT 
CAGTGCGCTG 
GTGCTTTTAG 
TGTTTGCTTT 
TTATGGTACC 
CCCCATCTCA 
GCCCTGTGCC 
TTCTGGCCAG 
ACACCAGGAT 
GTGAGCATGT 
AGCACTTCTT 



31 
I 

CTCGGACAGT 
GCGCGACGCA 
CAGCGGCGGC 
CTCCGGTOGC 
CGGGCCCGAG 
TCCCCOGCCC 
CGAGGGGTGA 
AGTGGGGACC 
AATATTCGAC 
CTCATTGCCA 
GCTCGAAATT 
TCTGATTCAG 
AATGAAAGCT 
GCGAAGAAGC 
GCTGCAAAGA 
ACCCTCAAGG 
AGGCAAGGAA 
GACAAGTTTT 
GACATGGACA 
TGGAGGAAGA 
GTTTATTTCC 
CAGTCTGGGG 
GCACATGAGC 
TGTCAAATGG 
CCCATGGTGT 
GGGGTGTGCC 
AACAGATTTG 
CGCTGCTGCA 
TGCTGTGAAG 
TGTGACCTCC 
CAOGATGGGC 
CACGAGCAGC 
TTTGAGAGAG 
TCCTTTGCCA 
GCCAGCCGGC 
CAAGGAGGCC 
CCAGGGCTTG 
TGTCAAAATA 
GTGTGCAACA 
AAGTTTGGCT 
TTAACCATAG 
CTCAAAAGGA 
CTAAGGTGTG 
GGCCACCTTG 
CCCAGGAGAT 
GTCCCTCAGC 
CCTAGCGTCC 
TGTAAGCCAA 
CTCACTCATG 
CTCAGACCTG 
AAGTGAGAAG 
TCCAGTTGGA 
CTATAAQAAC 
TGCACAGGTA 
TAGTAGGCAT 
TATTTTAGTG 
CTGATCAAGG 
AGATGCAGCT 
GGCCAGAGCC 
CCTTGACAAC 
GAAGCTTTGG 
AGAGACTGGA 
TTGGAAGGGG 
TTTCTCGCTG 



41 
I 

TTGCTCATTT 
CGCACACACA 
GCGGGCCGTG 
GACGCCCGGC 
AGCTGCTGCA 
GCGCCCTCCT 
GCTTATGGAA 
TCTGGATCCC 
TACAACGGGA 
GCAGTTTCAC 
ACACGGTAAT 
CAGTCAGTCT 
ATGTCTTAGA 
TGAAAAGCGT 
ATGTGTTTCC 
CAACTAAGTA 
AAGATCTGGA 
ACAGACCACT 
AATGCTCTGT 
TGAAGCTTCT 
AAGGGACCAC 
GAATTGTCAT 
TGGGCCACAA 
OGGTTGAGAA 
TCAGCAGTTG 
TGTTTAACCT 
TGGAAGAAGG 
ATGCCACCAC 
ACTGCCAGCT 
CAGAGTTCTG 
ACTCATGTCA 
AGTGTGTCAC 
TCAATTCTGC 
AATGCGAGAT 
CAGTCATTGG 
GGATTCTGTG 
TGCTTGCAGG 
TTAGTGTCTT 
ACAGGAAGAA 
TTGGAGGAAG 
GAATTCTGGT 
AGACCTTGAT 
TGCGCCCTTC 
GAAAAGGCCT 
TGCTGCAGTG 
CCCAGTCAAC 
CTGCCAGACC 
ACCCCCCTCA 
CCTTGGCCAG 
CTGCACAATA 
CCGACACCTT 
GTTTTTTGTA 
TTTGAGCTAC 
CTTGTAAATT 
TTTTACCATC 
AACTTGAAAT 
CCTTATTGGA 
CAAGAGATCC 
AAGGGGCTTC 
TGGCAGGCAG 
TGAGAACCTG 
ACACTAGACA 
TCTGTAGTGT 
TCCTTTCTAG 



51 
I 

PCPYRGDSGE 
SVLLRNAVQA 
CTAEGSPAPS 
LQDQRITHIL 
SGVRVDGDTL 
VWGVIAALL 
QPEESVGLRA 
EEDQDBGIKQ 



51 
I 

ATTGCAACGG 
CGGGGGGAAA 
OGCGAGGGCT 
CCCGCTCGGC 
CTGAAGGCCG 
GCTOGCCCTG 
CGAAGGAAGA 
AGTGAAGAGC 
AAGCAAAGAA 
GGAAACCCAC 
TCTGGGTCAC 
CAGCACGTGT 
ACCAATGAAA 
CCGGGGATCA 
ACCACCCTCT 
TGTGGAGCTG 
AAAAGTTAAG 
GAACATTCGG 
AAGTCAGGAC 
ACCTCGCAAA 
CATCGGCATG 
GGACCATTCA 
TTTCGGGATG 
AGGAGGCTGC 
CAGCAGGAAG 
GCOGGAAGTC 
AGAGGAGTGT 
CTGTACCCTG 
GAAGCCTGCA 
CACAGGGGCC 
GGATGTGGAC 
ACTCTGGGGA 
AGGTGATCCT 
GAGAGATGCT 
TACCAATGCC 
CCGGGGGACC 
CACAAAGTGT 
TGGGGTTCAC 
CTGCCACTGC 
CACAGACAGC 
GACCATCCTG 
ACGACTGCTG 
CCGGCCACCC 
GATGAGGAAG 
TCAGAATGTT 
TCAGCGAGTG 
CCTGCCAGCC 
GAAGCCTCTG 
GACCCCAGGA 
TCCACACCAA 
TTTTCAACAG 
CCAACTTTTA 
TGCCGTCAGT 
ATTAATTTAT 
ACTGAGTTTT 
ATCCTOCTTG 
AAGCAGTCCC 
CAAGTAGAAT 
AGGTCCAGGC 
GCTCCCAGGG 
GGTTGCAGAC 
AGCCAGAACT 
CACTCAAGGC 
AGCACTGCCA 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
600
660 
720 
960
3120
3240
CCAGTAGGTT
CTGCAAACOG 
CAATGATCCT 
TGAACCATTA 
AATTAGGCAG 
TATAGTTCAT 
CATCCTCTTT 
ACCTATTTCT 
CAACTTGCTT 
CTCTTCACTC 
AATGGCATGA 
CTGGACTGGT 
TTTATGAGAA 
TGTACCAAGA 
TCCCCACTGT 
AAACACACAC 
TTATTCTATA 
AGATACATAC 
TATATACTAT 
AGATGCCCAA 
ACCAAAAAAA 



ATTTAGCTTG 
CCACCTCCCT 
GTATTCAGAC 
ACCAGATCTA 
ACTCTTTATG 
GTCTGCTATC 
TTCCAACTTG 
TAAACACTTG 
ATCAACTTCC 
TTCAAATGCC 
GAAATACAAA 
TTTCACATTA 
AGCCTTCTTT 
ATCTTGGTTT 
ATCTAGGCAA 
AAAAGGGAAC 
GTTATTAAGT 
AGAATTACTG 
TAAAAAGGTT 
ATCCTTAGAT 
AAAAAAAAAA 



GGAAAGGTGG 
ATACTGCTTG 
AGATGAGGAC 
GTCAATCAAG 
CTTGCAAAAA 
ATTATTCGTA 
GCTGCAGGAA 
CAACCTACCT 
TAAATATTAT 
TGACTAGGGA 
AATACTCAGA 
GAAGACAATT 
TGGGGTCAAC 
GCCTTCCAGA 
CATAGTATTC 
CCAGCTCTAA 
TCTTTAAAAT 
TAACTGATTA 
TACAGAATTT 
CTGGCATGTT 
AA 



Seq ID NO: 427 Protein sequence
Protein Accession #: NP_003465



1 

I

MAARPLPVSP
SKNHPEVLNI
YHGHVRGYSD 
SHHNTPNLAA 
LIEIANHVDK 
DNAQLVSGVY 
DTLDRGCSCQ 
SFGGQKCGNR 
ACRDSSNSCD 
AKPAPGICFE 
IETOIPLQQO 
AMQCHGRGVC 
LAAGFWYLK 
DSYPPKDNPR 
ALRQAQGTCK 
RSTHTAYIK 



11 
I 

ARALLLALAG 
RLQRESKELI 
SAVSLSTCSG 
KNVFPPPSQT 
FYRPLNIRIV 
FQGTTIGMAP 
MAVEKGGCIM 
FVEEGEECDC 
LPEFCTGASP 
RVNSAGDPYG 
GRILCRGTHV 
NNRKNCHCEA 
RKTLIRLLFT 
RLLQCQNVDI 
PNPPQKPLPA 



21 
I 

ALLAPCEARG 
INLERNEGLI 
LRGLIVFENE 
WARRHKRETL 
LVGVEVWNDM 
IMSMCTADQS 
NASTGYPFPM 
GEPEECMNRC 
HCPANVYLHD 
NCGKVSKSSF 
YLGDDMPDPG 
HWAPPFCDKF 
NKKTTZEKLR 
SRPLNGLNVF 
DPLARTTRLT 



TGTTTCTGTA 
GAGCTGAGCA 
TTTCCATGGG 
TCTGTTTACT 
CTACAACCAA 
GATATTGGAC 
TCTTTAAAAG 
GTTGAGCATC 
GAGATGTGGC 
GCCATGTTTC 
TAAGGTAAAA 
GACAACAGTT 
AGTTTTCCTA 
AAACAAAACT 
ATGACTATGG 
TACATTCCAA 
GTAAAGCCAT 
CACTTGGTAA 
TATGGTGCAT 
AGCCCTTCCT 



31 
J 

VSLWNEGRAD 
ASSFTETHYL 
SYVLEPMKSA 
KATKYVELVI 
DKCSVSQDPF 
GGIVMDHSDN 
VFSSCSRKDL 
CNATTCTLKP 
GHSCQDVDGY 
AKCEMRDAKC 
LVLAGTKCAD 



CVRPSRPPRG 
QPQSTQRVLP 
HALARTPGQW 



AGAAACCTAC 
AATCACCACA 
ACCACAACTA 
GCAAGGTTCA 
TGGAATGTGA 
AAAGAACCTT 
ATGCTTTTAA 
ACAGAATGTG 
TTGGGCAGCA 
ACAAGGTCTT 
TGCCATGATG 
ACATAATTCA 
TGCTTTGAAA 
GCATTTCACT 
ATAAACTAAA 
CTOGTATAGC 
GCTGGAAAAT 
TTGTACTAAA 
TACGTGGGCA 
CCAATTATAA 



41 

I 

EWSASVRSG 
QDGTDVSLAR 
TNRYKLFPAK 
VADNREFQRQ 
TSLHEFLDWR 
PLGAAVTLAH 
BTSLEKGXGV 
DAVCAHGLCC 
CYNGICQTHE 
GKIQCQGGAS 
GKICLNRQCQ 
IRQADNQGLT 
FQPCQAHLGH 
PLKRAPRAPS
ETGLRLAPLR 



TGCCCAGGCA 
AACTGTAATA 
TTTTCAGATG 
ACTTATTAAC 
TGTTCATGGG 
CTCTATGGGG 
CAGAGTCTGA 
ATAAGGAAAT 
TCCCCTTGAA 
TAAAGTGACT 
CCTCTGTCTT 
CTCTGAGTGT 
CAGAAAAATA 
TTCCCGGTGT 
CACGTGACAC 
ATGCATCTGT 
AATACTGCTG 
GCCAAACATA 
TTGTCTTTTT 
GAGGATATGA 



51 
I 

DLWIPVKSFD 
NYTVILGHCY 
KLKSVRGSCG 
GKDLEKVKQR 
KMKLLPRKSH 



CLFNIiPEVRB 
EDOQLKPAGT 
QQCVTIiWGPG 
RPVIGTNAVS 
NISVFGVHEC 
IGILVTILCL 
LGKGLMRKPP 
VPARPLPAXP 
PAPQYPHQVP 



Seq ID NO: 428 DNA sequence

Nucleic Acid Accession #: NM_003714

Coding sequence: 135..1043



GAGGAGGAGG 
GAGGAAGAGG 
TAATACCAAG 
TGGCCACCTT 
ACAGGAGCTC 
GTTTGGTCAA 
GTGAGATTCG 
ATGCCCAGGG 
ACAGGTTCGG 
AGCGGGAATG 
TAGTGGAGAT 
ACTTGCTGCT 
AGTGTGAGCA 
AGAAGCCTCC 
GGGCCCACCA 
GAGGTGCCAA 
TCGGGGGCCT 
AGTATTCTGA 
GTCCATTTTC 
ACGCAGGATT 
TGAGATGGAG 
ACGTACTCAA 
AGTCAGTGGG 
GCAGGGCCCC 
GCAGCAGCCT 
QAATGTAAAA 
AGGGGGTGCT 
CTCTTGGCGA 
TGTCTGGGCT 
ACTGCTTCAA 
TCTAAATAAA 
TAAAAAAAAA 
TTAAAAGCTA 
CACTTGGGGG
TTTCCCTTAG 
CTGGCGATTC 
GAAAGAAGAG 



11 
I 

GAAAAGGCGA 
AGGAGGAGGA 
AACCATGTGT 
TGACCOGGOG 
CCAGCAGAAA 
CGCTGGCGAT 
GGGCTTACAT 
CAAGTCATTC 
CTGCATAAGC 
CTACCTCAAG 
GATCCATTTC 
GACCTGTGGG 
GAACTGGGGA 
CACGGCGCCC 
CGGGGAAGCA 
GGGTGAGCGA 
TGGGGCTCAG 
TATCCGGAGG 
TTATCTATGG 
CTGTGGGGAC 
ACCCCTGGGG 
GGGAGCGCGC 
TGTCGGCCGC 
CAGAGCTQGG 
CTGGTGCTGT 
ATAAATATCG 
TGGTGCCAAA 
GGGTGGAGGG 
GGGGGGGACA 
ATCTCGATTT 
TGGCTTTCAA 
AAAACCAGCC 
TCAAACAGCG 
AAACCTTATA 
GATTTCGTTA 
CAGGAGACCC 
AATGAAGACT 



21 
I 

GCAAAAAGGA 
AGAGGGGAGC 
GCCGAGCGGC 



GGCCGCCTGT 
GTGGGGTGTG 
GGGATTTGCA 
ATCAAAGACG 
CGGAAGTGCC 
CACGACCTGT 
AAGGACTTGC 
GAGGAGGTGA 
AGCCTGTGCT 
CCCGAGCGCC 
GGACATCACC 
GGTAGCAAGA 
GGACCTTCCG 
TGAAATGAAA 
ACATTCCAAA 
TGTGGACTTC 
CCGTGGGGTC 
CCGCGTTATC 
TCTGTTGTGG 
CCACACAGT3 
CTCCGCGGAA 
CTTAGAATGC 
CTGAAATTCA 
AGGAGTGTCA 
CTGTCCAAGG 
CACTTTTTTT 
ACAAAGCAAC 
CATCCTTTGA 
ACATAGCCAT 
CCCAGAGGAA 
TCTCACCTTG 
AGCTGGAAAC 
ACTTAGTAAT 



31 
I 

AGAGTGGGAG 
ACAAAGGATC 
TGGGCCAGTT 
ACGCCACCAA 
CCCTGCAGAA 
GCGTGTTTGA 
TGACTTTTCT 
CCTTGAAATG 
CGGCCATCAG 
GCGCGGCTGC 
TGCTGCACGA 
AGGAGGCCAT 
CCATCTTGAG 
AGCCCCAGGT 
TCCCAGAGCC 
GCCACCCAAA 
GAAGCAGCGA 
GGCCTGGCCA 
ACATTTACCA 
ATCGAGGTGT 
TCAGGGGTGC 
CTCGTACCTT 
GGGAOGTGAA 
GGTGCTGGGC 
GTGAGGGCGG 
AGGAGAAGGG 



TTTCTATGTG 
GAGTGGCCCC 
ATTTATCCAG 
TGGGTCATTA 
GGCTGATTTT 
ACATCTGACT 
AATACACACC 
ACCCTCAGCC 
CTGGCTTCTC 
TCCCATCAGG 



41 

I 

GAGGAGGGGA 
CAGGTCTCCC 
CATGACCCTG 
CCCACCCGAG 
TACAGCGGAG 
ATGTTTCGAG 
GCACAACGCT 
TAAGGCCCAC 
GGAAATGGTG 
CCAGGAGAAC 
ACCCTACGTG 
CACCCACAGC 
CTTCTGCACC 
GGACAGAACC 
CAGCAGTAGG 
CGCCCATGCC 
GTGGGAAGAC 
CGAAATCTTT 
TTAGAGAGGG 
GTGTTCGCGG 
CTGGTGAATT 
TGTCTTCTTT 
CCAGGGAGGG 
CTCGCCCCGA 
CTGGATTCCA 
TGGAGAGGAG 
TGGGGCCTTG 
TAATTTCTGA 
TATGAGTTTA 
TTATATCTAC 
AAACCAGCTC 
TCTTTTTTTT 
GCCTGACATG 
TGGGGAGTAC 
AAGATTGGTA 
CATGTGAGGG 
AAATGCTGAC 



3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 



51 
I 

AGCGGCGAAG 
GACGGGAGGT 
GCTTTGGTGT 
GGTCCCCAAG 
ATCCAGCACT 
AACAACTCTT 
GGAAAATTTG 
GCTCTGCGGC 
TCCCAGTTGC 
ACCCGGGTGA 
GACCTCGTGA 
GTGCAGGTTC 
TCGGCCATCC 
AAGCTCTCCA 
GAGACTGGCC 
CGAGGCAGAG 
GAACAGTCTG 
CCTCCACGCC 
GGGATGTCAC 
AACGGACAGG 
CTGCACTTAC 
CCATCTGTGG 
GCAGGGCAAG 
AGCTTCTGGT 
GGACAGGAGT 
GCAGGGGCCG 
CGGTTCAGAG 
GCCATTGTAC 
TATTTTAACC 
ATATCTGTCA 
AAAGGGGGTT 
AAGTTCTATT 
GACTCCTGCC 
ATTTGACAAA 
AAGCTGCGTC 
GATGGGAAAG 
CTTTTACATA 



60 
120 
180 . 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220

AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA
TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTCGTGCC

Seq ID NO: 429 Protein sequence
Protein Accession #: NP_003705 



PCT/US02/12476



2280 
2340 



1 11 21 31 41 51
I I I I I I
MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60
GDVGCGVFEC FENNSCEIRG LHGICMTPLH NAGKFDAQGK SFIKDALKCK AHALRHRPGC 120
ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300
RR



Seq ID NO: 430 DNA sequence 
Nucleic Acid Accession #: NM_005940
Coding sequence: 23..1489



AAGCCCAGCA 
CGCCCTCCTG 
TCTGCCGCCG 
AGCCCTGCCC 
CAGCCTCAGG 
CCGACAGAAG 
GATCCTTOGG 
CCTAAAGGTA 
TGACATCATG 
TGGGGGCATC 
CGACTATGAT 
AGCCCATGAA 
GTCXX5CCTTC 
TCAACACCTA 
CCAGGCTGG6 
CTGTGAGGCC 
GGGCTTTGTG 
TCGCCACTGG 
CATTTGGTTC 
CCCCGCACCC 
GGGTCCCGAG 
CAGCACCCGG 
CTCTGAGATC 
CCTCTACTGG 
GGGTCCTGAC 
ATGCCCTCAG 
ATCTTTGTGG 
GGTGGGGTAC 
AGCGACTGTC 
GGGACCCGCT 
GTAGCACCAT 
TCCTTCCAGG 
TGAGCAACTG 
ATCTGTCTGC 
GTTCACAGTC 
CAACATACCT 
ATCCTCCAAA 
TTTTTAAACT 



11 
I 

GCCCCGGGGC 
CCCCCGATGC 
GACGTCCACC 
AGTAGCCCGG 
CCTCCCGGCT 
AGGTTCGTGC 
TTCCCATGGC 
TGGAGCGATG 
ATCQACTTCG 
CTGGCCCATG 
GAGACCTGGA 
TTTGGCCACG 
TACACCTTTC 
TATGGCCAGC 
ATAGACACCA 
TCCTTTGACG 
TGGCGCCTCC 
CAGGGACTGC 
TTCCAAGGTG 
CTCACCGAGC 
AAGAACAAGA 
CGTGTAGACA 
GACGCTGCCT 
AAGTTTGACC 
TTCTTTGGCT 
GGGTGCTGAC 
CTGTGGGCAC 
AACCACCATG 
TCAGACTGGG 
ATGCAGGTCC 
GGCAGGACTG 
GGCTGGCACT 
GGCTGTAGGG 
CTTCTGGCTG 
AAATGGGGAG 
CAATCCTGTC 
GCCATTGTAA 
GAGGATTGTC 




CTATCGGGGA 
TGCTGGGGCT 
GCTACCCACT 
CCTGGCCCAC 
ATGAGATTGC 
CGGTCTCCAC 
GTGGGGGCCA 
CCAGCCCTGT 
CTCAGTACTG 
TGGGCCTGGT 
TCTACTTCTT 
GTCCOGTGCC 
TCCAGGATGC 
CTGTGAAGGT 
GTGCCGAGCC 
CCCTGCCAGG 
CAGGCATGGG 
ACAACTGCCG 
CAGGGAGGCT 
TGGCAAACCT 
GGGGAACTGG 
GAAGCAAGGG 
CAGGGCCACT 
ACAATCCTGG 
GGGTATTCTT 
CCAGGCCGGA 
ATGTGTGTAC 
ATTAAACACA 



31 

I 

GGCCGCCTGG 
GCTCCAGCOG 
CGAGAGGAGG 
TGCCACGCAG 
CGACCCATCT 
GCGCTGGGAG 
GGAGCAGGTG 
CACCTTTACT 
GCATGGGGAC 
CAAGACTCAC 
TGACCAGGGC 
GCAGCACACA 
GAGTCTCAGC 
TGTCACCTCC 
ACCGCTGGAG 
CATCCGAGGC 
GCTGCAGCCC 
GGACGCTGCC 
GGTGTACGAC 
GAGGTTCCCG 
CCGAGGCAGG 
CCGCAGGGCC 
TGATGGCTAT 
GAAGGCTCTG 
TGCCAACACT 
CCACGAATAT 
ACTGAGCCCA 
GGAGGGCCAC 
TTGGCATGAC 
GGCTGCCCTG 
AGTGTCCTTG 
TGCTGGGGCC 
TCCTGAGGTC 
AAATCTGTTC 
CATGCAGGAG 
TCCTCCTGAA 
AGTGTGTATA 
GTTGTTTTCT 



41 

I 

CTCCGCAGCG 
CCGCCGCTGC 
GGGCCACAGC 
GAAGCCCCCC 
GATGGGCTGA 
AAGACGGACC 
CGGCAGACGA 
GAGGTGCACG 
GACCTGCCGT 
CGAGAAGGGG 
ACAGACCTGC 
ACAGCAGCCA 
CCAGATGACT 
AGGACCCCAG 
CCAGACGCCC 
GAGCTCTTTT 
GGCTACCCAG 
TTCGAGGATG 
GGTGAAAAGC 
GTCCATGCTG 
GACTACTGGC 
ACTGACTGGA 
GCCTACTTCC 
GAAGGCTTCC 
TTCCTCTGAC 
CAGGCTAGAG 
TGTCTCCTGC 
GCAGGTCGTG 
TTAAGAGGAA 
TCTCATCCCT 
CTGTATCCCT 
CCATGGCCTT 
AGGTCTTGGT 
TCCAGAATCC 
ACCCCAGGCC 
GCCCTTTTCG 
AACCTTCTTC 



51 
I 

CGGCCGCGCG 
TGGCCCGGGC 
CCTGGCATGC 
GGCCTGCCAG 
GTGCCCGCAA 
TCACCTACAG 
TGGCAGAGGC 
AGGGCCGTGC 
TTGATGGGCC 
ATGTCCACTT 
TGCAGGTGGC 
AGGCCCTGAT 
GCAGGGGCGT 
CCCTGGGCCC 
CGCCAGATGC 
TCTTCAAAGC 
CATTGGCCTC 
CCCAGGGCCA 
CAGTCCTGGG 
CCTTGGTCTG 
GTTTCCACCC 
GAGGGGTGCC 
TGCGCGGCCG 

CATGGCTTGG 
ACCCATGGCC 
AGGGGGATGG 
GTCACCTGCC 
GGGCAGTCTT 
GTCCCTCAGG 
GTTGTGAGGT 
CAGCCCTGGC 
AGGTGCCTGC 
AGGCCAAAAA 
CTGGAGGCTG 
CAGCACTGCT 
TTCTTTTTTT 



Seq ID NO: 431 Protein sequence 
Protein Accession #: NP_005931



1 11 21 31 41 51
I I I I I I
MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 60
PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120
LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180
FPPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGLQHTTAAK ALMSAFYTFR 240
YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLEPDAP PDACEASFDA 300
VSTIRGELFF FKAGFVWRLR GGQLQPGYPA LASRHWQGLP SPVDAAFEDA QGHIWFFQGA 360
QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPEKNKI YFFRGRDYWR FHPSTRRVDS 420
PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 480
AEPANTFL




Seq ID NO: 432 DNA sequence 
Nucleic Acid Accession #: NM_024022 
Coding sequence: 202.. 1563 



1 11 21 31 41 51

I I I I I I

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480

GGAGTCTGGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCOGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCOGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040 

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 



Seq ID NO: 433 Protein sequence 
Protein Accession #: NP_076927

1 11 21 31 41 51

I I I I I I

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180

VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360

VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT

Seq ID NO: 434 DNA sequence 

Nucleic Acid Accession #: NM_000493.2 

Coding sequence: 97..2139

1 11 21 31 41 51

I I I I I I

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAGCTTCAGA AAGCTGCCAA GGCACCATCT 60 

CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120 

CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180

ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240 

AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGTACTC CTGGTCCACC AGGCCCTGCT 300 

GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGTCCT 360 

GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420 

GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480 

GTTGGACCAG CTGGCCTACC AGGACCCCGG GGCCCACCAG GACCACCTGG AATCCCTGGA 540 

CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 600 

CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660 

GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720 

GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGGAAAAA GAGGTGAAAA TGGGGTTCCA 780 

GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840 

CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900 

GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960 

ATAGCTGGGC CCCCAGGGCC TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020

GAAAGAGGAC CTGCTGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 

GGTCTTCCTG GGAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140

AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 1200 

CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 1260 

GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 

AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380 

GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440 

GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500 

CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGCAA CTAAGGGCCT CAATGGACCC 1560 

ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA GGCCACTCTG GAGAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680 

GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740 



ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 1920 

CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980 

ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 

CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 2100 

CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG TACACCCCAC AGAGCTAATC 2160 

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTATCAGACC 2280 

AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATATCAC 2400 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 2460 

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2520 

ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580 

AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2640 

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2700

CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAAAAGC 2760 

CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2820 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2880 

GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3000 

CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 3060 

ATATTATTTA TTTATGCACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120 

TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180 

AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG

Seq ID NO: 435 Protein sequence
Protein Accession #: NP_000484.2

1 11 21 31 41 51

I I I I I I

MLPQIPFLLL VSLNLVHGVF YAERYQMPTG IKGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120

GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180

VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 

GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360

KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420

PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600

YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 436 DNA sequence 

Nucleic Acid Accession #: XM_062811

Coding sequence: 1..888

1 11 21 31 41 51

I I I I I I

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60 

CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 180 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 300 

CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 360 

GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 420 

AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG CCCCAGGGGG TAACCGCTTG 480 

ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG 540 

TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 600 

AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660

CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 

CAGTATCTGC ATCCCCCATA CGTGGGGTAC ACGGTGCAGC ACGACTCTGT GCCCATGACA 780 

GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840 
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 
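
The listings in this section use a fixed layout: six groups of ten residues per row, a running position number at the end of each full row, and a "Coding sequence: start..end" line giving 1-based inclusive coordinates. The following is a minimal sketch, not part of the original disclosure, of how a reader might join such rows, slice the coding region, and translate it for comparison against the paired protein listing. The function names (parse_rows, coding_region, translate) are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (an assumption
# of this sketch; the listing itself does not state which code applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def parse_rows(text):
    """Join the residue groups of each row, skipping trailing position numbers."""
    groups = []
    for line in text.splitlines():
        for token in line.split():
            if not token.isdigit():  # purely numeric tokens are position counters
                groups.append(token)
    return "".join(groups)


def coding_region(seq, start, end):
    """Slice 1-based inclusive coordinates, as in 'Coding sequence: start..end'."""
    return seq[start - 1:end]


def translate(cds):
    """Translate complete codons; unreadable codons (e.g. OCR damage) become 'X'."""
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds) - 2, 3)
    )


# First two rows of Seq ID NO: 436 as printed above.
rows = """ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60
CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120"""

seq = parse_rows(rows)
protein = translate(coding_region(seq, 1, 120))
print(len(seq), protein)
```

Applied to the two rows shown, the translation can be compared directly with the first forty residues of Seq ID NO: 437. The same comparison is a convenient way to spot transcription errors in either listing, since a damaged codon shows up as a mismatch or an 'X'.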

Seq ID NO: 437 Protein sequence 
Protein Accession #: XP_062811

1 11 21 31 41 51 

I I I I I I 

MWGARRSSVS SSWNAASLLQ LLLAALLAAG ARASGEYCHG WLDAQGVWRI GFQCPERFDG 60 

GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120

VGSVFVAFII LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 180

SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 240
QYLHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF PHTNSEQKMY PAVTV

Seq ID NO: 438 DNA sequence 

Nucleic Acid Accession #: NM_004004.1

Coding sequence: 1..681 

1 11 21 31 41 51 

I I I I I I 

ATGGATTGGG GCACGCTGCA GACGATCCTG GGGGGTGTGA ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180
WO 02/086443

AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG GGCCCTGCAG 240
CTGATCTTCG TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA CCGGAGACAT 300
GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA CATCGAGGAG 360
ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC AAGCAGCATC 420
TTCTTCCGGG TCATCTTCGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT GTACGACGGC 480
TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC TGTGGACTGC 540
TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC AGTGTCTGGA 600
ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA TTGTTCTGGG 660
AAGTCAAAAA AGCCAGTTTA A

Seq ID NO: 439 Protein sequence 
Protein Accession #: NP_003995.1 

1 11 21 31 41 51 

I I I I I I 

MDWGTLQTIL GGVNKHSTSI GKIWLTVLFI FRIMILVVAA KEVWGDEQAD FVCNTLQPGC 60
KNVCYDHYFP ISHIRLWALQ LIFVSSPALL VAMHVAYRRH EKKRKFIKGE IKSEFKDIEE 120
IKTQKVRIEG SLWWTYTSSI FFRVIFEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC 180
FVSRPTEKTV FTVFMIAVSG ICILLNVTEL CYLLIRYCSG KSKKPV






Seq ID NO: 440 DNA sequence 

Nucleic Acid Accession #: XM_061091.1

Coding sequence: 1..2481 



1 

I 

ATGCCAAATA 
CACCGAGCGC 
CCCGGGTACC 
CAGATGCATG 
GTTGGGAAGC 
ATTTCAGCTG 
GATGGGTCTA 
GTCTGTGACG 
TCCACTCCTC 
AGAATCAAGA 
CTTCTGCACA 
GTCACTGATG 
GGTGTCACTG 
GCCAGCGAGC 
GGCCTCTTCA 
CCCGAGCTTG 
CAGCCCTGCC 



GTCGACCTCC 
GCCAAAGTCT 
GTGGGTGTGG 
GATGTGCCTG 
ACGGGCAGTG 
CAGGACCGGC 
GCGGGCCCAG 
GCCGTGCGC3G 
GATCCTCAGG 
CGGCCAGGGT 
GTAGGGCCCG 
GAGGTGAACC 
GCCTTCGGGC 
CCCTACCTAG 
ATGACCGTCC 
GGGAGAGGCG 
GTCTTGGTCG 
CGGGATTCCC 
ATTGAGTGGC 
ATGAATGAGG 
TGGGAGGGCC 
GGATGGATTC 
ACCCCTCCCA 
GTCTGTGCCC 




CTAGAGGGCA 
GCACCCTCAG 
TCTTCATGGA 
AGAATGGAGG 
CCTTTGGAGG 
TCTTCCTGCT 
TCGTGAAGCG 
CCACATACAG 
ACCTGGTCTG 
CCTTGCGGCA 
CACGTAGAGT 
CGCGTCACGC 
CAGAGCTGGA 
ATCTGTTCAA 
GCCGGACACA 
AGAATTTTGC 
CTGACGTGAC 
TGGACACCAA 
GTGGGGTGGG 
AGAGGGGTGC 
CAGAGGATGC 
TGGGCGTGGG 
TGATCCACGT 
TGTGTGGAGA 
GCAGCTGCGT 
CCCACTGCGA 
TTGAGACGCC 
GCAACTACAG 
CAGGTCCTTA 



21 
I 

AACCAGGATT 
TCTCCTTCCG 
GGCTGCCOAT 
AGATCTCTGG 
GGCAAAOGAA 
GATGTGGTGC 
GAAAGGGAGC 
CAGCCCCGAG 
CCCCTTGGAT 
CAAAGGAGGG 
TGGAGGCAGA 
GGGGGATGTG 
GGGGGTCAGG 
GCACGTGCTG 
CAGCTCGGCC 
GCGGTTAATG 
CACATGTGTT 
GGAGGCTAAC 
GGACAGCTCT 
GTTTGTGCGG 
CAGGGAGCTG 
GAGCCTCGAT 
GGCGGCAGAG 
GGTGGTTTTG 
AAGGGCGCGA 
GGAGATCACA 
CCAAATCCCT 
AGCCCTGGAC 
TCAGATGCAG 
ACAGGTCGGC 
ACCCACCCGG 
CTCAGCCGGC 
CCGGCCTGGT 
AGCCGTTCCT 
GCCTGTCCTA 
GGCAGCTTAC 
AGCCAAGCAG 
CCTGCAGAAT 
GAACCGTGAG 
CCTGAGGCAC 
AGAAGGCCTG 
G 



31 
I 

GAAATTTGGC 
GTGAGTCCCA 
GACCGATTCA 
AGCCTTGGTG 
GTCCATGTAA 
TCGGCTGCAG 
TTTGAAAGGT 
AGGGTCAGAG 
TCATTTTCAA 
CGCACGGAGA 
AATGCTTCTG 
GCACTGCCAT 
TTTCCCAGGT 
TTGGCTGAGC 
ATCTGCTCCA 
GGCATCTCTC 
CCAGAAGGAC 
TGTGCCCTGA 
GCGGGCACCA 
GCCGTGCTGA 
CTGGTGGCGG 
GGCATTCCCT 
CGTGGCTTCG 
CTCACTGAGT 
GAGCTGCTCC 
GGCAGCCCAA 
GAGCTGCAGG 
CTCGTCTTCA 
AGCTTTGTGA 
CTGGTGGTGT 
GCTGCOATGC 
ACCGCCCTGC 
GTCCCCAAAG 
GCCCAGAAGC 
AGTGAGGGTC 
GCCGACCTGC 
CCAGTCAACC 
GGGAGCTACC 
TGGAGCTCTT 
ATGGCTCCCG 
GGCACTGAAA 



41 
I 

TTCTCCAAGA 
GCCCCGAGTT 
CGCTCCCGAT 
TTCTTTGCTA 
GCAAAGAAAC 
TGGACATCAT 
CCAAGCACTT 
TGGGAGCATT 
CCCAACAGGA 
CGGAACTTGC 
TGCCCCAGAT 
CCAAGCAGCT 
GGGAGGAGCT 
AGGTGGAGGA 
GCGCCACGCC 
TGATAGGCCC 
TGGACGGCTA 
AGCTGAGCCT 
CTCTGGACGG 
GCGAGGACTC 
TGCCTGTGGG 
TCCGTGGTGG 
GGAGCGCCAC 
CACACTCCGA 
TGCTGGGTGT 
AGCATGTGAT 
GGAAGCTGTG 
TGTTGGACAC 
GAAGCTGTGC 
ATGGCAGCCA 
TGCGGGCCAT 
TGCACATCTA 
CTGTGGTGGT 
TGAGGAACAA 
TGCGGAGGCT 
GGTACCACCA 
TCTGCAAACC 
GCTGCAAGTG 
GCTCTGTATG 
TGCAGGAGGG 
TGGTGCCTAC 



51 
I 

GCCGCCCGGG 
GGCTCTGGCG 
GATTGGAGGT 
TGAATTTTTA 
CATCGGGAAG 
GTTTCTGTTA 
TGCCATCACA 
CCAGTTCAGT 
AGTGAAGGCA 
TCTGAAATAC 
CCTCATCATC 
GAAGGAAAGG 
GCATGCACTG 
TGCCACCAAC 
AGCTGGGAGC 
CTGTGACTCG 
CCAGTGCCTC 
GGAATGCAGG 

TCGGGCCCGA 
GGAGTACCAG 
CCCCACCCTG 
CAGGACAGGC 
GGATGAGGTT 
AGGCAGTGAG 
GGTCTACTCG 
CAGCCGGCAG 
CTCTGCCTCA 
CCTCCAGTTT 
GGTGCAGACT 
TAGCCAGGCC 
TGACAAAGTG 
GCTCACAGGC 
TGGCATCTCT 
TGCAGGTCCC 
GGACGTGCTC 
CAGCCCGTGC 
TCGGGATGGC 
TGTGAGCCAG 
CAGCAGCCGT 
CTTCTGGAAT 



Seq ID NO: 441 Protein sequence 
Protein Accession #: XP_061091.1



1 11 21 31 41 51
I I I I I I
MPNTSGTTRI EIWLLQEPPG HRALVAALLP VSPSPELAIA PGYPPVPAAD DRFTLPMIGG 60
QMHGEKVDLW SLGVLCYEFL VGKPPFEANE VHVSKETIGK ISAASKKMWC SAAVDIMFLL 120
DGSNSVGKGS FERSKHFAIT VCDGLDISPE RVRVGAFQFS STPHLEFPLD SFSTQQEVKA 180
RIKRMVFKGG RTETELALKY LLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLKER 240
GVTVFAVGVR FPRWEELHAL ASEPRGQHVL LAEQVEDATN GLFSTLSSSA ICSSATPAGS 300
PELVFMERLM GISLIGPCDS QPCQNGGTCV PEGLDGYQCL CPLAFGGEAN CALKLSLECR 360
VDLLFLLDSS AGTTLDGFLR AKVFVKRFVR AVLSEDSRAR VGVATYSREL LVAVPVGEYQ 420
DVPDLVWSLD GIPFRGGPTL TGSALRQAAE RGFGSATRTG QDRPRRVVVL LTESHSEDEV 480
AGPARHARAR ELLLLGVGSE AVRAELEEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ 540
RPGCRTQALD LVFMLDTSAS VGPENFAQMQ SFVRSCALQF EVNPDVTQVG LVVYGSQVQT 600
AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKAVVVLTG 660
GRGAEDAAVP AQKLRNNGIS VLVVGVGPVL SEGLRRLAGP RDSLIHVAAY ADLRYHQDVL 720




IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ 780
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP

Seq ID NO: 442 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..2424






ATGCCCCCTT 
TCTCTCCCTC 
AGCAAAATOA 
AGCGTCGGGA 
CTGGACATCA 
CTGGAATTCC 
ATGGTTTTCA 
GGGTTGCCTQ 
AAGTCCCAGG 
TTTGCTGTGG 
A6AG00CA6C 
ACCCTCAGCA 
CCCTGTGAGC 
AGAGGATGGC 
AGAGTGTTCC 
TCGCAGCCCT 
CTCTGCCCGC 
AGGGTCGACC 
COGGCCAAAO 
C6A0TGGGTG 
CAGGATGTGC 
CTGACGGGCA 
GGCCAOGACC 
GTTGOGGGCC 
GAGGCCGTGC 
TCGGATCCTC 
CAGCGGCCAG 
TCAGTAGGGC 
TTTGAGGTGA 
ACTGCCTTCG 
GCCCCCTACC 
GTGATGACCG 
GGCGGGAGAG 
TCTGTCTTGG 
CCCOGGGATT 
CTCATTGAGT 
TGCATGAATG 
GGCTGGGAGG 
CAGGGATGGA 
CGTACCCCTC 
AATGTCTGTG 



11 

1 

TCCTGTTGCT 
TCCAGGAAGT 
TGTGGTGCTC 
AAGGGAGCTT 
GCCCCGAGAG 
CCTTGGATTC 
AAGGAGGGCG 
GAGGCAGAAA 
GGGATGTGGC 
GGGTCAGGTT 
ACGTGCTGTT 
GCTCGGCCAT 
ACAGGACGCT 
GGCGGACCCT 
TAACCCACCC 
GCCAGAATGG 
TGGCCTTTGG 
TCCTCTTCCT 
TCTTCGTGAA 
TGGCCACATA 
CTGACCTGGT 
GTGCCTTGOG 
GGCCACGTAG 
CAGOGCGTCA 
GGGCAGAGCT 
AGGATCTGTT 
GGTGCOGGAC 
COGAGAATTT 
ACCCTGACGT 
GGCTGGACAC 
TAGGTGGGGT 
TCCAGAGGGG 
GCGCAGAGGA 
TOGTGGGCGT 
CCCTGATCCA 
GGCTGTGTGG 
AGGGCAGCTG 
GCCCCCACTG 
TTCTTGAGAC 
CCAGCAACTA 
CCCCAGGTCC 



21 
I 

GGAGGCCGTC 
CCATGTAAGC 
GGCTGCAGTG 
TGAAAGGTCC 
GGTCAGAGTG 
ATTTTCAACC 
CACGGAGACG 
TGCTTCTGTG 
ACTGCCATCC 
TCCCAGGTGG 
GGCTGAGCAG 
CTGCTCCAGC 
GGAGATGGTC 
TGCGGTGCTG 
TGCCACCTGC 
AGGCACATGT 
AGGGGAGGCT 
GCTGGACAGC 
GOGGTTTGTG 
CAGCAGGGAG 
CTGGAGCCTC 
GCAGGCGGCA 
AGTGGTGGTT 
CGCAAGGGCG 
GGAGGAGATC 
CAACCAAATC 
ACAAGCCCTG 
TGCTCAGATG 
GACACAGGTC 
CAAACCCACC 
GGGCTCAGCC 
TGCCCGGCCT 
TGCAGCOGTT 
GGGGCCTGTC 
CGTGGCAGCT 
AGAAGCCAAG 
CGTCCTGCAG 
CGAGAACCGT 
GCCCCTGAGG 
CAGAGAAGGC 
TTAG 



31 



TGTGTTTTCC 
AAAGAAACCA 
GACATCATGT 
AAGCACTTTG 
GGAGCATTCC 
CAACAGGAAG 
GAACTTGCTC 
CCCCAGATCC 
AAGCAGCTGA 
GAGGAGCTGC 
GTGGAGGATG 
GCCACGCCAG 
CGGGAGTTCG 
GCTGCACACT 
TACAGGACCA 
GTTCCAGAAG 
AACTGTGCCC 
TCTGCGGGCA 
CGGGCCGTGC 
CTGCTGGTGG 
GATGGCATTC 
GAGCGTGGCT 
TTGCTCACTG 
CGAGAGCTGC 
ACAGGCAGCC 
CCTGAGCTGC 
GACCTCGTCT 
CAGAGCTTTG 
GGCCTGGTGG 
CGGGCTGCGA 
GGCACCGCCC 
GGTGTCCCCA 
CCTGCCCAGA 
CTAAGTGAGG 
TACGCCGACC 
CAGCCAGTCA 
AATGGGAGCT 
GAGTGGAGCT 
CACATGGCTC 
CTGGGCACTG 



41 
I 

TGTTTTCCAG 
TCGGGAAGAT 
TTCTGTTAGA 
CCATCACAGT 
AGTTCAGTTC 
TGAAGGCAAG 
TGAAATACCT 
TCATCATCGT 
AGGAAAGGGG 
ATGCACTGGC 
CCACCAACGG 
ACTGCAGGGT 
CTGGCAATGC 
GTCCCTTCTA 
CCTGCCCAGG 
GACTGGACGG 
TGAAGCTGAG 
CCACTCTGGA 
TGAGCGAGGA 

CCTTCOGTGG 
TOGGGAGCGC 
AGTCACACTC 
TCCTGCTGGG 
CAAAGCATGT 
AGGGGAAGCT 
TCATGTTGGA 
TGAGAAGCTG 
TGTATGGCAG 




51 
I 

AGTGCCCCCA 
TTCAGCTGCC 
TGGGTCTAAC 
CTGTGACGGT 
CACTCCTCAT 
AATCAAGAGG 
TCTGCACAGA 
CACTGATGGG 
TGTCACTGTG 
CAGCGAGCCT 
CCTCTTCAGC 
CGAGGCTCAC 
CCCATGCTGG 
CAGCTGGAAG 
CCCCTGTGAC 
CTACCAGTGC 
CCTGGAATGC 
CGGCTTCCTG 
CTCTCGGGCC 
GGGGGAGTAC 
TGGCCCCACC 
CACCAGGACA 
CGAGGATGAG 
TGTAGGCAGT 
GATGGTCTAC 
GTGCAGCCGG 
CACCTCTGCC 
TGCCCTCCAG 
CCAGGTGCAG 
CATTAGCCAG 
CTATGACAAA 
GGTGCTCACA 
CAATGGCATC 
GCTTGCAGGT 
CCAGGACGTG 
ACCCAGCCCG 
GTGTCGGGAT 
ATGTGTGAGC 
GGGCAGCAGC 
TACCTTCTGG 



Seq ID NO: 443 Protein sequence 
Protein Accession #: Eos sequence



1 11 21 31 41 51
I I I I I I
MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180
FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420
LTGSALRQAA ERGFGSATRT GQDRPRRVVV LLTESHSEDE VAGPARHARA RELLLLGVGS 480
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540
SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLVVYGSQVQ TAFGLDTKPT RAAMLRAISQ 600
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVVVLT GGRGAEDAAV PAQKLRNNGI 660
SVLVVGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780
RTPPSNYREG LGTEMVPTFW NVCAPGP

Seq ID NO: 444 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 89..2356

1 11 21 31 41 51
I I I I I I
GCCCCCTGGC CCGAGCCGCG CCCGGGTCTG TGAGTAGAGC CGCCCGGGCA CCGAGCGCTG 60
GTCGCCGCTC TCCTTCCGTT ATATCAACAT GCCCCCTTTC CTGTTGCTGG AAGCCGTCTG 120
TGTTTTCCTG TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA 180
AGAAACCATC GGGAAGATTT CAGCTGCCAG CAAAATGATG TGGTGCTCGG CTGCAGTGGA 240
CATCATGTTT CTGTTAGATG GGTCTAACAG CGTCGGGAAA GGGAGCTTTG AAAGGTCCAA 300
GCACTTTGCC ATCACAGTCT GTGACGGTCT GGACATCAGC CCCGAGAGGG TCAGAGTGGG 360
AGCATTCCAG TTCAGTTCCA CTCCTCATCT GGAATTCCCC TTGGATTCAT TTTCAACCCA 420
ACAGGAAGTG AAGGCAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGCGCA CGGAGACGGA 480
ACTTGCTCTG AAATACCTTC TGCACAGAGG GTTGCCTGGA GGCAGAAATG CTTCTGTGCC 540
CCAGATCCTC ATCATCGTCA CTGATGGGAA GTCCCAGGGG GATGTGGCAC TGCCATCCAA 600




GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG GTCAGGTTTC 
GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC GTGCTGTTGG 
GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT 
CACGCCAGAC TGCAGGGTOG AGGCTCACCC CTGTGAGCAC AGGACGCTGG 
GGAGTTCGCT GGCAATGCCC CATGCTGGAG AGGATCGCGG CGGACOCTTG 
TGCACACTGT CCCTTCTACA GCTGGAAGAG AGTGTTCCTA ACCCACCCTG 
CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG 
TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCCGCTG GCCTTTGGAG 
CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC CTCTTCCTGC 
TGCGGGCACC ACTCTGGACG GCTTCCTGCG GGCCAAAGTC TTCGTGAAGC 
GGCCGTGCTG AGOGAGGACT CTCGGGCCCG AGTGGGTGTG GCCACATACA 
GCTGGTGGOG GTGCCTGTGG GGGAGTACCA GGATGTGCCT GACCTGGTCT 
TGGCATTCCC TTCCGTGGTG GCCCCACCCT GACGGGCAGT GCCTTGCGGC 
GCGTGGCTTC GGGAGCGCCA CCAGGACAGG CCAGGACCGG CCACGTAGAG 
GCTCACTGAG TCACACTCOG AGGATGAGGT TGCGGGCCCA GCGCGTCAOG 
AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG 
AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA 
TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC 
CCTCGTCTTC ATGTTGGACA CCTCTGCCTC AGTAGGGCCC GAGAATTTTG 
GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC CCTGACGTGA 
CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG CTGGACACCA 
GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG 
CACCGCCCTG CTGCACATCT ATGACAAAGT GATGACCGTC CAGAGGGGTG 
TGTCCCCAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC GCAGAGGATG 
TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC GTGGGCGTGG 
AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC CCGGGATTCC CTGATCCACG 
CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG 
GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG GGCAGCTGCG 
TGGGAGCTAC OGCTGCAAGT GTCGGGATGG CTGGGAGGGC CCCCACTGCG 
CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG 
AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA 
CCAGGTCCTT AGAATGTCTG CTTCCCGCCG TGGCCAGGAC CACTATTCTC 
GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA 
AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG 
GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG GATAAACAAG 
GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTCGCC 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGACGG 

Seq ID NO: 445 Protein sequence 
Protein Accession #: Eos sequence






CCAGGTGGGA 
CTGAGCAGGT 
GCTCCAGCGC 
AGATGGTCCG 
CGGTGCTGGC 
CCACCTGCTA 
GCACATGTGT 
GGGAGGCTAA 
TGGACAGCTC 
GGTTTGTGCG 
GCAGGGAGCT 
GGAGCCTCGA 
AGGCGGCAGA 
TGGTGGTTTT 
CAAGGGCGCG 
AGGAGATCAC 
ACCAAATCCC 
AAGCCCTGGA 
CTCAGATGCA 
CACAGOTCGG 
AACCCACCCG 
GCTCAGCCGG 
CCCGGCCTGG 
CAGCCGTTCC 
GGCCTGTCCT 
TGGCAGCTTA 
AAGCGAAGCA 
"TCCTGCAGAA 
AGAACCGATT 
TACCCCTCCC 
TGTCTGTGCC 
ACTGAGGGAG 
TGTCACCCAC 
TACCTGCTGT 
GGGTCCTGAA 
AGAATGTTGT 



MPPFLLLEAV 
SVGKGSFERS 
MVPKGGRTET 
FAVGVRFPRW 
PCEHRTLEMV 
SQPCQNGGTC 
RAKVFVKRFV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYLGGVGSA 
SVLWGVGPV 
CMNEGSCVLQ 



11 
I 

CVFLFSRVPP 
KHFAITVCDG 
ELALKYLLHR 
EELHALASEP 
REFAGNAPCW 
VPEGLDGYQC 
RAVLSEDSRA 
ERGPGSATRT 
TGSPKHVMVY 
QSFVRSCALQ 
GTALLHIYDK 
LSEGLRRIAG 
NGSYRCKCRD 



21 
I 

SLPLQEVHVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA 
RVGVATYSRE 
GQDRPRRVW 
SDPQDLFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 



31 

I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSLEC 
LLVAVPVGEY 
LLTESHSEDB 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAVWLT 
YADLRYHQDV 
FLRRP 



Seq ID NO: 446 DNA sequence 

Nucleic Acid Accession #: NM_031942

Coding sequence: 145..1260



I 

CCCGAGCCCC 
TGCTCCTCCT 
CCGATCTGGG 
GTAAAGAAGA 
TCCTCTGATG 
TCAGTTOGGG 
GCGATGAAGT 
CAGCCCTCAG 
AATTTTTTGG 
ATGTCTGAAT 
GACTCACAAT 
CCTGAACGGA 
GCTCTACCCA 
ACCGTGGATG 
GTGACCCTTC 
GTCTGCAGCA 
TGCCGTCAGA 
CGAGGCCAGT 
CTGCTGGATC 
CGGCAGCGAG 
TTTGGGAATG 
TATCTGGAAA 
TTTTTTCACT 
AATCAAGTTA 



11 
I 

GCCCCTCCGG 
GCTGTGGGAC 
CACCCGCCAC 
ACTTAAAGAA 
ACAGTTGTGA 
AAGGCTGTAG 
TTCCAGCGCG 
AGAATTCTGT 
AGAAAAGGGC 
TAGAAAGCTT 
CAAGGAGACC 
GAGCTCGTCC 
TGGAGGAGGA 
GCTACATGAA 
CGCATATAAT 
ATTCTCGAGA 
AGACTATTGA 
TCTGTGGCCC 
CGAACTGGCA 
ATGGACGGTG 
TGCATGCCTA 
ATTTGCTGCC 
GAAACCTGAG 
ATCTTAGCAG 



21 
I 

GCCCGGGTCG 
CGCTGACCGC 
CAGCATGGAC 
ATTCAGATAT 
CAGCTTTGCT 
GACCCGCAGC 
GAGTACCAGG 
GACTGATTCC 
TTTAAATATA 
CCCTGGCTCG 
GCGAAGGCGT 
TCTTACCAGG 
GGAGGAAGAG 
TGAAGATGAC 
TCGCCCAGTG 
GAAGATATAT 
TACCAAAACA 
CTGCCTTCGA 
TTGCCCGCCT 
TGCGACTGGG 
CTTGAAAAGC 
TGCCTTCTAC 
TTAAAAATCT 
ACATGTGTTT 



31 
I 

GCGCGCCCAG 
GCGGCTGCTC 
GCTCGCCGCG 
GTGAAGTTGA 
TCTGATAATT 
CAGTGCAGGC 
GGAGCAACCA 
AACTCCGATT 
AAGCAAAACA 
TTCCGTGGAA 
ACATTCCCGG 
TCAAGGTCCC 
GATAAGTACA 
CTGCCCAGAA 
GAAGAAATTA 
AACCGTTCAC 
AACTGCAGAA 
AACCGTTATG 
TGTCGAGGAA 
GTCCTTGTGT 
CTGAAACAGG 
TTCTCAAATC 
TGATGATCAG 
CTGGAGCATC 



41 
I 

SKMMWCSAAV 
LEFPLDSFST 
KSOGDVALPS 
TLSSSAICSS 
RVPLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARBARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 



41 

I 

CCTGCCAGCC 
CGCTCTCCCC 
TGCCGCAGAA 
TTTCCATGGA 
TTGCAAACAC 
ACTCTGGACC 
ACAAAAAAGC 
CAGAAGATGA 
AAGCAATGCT 
GACATCCCCT 
GTGTTGCTTC 
GGATCCTCGG 
TGTTGGTGAG 
GCCGTCGCTC 
CAGAGGAGGA 
TGGGCTCTAC 
ACCCAGACTG 
GTGAAGAGGT 
TCTGCAACTG 
ATTTAGCCAA 
AATTTGAAAT 
TTTCTTGTAA 
CCTGTTTCAT 
ACAGAAGGTA 



51 

I 

DIMFLLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
RELLLLGVGS 
DLVFMLDTSA 
RAAMLRAISQ 
PAQKLRNNGI 
QPVNLCKPSP 



51 

I 

GCGCTGCTGC 
GCTCCAAGCG 
AGATCTCAGA 
AACCTCGTCA 
GAGGCTGCAG 
TCTCAGGGTG 
AGAGTCCCGC 
AAGTGGAATG 
TGCAAAACTC 
CCCAGGCTCC 
CAGGAGAAAC 
GTCCCTTGAC 
AAAGAGGAAG 
CAGATCATCC 
GTTGGAGAAC 
TTGTCATCAA 
CTGGGGCGTT 
CAGGGATGCT 
CAGTTTCTGC 
ATATCATGGC 
GCAAGCATAA 
AAGTTTCCAA 
AAGAAACTCC 
TATTGCTAGT 



660 
720 
780
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 




TACACTTTGC CCTCCTGCAO TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 
TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 
TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 
GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 
CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 
TCAAGGCACA AAASTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 
CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 
CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 
AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 
ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 
TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 
TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 
CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 
GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 
CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTQGTAGA GGTGGAATCT 
AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 
GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 

Seq ID NO: 447 Protein sequence 
Protein Accession #: NP_114148.1 






I 

MDARRVPQKD 
RSQCRHSGPL 
NIKQNKAM1A 
TRSRS RILGS 
PVEBITEEBL 
LRNRYGEEVR 
KSLKQEFEMQ 



11 
I 

LRVXKNLKKP 
RVAMKFPARS 
KLMSELESFP 
LDALPMEEEE 
ENVCSNSREK 
DALLDPNWHC 
A 



21 
I 

RYVKLISMET 
TRGATNKKAB 
GSFRGRHPLP 
EEDKYMLVRK 
IYNRSLGSTC 
PPCRGICNCS 



31 41 51 

I I I 

SSSSDDSCDS FASDNFANTR LQSVREGCRT 
SRQPSENSVT DSNSDSEDES GKNFLEKRAL 
GSDSQSRRPR RRTFPGVASR RNPERRARPL 
RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 
HQCRQKTIDT KTNCRKPDCW GVRGQFCGPC 
FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL 



Seq ID NO: 448 DNA sequence 
Nucleic Acid Accession #: NM_019894 
Coding sequence: 1..1314



1 
I 

ATGTTACAGG 
AAACCCCGTA 
CTGAGCCTGG 
TACTTCCTCT 
CTGGACTGTC 
GCAGTGGCAG 
GGGAACTGGT 
AGGCAGATGG 
GATCTGGATG 
GGGCCCTGTC 
AAGACCCCCC 
AGCATCCAGT 
CTCACGGCAG 
GGCTCAGACA 
TTCAACCCCA 
ACTTTCTCAG 
GCCACCCCAC 
GACATACTGC 
GCGTACCAGG 
GACACCTGCC 
GTGGGCATCG 
AAGGTCTCAG 



11 
1 

ATCCTGACAG 
TCCCCATGGA 
CGAGTATCAT 
GCGGGCAGCC 
CCTTGGGGGA 



GCTACAGCAG 
TTGTTGAAAT 
TCTCAGGCTC 

ACGACAAACA 
CCCACTGCTT 
AACTGGGCAG 
TGTACCCCAA 
GCACAGTCAG 
TCTGGATCAT 
TGCAGGCGTC 
GGGAAGTCAC 
AGGGTGACAG 
TTAGCTGGGG 
CCTATCTCAA 



21 

I 

TGATCAACCT 
GACCTTCAGA 
CATTGTGGTT 
TCTCCACTTC 
GGAOGAGGAG 
CAAGGACCGA 
TTTOGACAAC 
CAAACCCACT 
CACAGAAAAC 
CCTGGTCTCC 
TGGGGAGGAG 
GCACGTCTGT 
CAGGAAACAT 
CTTCCCATCC 
AGACAATGAC 
GCCCATCTGT 
TGGATGGGGC 
AGTCCAGGTC 
CGAGAAGATG 
TGGTGGGCCC 
CTATGGCTGC 
CTGGATCTAC 



31 

I 

CTGAACAGCC 
AAGGTGGGGA 
GTCCTCATCA 
ATCCCGAGGA 
CACTGTGTCA 
TCCACACTGC 
TTCACAGAAG 
TTCAGAGCTG 
AGCCAGGAGC 
CTGCACTGTC 
GCCTCTGTGG 
GGAGGGAGCA 
ACCGATGTGT 
CTGGCTGTGG 
ATOGCCCTCA 
CTGCCCTTCT 
TTTACGAAGC 
ATTGACAGCA 
ATGTGTGCAG 
CTGATGTACC 
GGGGGCCCGA 
AATGTCTGGA 



41 

I 

TCGATGTCAA 
TCCCCATCAT 
AGGTGATTCT 
AGCAGCTGTG 
AGAGCTTCCC 
AGGTGCTGGA 
CTCTCGCTGA 
TGGAGATTGG 
TTCGCATGCG 
TTGCCTGTGG 
ATTCTTGGCC 
TCCTGGACCC 
TCAACTGGAA 
CCAAGATCAT 
TGAAGCTGCA 
TTGATGAGGA 
AGAATGGAGG 
CACGGTGCAA 
GCATCCCGGA 
AATCTGACCA 
GCACCCCAGG 
AGGCTGAGCT 



Seq ID NO: 449 Protein sequence 
Protein Accession #: NP_063947.1



MLQDPDSDQP 
YFLCGQPLHF 
GNWFSACFDN 
GPCLSGSLVS 
LTAAHCFRKH 
TFSGTVRPIC 
AYQGEVTEKM 



11 
I 

LNSLDVKPLR 
IPRKQLCDGE 
FTEALAETAC 
LHCLACGKSL 
TDVFNKKVRA 
LPFFDEELTP 
MCAGIPEGGV 



21 
1 

KPRIPMETFR 
LDCPLGEDEE 
RQMGYSSKPT 
KTPRWGGEE 
GSDKLGSFPS 
ATPLWIIGWG 
DTCQGDSGGP 



31 

I 

KVGIPIIIAL 
HCVKSFPEGP 
FRAVEIGPDQ 
ASVDSWPWQV 
LAVAKIIIIE 
FTXQKGGKMS 
LMYQSDQWHV 



41 

I 

IiSLASIIIW 
AVAVRLSKDR 
DLDWEITEN 
SIQYDKQHVC 
FNPMYPKDND 
DILLQASVQV 
VGIVSWGYGC 



Seq ID NO: 450 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 

Coding sequence: 52..3042



11 



21 



31 



41 



51 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



51 
I 

ACCCCTGCGC 
CATAGCACTA 
GGATAAATAC 
TGACGGAGAG 
CGAAGGGCCT 
CTCGGCCACA 
GACAGCCTGT 
CCCAGACCAG 
GAACTCAAGT 
GAAGAGCCTG 
TTGGCAGGTC 
CCACTGGGTC 
GGTGCGGGCA 
CATCATTGAA 
GTTCCCACTC 
GCTCACTCCA 
GAAGATGTCT 
TGCAGACGAT 
AGGGGGTGTG 
GTGGCATGTG 
AGTATACACC 
GTAA 



51 

I 

VLIKVILDKY 
STLQVLDSAT 
SQELRMRNSS 
GGSILDPHWV 
IALMKLQFPL 
IDSTRCNADD 
GGPSTPGVYT 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 
300 
360 
420 



I I I I I I

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 
GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 
GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 
CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 
AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 
TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 



60 
120 
180 
240 
300 
360 




GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420

GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480

TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540

AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 

CAGCAGCTGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 

GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720

TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 

TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840 
CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020 

GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080

GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140

GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200

TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260

GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500

ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680

CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGGCAT TGCCTTTGAG 1740

GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800 

GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 

GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100

CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160

CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

GAGAAGrTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460

CCAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 2700 

GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATAGGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCOGCCCGGT 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGOGC ACAOGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTCCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500

GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560

CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 

GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 

GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 

CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280 

CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 

GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 

AGCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 

ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 
GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 
TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 
AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 451 Protein sequence 
Protein Accession #: XP_051660.2



5640 
5700 
5760 






I 

MDGVNLSTEV 
LEDNVQSWKP 
DMRABVGLLS 
HMGQQLVGQY 
YNSLGHCFFT 
DCNAVSTFWM 
NNRAHSNYRA 
AYKNQDHGAW 
GTEKMDNRIW 
LAFRLNNAWQ 
SEYPGSYLTK 
HPLYLBGALT 
RGTTFSILSD 



11 

I 

VYKKGQDYRF 
GDTLV1ASTD 
RNIIVMGEME 
PIHFHIAGDV 



ANPNNNLINC 
GMI IDNGVKT 
LRGGDVWLES 
GPGGLDHSGR 
SCPHNNVTGI 
NDNWLVRHPD 
RSTHYQQYQP 
VHNRLLKQTS 
MKGCERIKIK 



TKDHFLEVKM 
NSILQGIPWQ 
GFKGSFRPIW 



LFNYVATIPD 
VTLDTEDHKA 



21 
I 

ACYDRGRACR 
YSMYQAEEFQ 
DKCYPYRNHI 
DERGGYDPPT 
DHCLGLLVKS 
AAAGSEETGF 
TEASAKDKRP 
CRFADNGIGL 
TLPIGQNFPI 
APEDVPITSR 
CINVPDWRGA 
WTLQKGYTI 
KTGVFVRTLQ 
ALIPKNAGVS 
WNDFAYIEVD 
NSIVLMASKG 
KIFQWPIPV 



31 
I 

SYRVRFLCGK 
VLPCRSCAPN 
CNFFDFDTFG 
YIRDLSIHHT 
GTLLPSDRDS 
WFIFHHVPTG 
FLSIISARYS 
TLASGGTFPY 
RGIQLYDGPI 
VFFGEPGPWF 
ICSGCYAQMY 
HWDQTAPAEL 
MDKVBQSYPG 
DCTATAYPKF 
GKKYPSSEDG 
RYVSRGPWTR 
VKKKKL 



41 
I 

PVRPKLTVTI 
QVKVAGKPMY 
GHIKFALGFK 
FSRCVTVHGS 
KMCKMITGDS 
PSVGMYSPGY 
PHQDADPLKP 
DDGSKQEIKN 
NIQNCTFRKF 
NQLDMDGDKT 
IQAYKTSNLR 
AIWLINFNKG 
RSHYYWPEDS 
TERAWDVPM 
IQVWIDGNQ 
VLEKLGADRG 



Seq ID NO: 452 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 261..2861



GAGCTAGCGC 
CGGCGCGGGG 
AGCTACCACT 
ACGTCCGGGG 
AGAGGGAGCA 
TGCTGACCAT 
CTGGGTGCCC 
ACCATGTGCA 
CCATCCACAT 
TGCGAACCCG 
GCCCTTTCCA 
CGGATOCTTA 
ATGGACAGAA 
CAGAAGGAGG 
TCGACCCCAA 
AAGAGAGTGA 
TTGCAGTGAA 
AATTGGGAAG 
TGAAAGGAAA 
CTGCTGCTGC 
CTTTGTCCAG 
TATCTCAGAC 
TATGCAATCG 
AGGTTGTCTA 
GCCGGAGCTA 
CCATTGACAC 
AACCTGGAGA 
TCCAGGTGCT 
TGTACCTGCA 
TGAGCCGGAA 
ACATCTGCAA 
TTAAGGCAGC 
AGTACCGGAT 
CCACATACAT 
GCTCCAATGG 
TCACGGAAGA 
AGTCTGGAAC 
ACTCCTACCC 
GGATGGCCAA 
GATTTTGGTT 
GTTATTCAGA 
GGGCTGGCAT 
GGCCGTTCCT 
AGCCCCGGGA 
CCTGGCTGCG 
AAGGCTTCTT 
CCTCTGGAAT 
CGGGGTCGCC 
GGGAGCAATG 
CTCTGACTCC 
TCTTCATCCA 
CCTCGGCCTG 



11 
1 

TCAAGCAGAG 
AGCCAGCGGG 
CCGCTTGCCC 
CCGCTGCGCT 
CACTGCCAGG 
CAGCTGGCTC 
TGACCAGAGC 
TATCGGCCAG 
CTCAGAGGGA 
GCACATCCTG 
GGGCAATTTC 
CTATGGTCTG 
AAAGCTCTCC 
CTATTTTTTT 
ATCAGGCACA 
ACGTCTGGTC 
TGATGAAGGT 
CAAACACTTC 
TCCATCATCT 
CCGGGTATTC 
TGAGTGGGTT 
TAAAGGTGGG 
TCCCATTGAT 
CAAAAAAGGC 
CCGTGTAGGG 
CAATGTGAAC 
TACCCTGGTC 
TCCCTGCAGA 
CATCGGGGAG 
CATCATAGTG 
TTTCTTTGAC 
ACACTTGGAG 
TCACTTCCAC 
CAGGGACCTC 
CTTGTTGATC 
TGGGCCGGAG 
CCTCCTCCCC 
AGGGTACATC 
TCCCAACAAC 
TATTTTTCAC 
GCACATTCCA 
GATCATAGAC 
CTCAATCATC 
GCCGGCCATC 
CGGCGGGGAT 
GCTTACAGGA 
GGCTCAGGGA 
CTTTGCTCAC 
GGCTTTGCTG 
AAGAGGGTGA 
GGGAACTGAG 
GGATTTCAGA 



21 
I 

CCCAGCGCGG 
GCTGAGCGCG 
ACGCCCCGGG 
CCTGGCCOGC 
ATGGGAGCTG 
ACTCTGACCT 
CCTGAGTTGC 
GGCAAGACAC 
GGCAAGCTGG 
ATTGACAAOG 
ACCATCATTT 
AAGTACATTG 
TGGACATTTC 
GAAAGGAGCT 
GTCATCCATT 
CAGTATTTGA 
TCTCGAAATC 
CTGCACCTTG 
TCAGTGGAAG 
AAATTGTTCC 
CAAGACGTGG 
GAGAAAATTT 
ATACAGGCCA 
CAGGATTATA 
TTCCTCTGTG 
AGCACCATTC 
ATTGCCAGTA 
TCCTGCGCCC 
GAGATAGACG 
ATGGGGGAGA 
TTCGATACCT 
GGCACGGAGC 
CTGGCCGGTG 
TCCATCCATC 
AAGGACGTTG 
GAACGCAACA 
TCGGACCGTG 
CCCAAGCCCA 
AACCTCATCA 
CACGTACCAA 
CTGGGAAAAT 
AACGGAGTCA 
TCTGCCAGAT 
ATCAGACACT 
GTGTGGCTGG 
ATGAAGGCTG 
TTCAGCCCTC 
GTCTCTCTGG 
CTTATGAGCA 
AGTCCACAGA 
CACAGGGGGC 
GCTGGAAATA 



31 
I 

TGCTATCGGA 
GCCAGGGTCT 
AGCTCGCGGC 
GAGGCGTGAC 
CTGGGAGGCA 
GCTTCCCTGG 
AACCCTGGAA 
TGCTGCTCAC 
TCATTAAAGA 
GAGGAGAGCT 
TGTATGGAAG 
GGGTTGGTAA 
TGAACAAGAC 
GGGGCCACCG 
CTGACCGGTT 
ACGOGGTGCC 
TGGATGACAT 
GATTTAGACA 
ACCATATTGA 
AGACAGAGCA 
AGTGGACGGA 
CAGACCTCTG 
CTACAATGGA 
GGTTT0CTT8 
GGAAGCCTGT 
TGAACTTGGA 
CTGATTACTC 
CCAACCAGGT 
GCGTGGACAT 
TGGAGGACAA 
TTGGGGGCCA 
TGAAGCATAT 
ATGTAGACGA 
ATACATTCTC 
TGGGCTATAA 
CTTTTGACCA 
ACAGCAAGAT 
GGCAAGACTG 
ACTGTGCCGC 
CGGGCCCCTC 
TCTATAACAA 
AAACCACCGA 
ACAGCCCTCA 
TCATTGCCTA 
ACAGCTGCCA 
GGGGCATTTT 
CCTGCCGCTG 
CCCACTCATG 
CAGAGGAATT 
AGTGAGCTCC 
CTCCAGGAGA 
TAGAAAATAT 



41 

1 

CAGAGCCTGG 
GAACCCAGAT 
GCCTGGCGGT 
ACTGTCTCGG 
GGACTTCCTC 
GGCCACATCC 
CCCTGGCCAT 
CTCTTCTGCC 
CCACGACGAG 
GCATGCTGGG 
GGCTGATGAA 
AGGAGGCGCT 
CCTTCACCCA 
TGGAGTTATT 
TGACACCTAT 
CGATGGCAGG 
GGCCAGGAAG 
CCCTTGGAGT 
ATATCATGGA 
TGGCGAATAT 
GTGGTTCGAT 
GAAAGCTCAC 
TGGAGTTAAC 
CTACGACCGG 
GAGGCCCAAA 
GGATAATGTA 
CATGTACCAG 
CAAAGTGGCA 
GCGGGCGGAG 
ATGCTACCXX: 
CATCAAGTTT 
GGGACAGCAG 
AAGGGGAGGT 
TCGCTGCGTC 
CTCTTTGGGC 
CTGTCTTGGC 
GTGCAAGATG 
CAATGCTGTG 
TGCAGGATCT 
CGTGGGAATG 
CCGAGCACAT 
GGCCTCTGCC 
CCAGGACGCC 
CAAGAACCAG 
TTTCAGAGGG 
GCTGGGGGGA 
CCTGCTGAAG 
ATGGAGAAGT 
CAGTCCCCAG 
TGGCTTAGGG 
CCCTAGATGT 
CTAGCCCAAA 



51 
I 

DTNVNSTILN 
LHIGEEIDGV 
AAHLEGTELK 
NGIiLIKDWG 
YPGYIPKPRQ 
SEHIPLGKFY 
REPAIIRHFI 
5LFVGESGNV 
VALEGRHTSA 
SVFHDVDGSV 
MKIIKNDFPS 
DWIRVGLCYP 
GLLFLKLKAQ 
PKKLFGSQLK 
GRWSHTSFR 
LKLKEQMAPV 



51 
I 

CGAGCGCAAG 

TTCCCAGACT 

CAGOGACCAG 

CTACAGACCC 

TTCAAGGCCA 

ACAGTGGCTG 

GACCAAGACC 

ACGGTCTATT 

CCGATTGTTT 

AGTGCCCTCT 

GGTATTCAGC 

CTTGAGTTGC 

GGTGGCATGG 

GTTCATGTCA 

AGATCCAAGA 

ATCCTTTCTG 

GCGATGACCA 

TTTCTAACTG 

CATCGAGGCT 

TTCAATGTTT 

CATGATAAAG 

CCAGGAAAAA 

CTCAGCACCG 

GGCAGAGCCT 

CTCACAGTCA 

CAGTCATGGA 

GCAGAAGAGT 

GGGAAACCAA 

GTTGGGCTTC 

TACAGAAACC 

GCTCTGGGAT 

CTGGTGGGTC 

TATGACCCAC 

ACAGTCCATG 

CACTGCTTCT 

CTCCTTGTCA

ATCACAGAGG 

TCCACCTTCT 

GAGGAAACTG 

TACTCCCCAG 

TCCAACTACC 

AAGGACAAGC 

GACCCGCTGA 

GACCACGGGG 

GAGGCTCAGG 

GATGAGGCAG 

CTGGTGACTA 

GTGGTCAGAG 

GCAGCCCTGC 

CCTCATTTGC 

GCTCGTACTC 

GCCTTCATTT 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860 
1920 
1980
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 




TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC 
TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG 
GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC 
AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC 
AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT 
AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG 
TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG 
CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT 
TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA 
TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG 
GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA 
AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG
TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA 
GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG 
GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG 
TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA 
GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT 
TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT 
AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC 
TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG 
TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG 
TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT 
TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT 
ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT 
GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG 
TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG 
AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: 453 Protein sequence
Protein Accession #: Eos sequence 



MGAAGRQDFL 
GKTLLLTSSA 
TIILYGRADE 



SRNLDDMARK 
KLPQTEHGEY 
IQATTMDGVN 
STILNLEDNV 
EIDGVDMRAB 
GTELKHMGQQ 
KDWGYNSLG 
PKPRQDCNAV 
LGKFYNNRAH 
IRHFIAYKNQ 
PSPPCRCLIiK 



11 
I 

PKAMIiTISWL 
TVYSIHISBG 
GIQPDPYYGL 
VHVIDPKSGT 
AMTKLGSKHF 
FNVSLSSEWV 
LSTEWYKKG 
QSWKPGDTLV 
VGLLSRNIIV 
LVGQYPIHPH 
HCFFTEDGPE 
STFWMANPNN 
SNYRAGMIID 
DHGAWLRGGD 
LVTTGSPFAH 



21 
I 

TLTCFPGATS 
GKLVIKDHDE 
KYIGVGKGGA 
VIHSDRFDTY 
LHLGFRHPWS 
QDVEWTEWFD 
QDYRFACYDR 
IASTDYSMYQ 
MGEMEDKCYP 
LAGDVDERGG 
ERNTFDHCDG 
NLINCAAAGS 
NGVKTTEASA 
VWLDSCHFRG 
VSLAHS 



31 



TVAAGCPDQS 
PIVLRTRRIL 



RSKKESERLV 
PLTVKGNPSS 
HDKVSQTKGG 
GRACRSYRVR 
AEEPQVLPCR 
YRNHICNPFD 
YDPPTYIRDL 
LLVKSGTLLP 
EETGPWFIFH 
KDKRPFLSII 
EAQEGFIiLTG 



CACACAGCTA 
CCTCAACAAC 
AAGTCCCCTC 
TCCTGCTCCC 
CACTTTGTTC 
TGCTACCTGG 
CTCGCCATGT 
CCACCCCAGT 
GAGAGCCCAA 
AGGCACCAGA 
CCACAAACTC 
AGACTCGGTC 
CCAAACATCT 
CTTCAAAGGC 
AGAGTTAAAA 
ATGCAAGGGT 
GTTGAATGTC 
GTACATATGT 
AAGAGCCAAT 
TCCTCCTTGT 
CTCCTCTGAA 
AGATGTATAG 
ATTTTCTATT 
TGTCTTAAAT 
CCAAGGAAAT 
ATGTCAAAAA 



41 

I 

PELQPWNPGH 
IDUGGELHAG 
WTPLNKTLHP 
QYLNAVPDGR 
SVEDHIEYHG 
EK1SDLWKAB 
FfcCGKPVRPK 
SCAPNQVKVA 
PDTFGGHIKP 
SIHHTFSRCV 
SDRDSKMCKM 
HVPTGPSVGM 
SARYSPHQDA 
KKAGGILLGG 



Seq ID NO: 454 DNA sequence 

Nucleic Acid Accession #: NM_013282.2 

Coding sequence: 85..2466



1 
I 

CGACTCCTTA 
GTCCCTCCCC 
ACCCACACGG 
CAGGAGCTGT 
GAGGACGGCC 
GTCCGCCAGA 
ACCGACTCCG 
GCGGCCGCCG 
GGGCTGTACA 
GAGGCGCAGG 
ACGTCCAGGC 
GAGAACGGCG 
AAGTGGCAGG 
AAGGAGCGGG 
OGGGAACTCT 
TTCGTGGACG 
CCCATGAGAC 
TGCCGGGTCT 
TGCGATGAGT 
CCCAGCGAGG 
GCGGGAGAGC 
TCACAGCGGG 
GTCCOGTCCA 
CGAGTCCAGG 
AGCAACGACG 
GGGAATTTTT 
GCGGAACAGT 
TTTGCTCCCA 
GTCAGGGTGG 
AACCGCTACG 
TTTCTCGTGT 
GAGGGGAAGG 



11 

I 

GAGCATGGCA 
TCAGCGCCGA 
TGGACTOGCT 
TCCACGTGGA 
ATACCCTCTT 
GCCTCGTGCT 
GCTGCTGCCT 
AGACTGACAG 
AGGTCAATGA 
TGGTCAGGGT 
CGGOGCTGGA 
TGGTCCAGAT 
ACCTGGAGGT 
GCTTCTGGTA 
ACGCCAACGT 
AAGTCTTCAA 
GGAAGAGCGG 
GCGCCTGCCA 
GCGACATGGC 
ACGAGTGGTA 
GGCTGAGAGA 
ACTGGGGCAA 
ACCACTACGG 
TCAGOGAGTC 
GAGOGTACTC 
TCACATACAC 
CTTGTGATCA 
TCAATGACCA 
TGCGCAATGT 
ATGGCATCTA 
GGCGCTACCT 
ACCGGATCAA 



21 

I 

TGGCTCAGAG 
CACCATGTGG 
GTCCAGGCTG 
GCCAGGCCTG 
CGACTACGAG 
CCCCCACAGC 
GGGCCAGAGT 
CAGGCCAGCC 
GTACGTCGAT 
GACGCGGAAG 
GGAGGACGTC 
GAACTCCAGG 
GGGCCAGGTG 
CGACGCGGAG 
GGTGCTGGGG 
GATTGAGCGG 
GCCGTCCTGC 
CCTGTGCGGG 
CTTCCACATC 
CTGCCCTGAG 
GAGCAAGAAG 
GGGCATGGCC 
ACCCATCCCG 
GGGTGTCCAT 
CCTAGTCCTG 
GGGTAGTGGT 
GAAACTCACC 
AGAAGGGGCC 
CAAGGGTGGC 
CAAGGTTGTG 
TCTGCGGAGG 
GAAGCTGGGG 



31 

I 

GTGCTGGTAA 
ATCCAGGTTC 
ACCAAGGTGG 
CAGAGGCTGT 
GTCCGOCTGA 
ACCAAGGAGC 
GAGTCAGACA 
GATGAGGACA 
GCTCGGGACA 

ATTTACCACG 
GACGTCCGAG 
GTCATGCTCA 
ATCTCCAGGA 
GATGATTCTC 
CCGGGTGAAG 
AAGCACTGCA 
GGCCGGCAGG 
TACTGCCTGG 
TGCGGGAATG 
AAGGCGAAGA 
TGTGTGGGCC 
GGGATCCCCG 
CGGCCCCAGG 
GCGGGGGGCT 
GGTCGAGATC 
AACACCAACA 
GAGGCCAAGG 
AAGAATAGCA 
AAATACTGGC 
GACGATGATG 
CTGACCATGC 



41 
I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACTCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGACGAGCC 
TGAAATACGA 
CGCGCGCCOG 
ACTACAACCC 
AGCGOGAGAC 
TGAACGACTG 
GGAGCCCCAT 
AGGACGACGT 
ACCCOGACAA 
ACCCGCCCCr 
ATGCCAGCGA 
TGGCCTCGGC 
GCACCAAGGA 
TGGGCACCAT 
TGGCTGGCAT 
ATGAGGATGA 
TTTCCGGCAA 
GGGCGCTGGC 
ACTGGCGGTC 
AGTACGCCCC 
CCGAGAAGGG 
AGCCTGGCCC 
AGTATCCAGA 



AGGGAGGGCC 
CGGCCCCAGA 
GAAGGAAAGG 
AGCGCACACA 
ACTACCTGTC 
CTCTCCTGTC 
TTCTGGTGAG 
CCCTTTCAGC 
AGAGCTCCTG 
GTCTCCCTGG 
TTTCCTTCAA 
CAAGAGTCCA 
TTCAGCTGCT
CAGAGTCACA 
TGACCTCATG 
CTCACACTGT 
TTTGGCTCAG 
TTCACAGTAC 
ATCTAGGCAT 
TATTTCTGTT 
ATGCTTGTCT 
AGTGTTTGTA 
TATTTATTAT 
GTCATGATTG 
TGGCTCTGGT 
AAAAAAAAAA 



51 
I 

DQDHHVHIGQ 
SALCPFQGNP 
GGMAEGGYFF 
ILSVAVNDEG 
HRGSAAARVF 
PGKIQ3RPID 
LTVTIDTNVN 
GKPMYLHIGS 
ALGPKAAHLE 
TVHGSNGLLI 
ITEDSYPGYI 
YSPGYSEHIP 
DPLKPREPAI 
DEAASGMAQG 



51 
I 

GGTTTTTGCT 
OGGGAGGCAG 
GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCACGGCGAG 
GACGGAATTG 
GGOGTGGTTT 
CTGCAGCTCC 
CGACTACCCG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 
CAGCAGTGTT 
GGTGGTACTG 
CACATCGTCC 
ATGTACCATC 
GTGGCGGTTC 
ACACGGCCGG 
CGTGGACCAT 
CAAGAGGACC 
TCTCAACTGC 
GGGGAAGCCG 
CGCTGAGGGC 
GAAGTCCGGG 
TTGGACGAAG 
AGGCTACCTG 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
60
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680
1740 
1800 
1860 
1920 




GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 

GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 2100 

CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 2520 

CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2880

AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940

TGGCCCGTGG CAGCCCGTGG CATGGOGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000 

AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060 

TGCTTAGOGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 3120 

CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC 3180 

TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 3240 

GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 

Seq ID NO: 455 Protein sequence 
Protein Accession #: NP_037414.2

1 11 21 31 41 51 

I I I I I I 

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELFHVEP GLQRLFYRGK QMEDGHTLFD 60

YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120

PADEDMWDET ELGLYKVNEY VDARDTNMGA WFEAQVVRVT RKAPSRDEPC SSTSRPALEE 180

DVIYHVKYDD YPENGVVQMN SRDVRARART IIKWQDLEVG QVVMLNYNPD NPKERGFWYD 240

AEISRKRETR TARELYANVV LGDDSLNDCR IIFVDEVFKI ERPGEGSPMV DNPMRRKSGP 300

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAF HIYCLDPPLS SVPSEDEWYC 360

PECRNDASEV VLAGERLRES KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP 420

IPGIPVGTMW RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VLAGGYEDDV DHGNFFTYTG 480

SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPINDQE GAEAKDWRSG KPVRVVRNVK 540

GGKNSKYAPA EGNRYDGIYK VVKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKK 600

LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720
TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT

Seq ID NO: 456 DNA sequence 

Nucleic Acid Accession #: NM_001200.1

Coding sequence: 325..1514

1 11 21 31 41 51 

I I I I I I

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGOG CCGGTGCCTT 60 

TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 120 

TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCOGG 180 

GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGAOGTGTCC CCGCGTGCTT CTTAGACGGA 300 

CTGOGGTCTC CTAAAGGTOG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 360 

TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTOGTTCC GGAGCTGGGC CGCAGGAAGT 420 

TOGCGGCGGC GTCGTCGGGC GGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 480 

TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACOCACCCCC AGCAGGGACG 540 

CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 

CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 

ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 720 

TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 

TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC OGAATTAATA 840 

TTTATGAAAT CATAAAACCT GCAACAGCCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900 

ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020 

TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080 

ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140 

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 

GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGCCA CGGAGAATGC CCTTTTCCTC 1320 

TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380 

ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 
GTGGGTGTCG CTAGTACAGC AAAATTAAAT ACATAAATAT ATATATA 

Seq ID NO: 457 Protein sequence 

1 11 21 31 41 51 

I I I I I I 

MVAGTRCLLA LLLPQVLLGG AAGLVPELGR RKFAAASSGR PSSQPSDEVL SEFELRLLSM 60 

FGLKQRPTPS RDAVVPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR SFHHEESLEE 120 

LPETSGKTTR RFFFNLSSIP TEEFITSAEL QVFREQMQDA LGNNSSFHHR INIYEIIKPA 180 
TANSKFPVTR LLDT 

Seq ID NO: 458 DNA sequence 

Nucleic Acid Accession #: NM_001999.2 

Coding sequence: 1..8736 

1 11 21 31 41 51 

I I I I I I 

ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360 

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480 

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 600 

TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 720 

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 

CGACGGGGTT TCATCCCCAA CATCOGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 

AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 

TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 

TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 

AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 

AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 

CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 

CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 

TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCGAGGAG GGACAGGCTT CATCCCCATC 1380 

CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 

GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500 

GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 

AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620 

AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 

GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 

AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 

TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 

ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 

ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 

TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 

TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 

ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 

GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 

CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 

ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 

GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 

GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 

GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 

GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 

GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GOCCGGCAGC 2640 

AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 

ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 

GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 

CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2880 

TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 

TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 

CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 

ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 

AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 

GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240 

GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 

AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360 

TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 

TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 

TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 

CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 

AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 

ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 

GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 

GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 

GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900 

ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 

ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 

GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 

ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 

GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 

AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 

AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 

GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 

AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 

CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 

GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 

GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 

TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 

TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 

AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGOGTC 4800 

AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 

CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 

AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 

CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 

TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 

GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 

GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 

OGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 5280 

TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340 

CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 

CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 

GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 

AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 

CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCGGCGGGT 5640 

TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700 

GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 

GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 

CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 

GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 

GGTCAGGTGT GCAGAAATGG AOGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 

AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 

GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 

ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 

CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTOGCCAG 6300 

AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 

ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 

CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 

GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 

TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 6600 

TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 

CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 

TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 

CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260 

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 

GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 

AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 

ACAGACATCA GTGGAACCTC TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 

GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 

AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 

TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 

CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 

GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7980 

AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 

TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160 

CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 

GGGCAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280 

TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 

CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 

GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 

AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8640 

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 

GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 

TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8820 

AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 

CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT TTTTTTTTTT 9060 

CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 9120 

GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 

AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 

CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 

CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 

TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 

CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 

CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 

TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 

ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 

GGAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC 9840 

TGTAGTTATA CACCATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 

ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 

TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 

CCAAACCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080 

CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 
TTAATAACTA AAAAAAAACT CGTGCC 

Seq ID NO: 459 Protein sequence 
Protein Accession #: NP_001990.1 

1 11 21 31 41 51 

I I I I I I 

MGRRRRLCLQ LYFLWLGCVV LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60 

PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WXTLPGGNQC IVPICRNSCG 120 

DGFCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 

ENGCQNGGRC IAQPCACVYG FTGPQCERDY RTGPCFTQVN NQMCQGQLTG IVCTKTLCCA 240 

TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVOECQ AIPGICQGGN CINTVGSFEC 300 

RCPAGHKQSE TTQKCEDIDE CSIIPGICET GECSNTVGSY PCVCPRGYVT STDGSRCIDQ 360 

RTGMCFSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE BYRRLCMDGL 420 

PMGGIPGSAG SRPGGTGGNG FAPSGKGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ 480 

GPIITGLTIL NQTIDICKHH ANLCLNGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 540 

NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC VNSDGSFQCI 600 

CNAGFELTTD GKNCVDHDEC TTTKMCLMGM CINEDGSFKC ICKPGFVLAP NGRYCTDVDE 660 

CQTPGICMNG HCINSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 

AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG ITVDGRDINE CALDPDICAN 780 

GICENLRGSY RCNCNSGYEP DASGRNCIDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 

VFRTBTETCE DINECESNPC VNGACRNNLG SFNCECSPGS KLSSTGLICI DSLKGTCWLN 900 

IQDSRCEVNI NGATLKSECC ATLGAAHGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

PPGVCPNGRC VNSKGSPHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 1020 

MDACCCAVGA AWGTECEECP KPGTKEYETL CPRGAGPANR GDVLTGRPFY KDINECKAFP 1080 

GMCTYGKCRN TIGSFKCRCN SGFALDMEER NCTDIDECRI SPDLCGSGIC VNTPGSFECE 1140 

CFEGYESGFM MMKNCMDIDG CERNPLLCRG GTCVNTEGSF OCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSCNPGY QATFDRQGCT DIDECMIMNG GCDTQCTNSE 1260 

GSYECSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGPNASMDHX 1320 

TCIDVNECDL NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 1380 

ASCLNIPGSF KCSCREGWIG NGIKCIDLDB CSNGTHQCSI NAQCVNTPGS YRCACSEGFT 1440 

GDGPTCSDVD ECAENINLCE NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 1500 

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECSCPPD 1560 

PQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTEIGVGV SRSSCCCSLG KAWGNPCETC 1620 

PPVNSTEYYT LCPGGEGFRP MPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECPAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN ELPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKTIC GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGXCAN GVCINQIGSF RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1860 

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLEIPN VCSHGLCVDL QGSYQCICHN 1920 

GFKAEQDQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ 2100 

SFCFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDBVAF QDLCPYGHGT 2160 

VPSLBDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSFECNCNE GFEPGPMMNC EDINECAQNP LLCALRCMNT PGSYECTCPI 2280 

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 2340 

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL OTICQMASSS 2400 

RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPNLCT 2460 

NGQCINTMGS FRCPCKVGYT TDISGTSCID LDECSQSPKP CNYICKNTEG SYQCSCPRGY 2520 

VLQEDGKTCK DLDECQTKQH NCQFLCVNTL GGPTCKCPPG FTQHHTACID NNECGSQPLL 2580 

CGGKGICQNT PGSFSCECQR GFSLDATGLN CEDVDECDGN HROQHGCQNI LGGYRCGCPQ 2640 

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGFSPDQPSS ACHDVNECSS 2700 

SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENALSPEA 2760 

CYECKINGYP KKDSRQKRSI HEPDPTAVEQ ISLESVDMDS PVNMKFNLSH LGSKEHILEL 2820 

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYLHTAKK KLMPGTYTLE XTSIPLYKKX 2880 
ELKKLEESNE DDYLLGELGE ALRMRLQIQL Y 



Seq ID NO: 460 DNA sequence 

Nucleic Acid Accession #: NM_013372.1 

Coding sequence: 63..617 

1 11 21 31 41 51 

I I I I I I 

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG OGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GGGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCGGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTOGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG QGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGSATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 

TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGOGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGOGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAAT Cr.IT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

Seq ID NO: 461 Protein sequence 
Protein Accession #: NP_037504.1 

1 11 21 31 41 51 

I I I I I I 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 

GQGRGTAMPG EEVLBSSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 120 

QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMKVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 

IDLD 
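
The "Coding sequence" coordinates given for each DNA entry can be cross-checked against the length of the paired protein entry. The following is a minimal sketch, not part of the original listing: it assumes the coordinates are 1-based and inclusive and that the range ends with the stop codon, and the function names are hypothetical. It uses the range 63..617 of Seq ID NO: 460 and the residues of Seq ID NO: 461 as printed above.

```python
def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coding range such as 63..617."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by a coding range, assuming it ends with a stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"range {start}..{end} spans {n} nt, not a whole number of codons")
    return n // 3 - 1  # the last codon is the stop codon and encodes no residue


# Rows of Seq ID NO: 461 exactly as printed in the listing (position numbers dropped).
protein_461_rows = [
    "MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR",
    "GQGRGTAMPG EEVLBSSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG",
    "QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMKVTLNCPE LQPPTKKKRV TRVKQCRCIS",
    "IDLD",
]
protein_461 = "".join(row.replace(" ", "") for row in protein_461_rows)

listed_length = len(protein_461)
implied_length = expected_protein_length(63, 617)
print(listed_length, implied_length)
```

A mismatch between the two numbers, or a range that is not a multiple of three, would suggest that either the coordinates or the printed residues were damaged in transcription.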

Seq ID NO: 462 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2733 

1 11 21 31 41 51 

I I I I I I 

ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 60 

TTCCTGGGGA AAAATGATGG CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120 

CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180 

GAGAAAAGAG ATTTGAGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGTCACAT 240 

GGGCTAATTA GAATTATCAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300 

CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTOCTC CCTCATGCCT TGATCCCCAG 360 

AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420 

AGCCAGAGTG TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480 

AGGTTTACAA ATGACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540 

ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600 

ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660 

TCTGAACTGC TGTCAGCCAT TGAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720 

CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780 

GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840 

GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 900 

CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960 

GCAGCTGTGT CATCCTTCGT GCAAAATCTT TCTGTCATCA TTCGGCAAAA CCCATCAACC 1020 

ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1080 

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 1140 

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200 

AGCTCAOGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260 

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 1320 

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 1380 

AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440 

AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 1500 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 1620 

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800 

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 

TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 1980 

GTGTTCTTTA CACACTTCTT CTACCTCTCT TTOTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220 

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATTTTTTA TCTTATGCTT TGGAATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 

Seq ID NO: 463 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDLRNFLK LLKPPLLWSH GLIRIZRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNFCBRT KIWGTFKIKE RFTNDLLNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEVVGSSSA SELLSAIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRBTCV 300 

LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNLASVVS ILSNISSLSL 360 

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREBKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPBTII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPVVKWITY VGLGISIGSL 600 

ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VFPTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAFVVP ALAIVAVNFV VVLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPP NFLQNKGHYA FSHTGDSSDN 900 
IMLTQFVSNE 

Seq ID NO: 464 DNA sequence 
Nucleic Acid Accession #: AB035089. 
Coding sequence: 9845..10219 

1 11 21 31 41 51 

I I I I I I 

GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 

AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 

GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 

AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 

AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 

CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 

TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 

TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

CCATAGATTQ GTCCCCTGTA ACCCOGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTTC 1140 

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TCCTTTTCTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 

CTGGATGCAG ACTCAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320 

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 

AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 

TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 

TGGAGAAGAG TCTGGCATTT CCTCAAAATO TTAACCTGGA TTTACCATAT GACCCAGOGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 

AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2040 

TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100

ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 

AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 2220 

GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 

TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 

TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 

ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 

AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 2520 

CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 

GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 

AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 

GGGGAGGCCT CAGAATCACA G7AGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2760 

AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2820 

ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 

TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 2940 

GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 3000 

AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 3060 

CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 3120 

ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 

TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 

GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 

CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 3360 

AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACfTATTCT 3420 

CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 

TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 

TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 

TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 

GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 3720 

GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 

AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 

TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 3900 

CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 

GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 4020 

AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 

GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 

TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 

CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 

TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 4320 

TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 4380 

CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 4440 

CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 

CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 

TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 

AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 

TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 

CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 4800 

CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 

AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 

TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 

CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 5040 

AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 

CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 

GCCTTGCCCA TAAAAGTCAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 5220 

TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 

ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 5340 

CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 

GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 

TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTGC 5520 

TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580

AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 

CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 5700 

AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 

AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 5820 

AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 

GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 

GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 

ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 

AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 

ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG AOGTATCAAT TTTTACAGGT 6180 

AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 

AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 6300 

GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 6360 

ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 

TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 

CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 

GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 

CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 

GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 

CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 

TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 

ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 

TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6960 



CATCAAGAAA TTTTACCAGA CCAGTGTGGA
AAGTCGAAAG AAGATTAACT CCTGGGTGGA 
TTATAGAAAC ACCTTTGAGA AACCTATGCC 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC 
AGGATTTGTC CAAATGAATA TGAAGAGAGC
GCCAATTAGA TGGAAACACA TCTGGAGAAT 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT 
AGACTCAAAT TTTTATGATG AATATTTAAT 
TTTACTAATT GGGAAACAAG CAGCTCTCTG 

CTGCCTGGAT CACATCTGTA GCCAATGTGT
ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC 
TCTCATTACA GGCCAAACTT ACAGCAACTC 
GAAGTAGTGT CTGACAGCAC AGGACATGCG 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG 

AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA
GTTTATCAGT ATTCCAAATC AGATGATTGG 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA 

CATCTTCCAG ACAAGCCATT TGTAGCTTGT
GAAAAAATTA AAAACCTATT TCCTGATGGG 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG 
TAATACATGG AATGTTAAAC ATTTCTGATG 

TTGTTCATGT CTGTTATTTT GTTGTTTTAC
GAAAAACTAT TGTTTCTAAC TCATGGAATT 
TGTTGCTATC TCAATAATAT TATCTTTTTT 
ACATTGATTT ATTGCAGAAT ACATACAAAT 
TTAATTTTGC CTTGCTGGAG GATGTACAGG 

AAGATCTAAG CATGATTGTG CTGCTGCCAA
CTTGCATCTA CAACTCTTCC TTCTACTGCC 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC 
ATGACTCACC TTCATATTAC AAATATTTGA 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT 

ATGTGCACAA TGTGCAGGTT AGTTACATAT
CCATTAACTC ATCATTTAGC GTTAGGTATA 
CACCCCACAA CAGTCCTCAG AGTGTGATGT 
TCAATTCCCA TCTATGATTT AATTCCATCT 
TTACATGCAT TGGATATCAA TTTGATGATA

AGAAATATAT ACCAATGTGA GGAAAGTTTA
ATTTAGGAAT TTAGGGAATT ACAATTTAAT 
GAAAAATATG ATGAGCCTAT TAAAAATTGA 
AGTGATACAT ACAGTTCATT GTGTACAAAA 
ACAGTTGTAT ACATACATAT GTACACATAT 

ACATACATGT ATATACATAT ACACATATAA
TTTTACCTAT TAGAATAATA ATGTCTATTA 
CCAAAATAAC GAATCTCCAC ATAGTCAATT 
TAGTCATATC AGTTTCTTTT TTCCATTTGT
ATTGATGGAA TGGACAAGTT TGCAGAATAT

TCGGTTCAAA ATGGAAGAGA GCTATGACCT
GAATATCTTC AATGGGGATG CAGACCTCTC 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA 
TGCCACCGCT GTAGTAGTAG TCGAATTATC 
TAATCACCCT TTCCTATTCT TCATAAGGCA 

CAGATTCTCA TCCCCATAGA TGCAATTAGT
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA 
CTATCTCATC TTGATGATGA TAGTCATCAT 
TTCTCTCTTT CTCTTAATAA GCCCACATAT 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA

ATATAATTCT GTTTCTGACC TGTTTTAAAT
ATTTAGCAAC CTAGAAACAC ACATTTCTTT 
GTTTCTAAAT TTTGTGATTC TATAAAACAC 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG 
AATTTAGAGA TATATTAATC ATATATAAAG

GCCTGGAGTA GAAGGCATAT CCCAGAACAG
CTATTTTATG CTAAAGGGAT AAGAAACTCA 
GGGGCATAAA ATTATCATAT CCACATCTAG 
TGGATGCGAG GAAAGAACAG TGTGGTTACC 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC

AGAAAGACAT CTAAGTTGCT GACCTATCTG
GAGAGAAAAA AAGACTTTGG GTTTAAATTT 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA 
TGATCTGAAG CTCTAAATTT GTGATATTCA 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA 

AAAGAAGAAT CTAGAATTGA CCTTGAAGTA
CATTTTTATT TTCCAGAAAG TAGCTTTTCT 
TCCC 




Seq ID NO: 465 Protein sequence 
Protein Accession #: BAB21525.1 

1 11 



ATCTACTGAT 
AAGTCAAACG 
AGTGAGCCTT 
AGGGAGGGCT 
CTAGGGGAGC 
TATTTGCTTA 
ATTTTGAGAT 
AGAAAACATT 
GTAAATCACC 
TCTGCAGGGA 
ATTCTACAGA 
AACATGAGAG 
TTTCATATTA 
AATGGGGACC 
AGACACATGT 
AGACATTCAT 
AAGGGTAACT 
GTGAGTCTAA 
ATTAACGATC 
GTAAAAGACT 
ACTATTGGCA 
GAGAATAAAT 
TCTATATTTT 
GAATGTAACA 
TCTTATAACT 
CCTGGGTTAT 
GTCTTGTGTT 
CTGTACAGAT 
CCAAGGTCCT 
ATGAAATCGA 
GGACATTTTT 
AAAATGATGA 
GCATAGGGCC 
ATTATTATAA 
GTATACATGT 
TCTCCTAATG 
TACCTTCCTG 
ATGGCTTAGT 
TTATTATGCA 
CAAATAGGCC 
AATTGCAATG 
CACATGTAGT 
TAATGTAATC 
ACATATACGT 
CCCAATGTAT 
AAGTGAACCT 
CATTGTTAAG 
ATAGCTTGAA 
GAGAGAGACA 
CAAGGACACG 
AGGCATGACC 
GGTCACTGAG 
ATCTCCTTCA 
AAATAAGACC 
CTGTCACTOC 
CAACAGATTC 
CAAGAATTTA 
AAATGTACTT 
TCATTTAATA 
GAACCAAACC 
GAATTTAGGT 
ATCATCAATA 
GGCAAGTGTC 
GAAAATTAAA 
GAGGAGCCTT 
TTACAGGCTT 
AAAATACATC 
ATATATAAAT 
TGGGGGAATT 
ACTGGATGGA 
GTACTTGATG 
GATATTAGTC 
ATATAAATAC 
GATACTAGGG 
TATCAGCATG 
TAGGGTTCCA 



TTTGCAAATG 
AATGGTAGGA 
GTGCTTGACA 
GAAGAGGGCA 
CAAGGAGGAA 
TGGCCCTGCA 
CATATCCTTT 
AGAAAGCGTC 
CTTTTGTCTC 
TTATCACAGC 
CATCTTTCTT 
TGAATAGGAA 
CAGAGCTCAA 
CCGTAGTTAA 
TGGAAGAGTA 
ACACAGAGAA 
TTATAOGACT 
CAATTCAAAG 
AGTCACGAAC 
CTTTTATTCT 
ATGATACGAC 
TTAAAAAAGA 
ATTTATATAG 
TGATAAGTAA 
TTATTTAGTT 
TTCTTAGAAG 
TCACGTGTTA 
GATGAGGCAA 
GGAAATACCA 
TGGTCTGCAG 
CCAAAGATAC 
AAAATTCTAA 
TGACACAAAC 
TACTTTAAGC 
GCCATGCTGG 
CTATCCCTCC 
TGTCCAAGTG 
TAATGATTAA 
GCAATTTAAA 
GAGTAGAAAA 
TGTACTAAAT 
AGGCTGTTGG 
ATATTTTACA 
AAAAACATGA 
TTATATATTC 
TCTGTATTTC 
GTGTATTAGA 
GAGAAACTCA 
TGTGTCGATT 
TTGAGAACCA 
TGGAGCCACG 
GAGGGAGTGG 
ACTAATGAAG 
AACAGCATCC 
ATTTAGAAAA 
TCTTGGCTCA 
ATGATTAAAA 
TTCCTTCCAO 
CCGTGTCTTC 
AAATCATACT 
GATACCTAAA 
AAATAATGAC 
CAGTTACACA 
AACAGAGTAG 
GTAAACCACA
TGATGGTTGT 
TCTGGCTACG 
TAGGAAATCA 
ATTTAGGAAA 
TGGAAGAATA 
AATTAAGGTA 
TAAATCTCAG 
TTTAGAGTCA 
AGAAAGGATA 
TGTAAAGATC 
TATTTACTCC 



CTCCAGAAGA 
GAGCCACCCA 
CTGCATGGGG 
CTCCAGATGA 
ATCACAGGAA 
TGACAATAGC 
ACTTTAAATC 
TCTCGTCTCC 
TGAGCTGGAG 
TCTCTTCCCC 
ACCTCCCACC 
GATACCCCCG 
GTCACTCATC 
AAAAAAATAA 
GACAAAATCA 
CGTGAACTCC 
GGAGCCTTGA 
ATCAAAGTCA 
ATTTGCAAAG 
TTCCCTTGCA 
ACTGGTTCTT 
AAACACTAAA 
TGTAATATGT 
AAAATAAAAA 
AGGAATACCT 
AAGAAGGATG 
TTTGTTGGAC 
TACAATTCCT 
TACAAAGGCA 
AAGGTAAGAA 
CAAGTTTAAA 
ATGAGGAATG 
TGAAAGCTTA 
TTTAGGGTAC 
TGTGCTGCAC 

TTCTCATTGT 
TTTATTAGAG 
CTTGACTGGG 
GGGAATACAA 
AATGTATACA 
CACAAGAAAT 
TGTGTATCAT 
TTCTGTTTTT 
AGGACTCATA 
ACATTTATTG 
GATCGACAGT 
CTGCTGAGAA 
TACACTTACC 
TGGGAATGGT 
GTCTCTCAGT 
AAGCTGCAGC 
AGTTCTGTTG 
TCTTCTATGG 
TGTTCACCTA 
TATTTCTTTT 
TAGCATGCCT 
AAAAATTTCC 
TAAATTTGAA 
TTCTCTTCAA 
TCCTTCTTAT 
ATAAAATCAT 
TAGGAAAGAT 
TTCATGATGA 
TAGGAACTTC 
TTGTCAAAGA 
CTGATATCAA 
TTAGAGTATT 
TAATAGTTAC 
TCTTGTTTCT 
CTTTTAATAT 
AAACAGAATA 
TTGGGATAAA 
AAGTTAGAAG 
AGGAATTGAT 
CATAGATTCT 



7020
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220
8280
8340
8400
8460
8520
8580
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 






51 



21 31 41 

I I I I 

MNSLSEANTK FMFDUQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 60 

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQPLQEYL 120 

DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 180 

YFKGQWENKP KKENTKBEKP WFNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240

LSMXVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 300
TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAW WELSSPSTN 360
EEPCCHHPFL FFIRQNKTNS ILFYGRFSSP

Seq ID NO: 466 DNA sequence
Nucleic Acid Accession #: NM_001910.1
Coding sequence: 50..1240



GGAGAGAAGA 
CCTTCTTTTG
GCCCCTCAGG 
GTTCTGGAAA 
GAGTGCCAAG 
TGGCTCCCCA 
CCCCTCTGTG 
GTCCAGCACA 
GTCCGGGATC 
GTTTGGAGAA 
TCTGGGCCTG 
GATGGCTCAG 
AGGTGGTGCG 
CCTGAATTGQ 
OGTGGGAGGC 
TTCCCTCATC 
CCCCGTGGAT 
CTTCACCATT 
CGTGGATGGA 
TGGGCCCCTC 
TGGGAATAAC 
CCTGCCTGTC 
GTTATTTTCC 
ACATGAGAAT 
ACTCCCACCA 
TGATTATGAA 
GCAGAGATCA 
CACACGGCCA 
GTACCTGGAT 
TAACATCCTT 
CTGTTTTTGT 
TTTGAAATGT 
TTGAAATCCC 
AGTTAGATCT 
CTTGTTGCAT 
CTGCTGGGTG 



11 
I 

AAGGAGGGGG 
CTGCTGGTGC 
AGGCATCCGT 
TCCCATAATT 
GAACCCCTCA 
CCACAGAACT 
TACTGCACTA 
TACAGCCAGC 
ATTGGAGCCG 
AGTGTCACAG 
GGATACCCCT 
AACCTGGTGG 
GGGAGCGAGC 
GTCCCAGTCA 
ACTGTTATGT 
ACTGGCCCTT 
GGAGAATATG 
AACGGAGTCC 
ATGCAGTTCT 
TGGATCCTGG 
OGTGTGGGAC 
TGACAGACCT 
AGAGAATGTA 
ACACACACAC 
CCGTCATGAT 
AATCAAAAAT 
TGGTATAATA 
GGCCTGTTTA 
CATTCTGAAG 
AAATATACAA 
AGTTGGATTG 
CTGTAAGTCT 
GAGGTGTCAT 
GGAGGTCTGA 
CCTGTCAGGA 
GGTTTGGAGT 



21 
I 

CAAGGGAGAA 
TCCTGGAGCT 
CCCTCAAGAA 
TGGACATGAT 
TCAACTACTT 
TCACTGTCAT 
GCCCAGCCTG 
CAGGTCAATC 
ACCAAGTCTC 
AGCCAGGCCA 
CCTTGGCTGT 
ACTTGCCGAT 
TGATTTTTGG 
CCAAGCAAGC 
TCTGCTCCGA 
CCGACAAGAT 
CTGTGGAGTG 
CCTATACCCT 
GCAGCAGTGG 
GGGATGTCTT 
TGGCCCCAGC 
TGAATATGTT 
GCTGTTTCCA 
ACACACATAT 
GGAGGAATTA 
TTTCACATTT 
AATCCCTTTG 
TCTACACTGC 
CAAATTCCGA 
TCGGAATTCA 
TTTGTATTAG 
CTTTCCATCT 
TTGACATGGT 
TTTTGTGGCA 
GGCAGATAAT 
TCTTGGCTTT 



31 
I 

GCTGCTGGTC 
GGGAGAGGCC 
GAAGCTGCGG 
CCAGTTCACC 
GGATATGGAA 
CTTCGACACT 
CAAGACGCAC 
TTTCTCCATT 
TGTGGAAGGA 
GACCTTTGTG 
GGGAGGAGTG 
GTTTTCTGTC 
AGGCTACGAC 
TTACTGGCAG 
GGGCTGCCAG 
TAAGCAGCTG 
TGCCAACCTT 
CAGCCCAACT 
CTTTCAAGGA 
CATTCGACAG 
AGTCCCCTAA 
AGGCTGGGGC 
GGGTTGCAAC 
ACACACACAC 
CGTTATACAT 
GATTATGAAA 
CAACTCCACT 
TGCCCACTCC 
GCATTACATC 
AGCATCTCCC 
GATTCAAGCA 
ACAGAGTTTA 
TCTCTGAACT 
AAAATACTTC 
GCTGGTGCCT 
AATCATTCAT 



41 
I 

GGACTCACAA 
CAAGGATCCC 
GCACGGAGCC 
GAGTCCTGCT 
TACTTCGGCA 
GGCTCCTCCA 
AGCAGGTTCC 
CAGTATGGAA 
CTAACCGTGG 
GATGCAGAGT 
ACTCCAGTAT 
TACATGAGCA 
CACTCCCATT 
ATTGCACTGG 
GCCATTGTGG 
CAAAACGCCA 
AACGTCATGC 
GCCTACACCC 
CTTGACATCC 
TTTTACTCAG 
GGAGGGGCCT 
ATTCTTTACA 
TTGAATTAAG 
ACACTTCACA 
TCATATTTTG 
ATCTCCAAAC 
CAGCCCTGAC 
TCTCTCCAGC 
ATTTTGTCCA 
ATTGTCCCAC 
AGGCCCATAT 
GCACATTTGA 
TATCTTTCCT 
CTAGGTGGTG 
CTCTATTGGT 
TACAAAGTTC 



51 
I 

TGAAAACGCT 
TTCACAGGGT 
AGCTCTCTGA 
CAATGGACCA 
CTATCTCCAT 
ACCTCTGGGT 
AGCCTTCCCA 
CCGGGAGCTT 
TTGGCCAGCA 
TTGATGGAAT 
TTGACAACAT 
GTAACCCAGA 
TCTCTGGGAG 
ATAACATCCA 
ACACAGGGAC 
TTGGGGCAGC 
CGGATGTCAC 
TACTGGACTT 
ACCCTCCAGC 
TCTTTGACCG 
TGTGTCTGTG 
CCTACAAAAA 
ACCAAACAGA 
CATACACACC 
TATTGATTTT 
ATATGCACAA 
AACCCATCCA 
TCCACATGCT 
TAAATATTTC 
AAATGTTTGG 
ATTGCATTTA 
ACGTTGCTGG 
ATAAAATGGT 
CTGGGTACTT 
AATGTTAAGA 
AGCATTTT 



Seq ID NO: 467 Protein sequence 
Protein Accession #: NP_001901.1



1 

I 

MKTLLLLLLV
SMDQSAKEPL 
QPSQSSTYSQ 
FDGILGLGYP 
FSGSLNWVPV 
IGAAPVDGBY 
HPPAGPLWIL 



11 
I 

LLELGEAQGS 
INYLDMEYPG 
PGQSFSIQYG 
SLAVGGVTPV 
TKQAYWQIAL 
AVECANLNVM 
GDVFIRQFYS 



21 
I 

LHRVPLRRHP 
TISIGSPPQN 
TGSLSGIIGA 
FDNMMAQNLV 
DNIQVGGTVM 
PDVTFTINGV 
VFDRGNNRVG 



Seq ID NO: 468 DNA sequence 

Nucleic Acid Accession #: NM_018058.1

Coding sequence: 319..1575



TACGCGCTGC 
GACGGCCGGG 
TACACCGACA 
GTCAACGTGG 
AGAAAGGGCT 
CCTGATGCCC 
CTCAGAGATG 
GTGGGCCCCA 
AACTTCCTTT 
GTGGACGACC 
AAAGTGGACA 
ACCCATGGGA 
GTCCGCACGG 
AACATTGCCT 
GGAGACCCCC 
ACAGGGGGTG 
GGAGAGTCCA 
TGGCTGCGAG 
CTCTACACCA 
TGTGAGATGG 
GTGACGTGGC 



11 
I 

GGGACCGGCA 
AGGAGATCTA 
AGTTGTTCAA 
CCCGTGGTGT 
CTGGACGCTA 
TCATTGAAAT 
TGGCTGCTGA 
TCCTCAGCAG 
TCCACAACCG 
CCCACCAGCA 
TCGTCTATGG 
AGGTCCGCTT 
TCATCACCGC 
ACCGCAGCTC 
TCATCGAGGA 
TGGTGACCGA 
TGGCTCAGCC 
TGGTGCCACG 
AGAAGAGTGG 
AGCCCGTGGC 
CAGATGGCAA 



21 
I 

GGGGAACGCC 
CTTCCTCAAC 
GTTCCGCAAT 
GGCCAGCCTC 
CTCTATCTAC 
GGACCCTGAG 

CAGTGCCTCG 
GGGCGATGGC 
TGGGCGAGGT 
CAACTGGAAT 
CCGGGACATC 
CGACTTTGAC 
CTCAGCCAAC 
GCTCAATCCC 
CTTCGACGGA 
GCTGTCCGTC 
CACCCGGGTT 
GGCCCACCTG 
ACACTTTGGC
GATGGTGAGC 



31 

1 

ATCGGGGTCA 
ACCAATAATG 
AACCGGTGGG 
TTTGCCGGAC 
ATTGCCAATT 
GCCAGTGACC 
AGCAAATATA 
GATATCTTCT 
ACCTTTGTGG 
GTGGCCCTGG 
GGCCCCCACC 
GCCTCACCCA 
AATGACCAGG 

GGCGACGCCT 
GACGGGATGC 
TTCCGGGGCA 
GGGGCCTTTG 
AGGATCATCG 
CTGGGGAAGG 
CGGAACGTGG 



41 
I 

CAGCCTGCGA 
CCTTCTCGGG 
AAGACATCCT 

ACGCCTACGG 
TCTCCCGGGG 
CAGGGGGCCG 
GCGACAATGA 
ACGCTGCGGC 
CTGACTTCAA 
GCCTCTATCT 
AGTTCTCCAT 
AGCTGGAGAT 
GOGTCATCCG 
TGGAGCCTGA 
TGGACCTCAT 
ATCAGGGCTT 
CCAGGGGAGC 
ACGGGGGCTC 
ATGAAGCCAG 
CCAGCGGGGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860 
1920 
1980 
2040 
2100 



31 


41 


51 




1 

SLKKKLRARS 


1 

QLSEFWXSHN 


1 

LDMIQFTESC 


60 


FTVIFDTGSS 


NLWVPSVYCT 


SPACKTHSRF 


120 


DQVSVEGLTV 


VGQQFGSSVT 


EPGQTFVDAE 


180


DLPMFSVYMS


SNPEGGAGSB 


LIFGGYDHSH 


240 


FCSBGCQAXV 


DTGTSLITGP 


SDKIKQLQNA 


300 


PYTLSPTAYT 


LLDFVDGMQP 


CSSGFQGLDI 


360 


LAPAVP 









51 
I 

CATCGACGGG 
GGTGGCCACG 
GAGCGATGAG 
CTGTGTGGAC 
TAATGTGGGC
CATTCTGGCG 
AGGCGTCAGC 
GAATGGGCCT 
CAGTGCTGGT 
CCGTGATGGC 
GCAAATGAGC 
GCCCTCCCCT 
CTTCTTCAAC 
TAGAGAGCAC 
GGGCCGGGGC 
CTTGTCCCAT 
CAACAACAAC 
TAAGGTCGTG 
AGGCTACCTG 
CAGTGTGGAG 
GATGAACTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 




GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 
ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 
ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TACGAGCCCA
ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA
CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG
CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 
CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 
AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 
CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC 
CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 
ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 
CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 
AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 
AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATGTAA 
GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT AAAAACAAGT GCCTCTCACT 
GGGCTTTGTC AACACGTG 

Seq ID NO: 469 Protein sequence 
Protein Accession #: NP_060528.1






1
I 

MDPEASDLSR 
RGDGTFVDAA 
FRDIASPKFS 
ELNPGDALEP 
RTRVGAFARG 
KMVSRNVASG 
GAGPTRSAVG 



11 
I 

GILALRDVAA
ASAGVDDPHQ 
MPSPVRTVIT 
EGRGTGGVVT
AKVVLYTKKS
EMNSVLEILY
ATSPTRMAQP 



21 
I 

EAGVSKYTGG 
HGRGVALADF 
ADFDNDQELE 
DFDGDGMLDL 
GAHLRIIDGG 
PRDEDTLQDP 
AWGLSASHRA 



31 
I 

RGVSVGPILS
NRDGKVDIVY
IFFNNIAYRS
ILSHGESMAQ
SGYLCEMEPV 
APLETPMNAS 
PAPPPPPLLL 



41 
I 

SSASDIFCDN 
GNWNGPHRLY 
SSANRLFRVI 
PLSVFRGNQG 
AHFGLGKDEA 
SSHSCALETS 
PLPLLLPLLE



51 

I 

ENGPNFLFHN 
LQMSTHGKVR 
RREHGDPLIE 
FNNNWLRVVP
SSVEVTWPDG 
PYVSTPMEAT 
LPLLHRSS 



Seq ID NO: 470 DNA sequence 
Nucleic Acid Accession #: AJ279016
Coding sequence: 1..1962



ATGTCCAGGA 
CAGCGGGCTG 
AGTAATCCCA 
TTTGAGATCG 
CAGAAGCGGC 
GACCGGCAGG 
GAGATCTACT 
TTGTTCAAGT 
CGTGGTGTGG 
GGACGCTACT 
ATTGAAATGG 
GCTGCTGAGG 
CTCAGCAGCA 
CACAACCGGG 
CACCAGCATG 
GTCTATGGCA 
GTCCGCTTCC 
ATCACCGCCG 
CGCAGCTCCT 
ATCGAGGAGC 
GTGACCGACT 
GCTCAGCCGC 
GTGCCACGCA 
AAGAGTGGGG 
CCCGTGGCAC 
GATGGCAAGA 
CTCTACCCCC 
TTCTCCCAGC 
GTGTGCCCTC 
AACAAGAAGT 
CTCGGCCAGT 
GCTGCTGCCG 
CTGGGGTCGG 
CCAGCGGATG 
AGGGATGTAA 
TAACAATTAG 
CAGACAGGGT 
GGACACAGAT 
CCTGAGTTCA 
AACTTGTTAG 
TTAGTGTGGA 
AAAGGCTCAA 



11 
I 

TGTTACCGTT 
AACCCATGTT 
CCCAGCTCAA 
TCGTGGCGGG 
TGGTGAACAT 
GGAACGCCAT 
TCCTCAACAC 
TCCGCAATAA 
CCAGCCTCTT 
CTATCTACAT 
ACCCTGAGGC 
CTGGGGTCAG 
GTGCCTCGGA 
GCGATGGCAC 
GGCGAGGTGT 
ACTGGAATGG 
GGGACATCGC 
ACTTTGACAA 
CAGCCAACCG 
TCAATCCCGG 
TCGACGGAGA 
TGTCCGTCTT 
CCCGGTTTGG 
CCCACCTGAG 
ACTTTGGCCT 
TGGTGAGCCG 
GGGATGAGGA 
AGGAAAATGG 
GAGACAAGCC 
GCAGTCGGGG 
CACCGGGCCC 
CTGCTGGAGC 
TGGTTAAGGA 
GAGTCCAGCA 
AGGCCTGGGA 
GGAGACTCGT 
CGCTGCCCTG 
GTGCCCAGGG 
AATCCTGATT 
CCATCCATTA 
GATTAGATTA 
TAAAAACAAG 



21 
I 

CCTGCTGCTO 
CACTGCAGTC 
CTATGGTGTG 
GTACAATGGA 
CGCGGTCGAT 
CGGGGTCACA 
CAATAATGCC 
CCGGTGGGAA 
TGCCGGACGC 
TGCCAATTAC 
CAGTGACCTC 
CAAATATACA 
TATCTTCTGC 
CTTTGTGGAC 
CGCCCTGGCT 
CCCCCACCGC 
CTCACCCAAG 
TGACCAGGAG 
CCTCTTCCGC 
CGACGCCTTG 
CGGGATGCTG 
CCGGGGCAAT
GGCCTTTGCC 
GATCATCGAC 
GGGGAAGGAT 
GAACGTGGCC 
CACACTTCAG 
CCATTGCATG 
CGTATGTGTC 
CTACGAGCCC 
CCGCCCCACC 
TGCCACTGCT 
GAGCTGCGAG 
GGGGAGTGGG 
GCTAGACCCT 
AAGGCCAGGC 
ATGGCGCTTA 
AGGTGGTGTC 
CAGGAACTCA 
TCGCATCTGC 
AATGTATGTA 
TGCCTCTCAC 



31 
I 

CTCTGGTTTC 
ACCAACTCAG 
GCAGTTACTG 
CCCAACCTGG 
GAGCGCAGCT 
GCCTGCGACA 
TTCTCGGGGG 
GACATCCTGA 
TCTGTGGCCT 
GCCTACGGTA 
TCCCGGGGCA 
GGGGGCCGAG 
GACAATGAGA 
GCTGCGGCCA 
GACTTCAACC 
CTCTATCTGC 
TTCTCCATGC 
CTGGAGATCT 
GTCATCCGTA 
GAGCCTGAGG 
GACCTCATCT 
CAGGGCTTCA 
AGGGGAGCTA 
GGGGGCTCAG 
GAAGCCAGCA 
AGCGGGGAGA 
GACCCAGCCC 
GACACCAATG 
AACACCTATG 
AACGAGGATG 
ACCCCCACCG 
GCACCGGTCC 
CCCAGCTGCT 
AAAGTGGGCT 
CCCCAAGCCC 
CCTGTGCTGG 
CATTCCAGTG 
ACTGCACAGG 
CAAAGCTATG 
AAAATGGGGA 
AGACACTTGG
TGGGCTTTGT 



41 

I 

TGCCCATCAC 
TTCTGCCTCC 
ATGTGGACCA 
TTCTGAAGTA 
CACCCTACTA 
TCGACGGGGA 
TGGCCACGTA 
GCGATGAGGT 
GTGTGGACAG 
ATGTGGGCCC 
TTCTGGCGCT 
GCGTCAGCGT 
ATGGGCCTAA 
GTGCTGGTGT 
GTGATGGCAA 
AAATGAGCAC 
CCTCCCCTGT 
TCTTCAACAA 
GAGAGCACGG 
GCCGGGGCAC 
TGTCCCATGG 
ACAACAACTG 
AGGTCGTGCT 
GCTACCTGTG 
GTGTGGAGGT 
TGAACTCAGT 
CACTGGAGTG 
AATGCATCCA 
GAAGCTACAG 
GCACAGCCTG 
CTGCTGCTGC 
TCGTAGATGG 
GAGCAGGGGT 
TGTGCTGCTG 
ATCCATGCAC 
GCACATAGCT 
GGTCTAATGA 
AAGTATGAGG 
TGACCTTACA 
TTAAGAATAG 
CACAAAACCT 
CAACACG 



Seq ID NO: 471 Protein sequence 
Protein Accession #: CAC08451 

1 11



1320
1380
1440
1500
1560
1620
1680
1740
1800
1860
1920
1980
2040
2100
2160



51 
I 

TGAGGGGTCC 
TGACTATGAC 
TGATGGGGAC 
TGACCGGGCC 
CGCGCTGCGG 
CGGCCGGGAG 
CACCGACAAG 
CAACGTGGCC 
AAAGGGCTCT 
TGATGCCCTC 
CAGAGATGTG 
GGGCCCCATC 
CTTCCTTTTC 
GGACGACCCC 
AGTGGACATC 
CCATGGGAAG 
CCGCACGGTC 
CATTGCCTAC 
AGACCCCCTC 
AGGGGGTGTG 
AGAGTCCATG 
GCTGCGAGTG 
CTACACCAAG 
TGAGATGGAG 
GACGTGGCCA 
GCTGGAGATC 
TGGCCAAGGA 
GTTCCCATTC 
GTGCCGGACC 
CGTGGGGACT 
CACTGCCGCT 
AGATCTCAAT 
GGGACATGAA 
CCTAGACAGT 
ATTACTTAGC 
GTGATCACAG 
CCATATCTTA 
ACTTTAGTGT 
CCAGTCACTT 
AATCTTGGGG 
GGCACATAGT 



60 
120 
180
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



21 31 41 51 

I I I I

MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQLNYGV AVTDVDHDGD 
FEIVVAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE
EIYFLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS



60 
120 
180 




GRYSIYIANY AYGNVGPDAL IEMDPEASDL SRGILALSDV
LSSSASDIFC DNENGPNFLF HNRGDGTFVD AAASAGVDDP
VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSKPSPVRTV
RSSSANRLFR VIRREHGDPL IEELNPGDAL EPEGRGTGGV
AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKVVLYTK
PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLEI
FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT
LGQSPGPRPT TPTAAAATAA AAAAAGAATA APVLVDGDLN

Seq ID NO: 472 DNA sequence
Nucleic Acid Accession #: FGENESH
Coding sequence: 1..4794






AAEAGVSKYT 


GGRGVSVGPI 


240 


HQHGRGVALA 


DFNRDGKVDI 


300 


ITADFDNDQE 


LEIFFNNIAY 


360 


VTDFDGDGML 


DLILSHGESM 


420 


KSGAHLRIID 


GGSGYLCEME 


480


LYPRDEDTLQ 


DPAPLECGQG 


540 


NKKCSRGYEP 


NEDGTACVGT 


600 


LGSVVKESCE


PSC 





ATGGCGTGTC 
AGCGGCTCCT 
GTTCTGAAGT 
TCACCCTACT 
ATCGACGGGG 
CACAGCAGCT 
CCACCTACAA 
TCCTCCCTGQ 
TGTCGGGGTG 
GGGGTGGCCA 
CTGAGCGATG 
GCCTGTGTGG 
GGTAATGTGG 
GGCATTCTGG 
TTCTCCCACA 
GGAGGAGACC 
TGCCGGCTGG 
CAGAGGGAGG 
TCCAAAAGCC 
GCGCCTTCTC 
CCCCTTGTCA 
CCCCACCCCC 
CTGATGGCTG 
CTGAGAAGCT 
GAGCTGGGAG 
CTGGGAGAAC 
CCCAAGGTCA 
GGCCCCGGGA 
CTCTCCCATC 
GTGCCGGGAG 
CTGGCGTGGA 
TTTAGGCTCA 
CTGCAGTTCC 
TCTGCCACTC 
ATCCTCAGCA 
TTCCACAACC 
GCCTTCATCG 
CTAGCAGAAA 
CCACATTGCC 
TTCTTGACGC 
CAGGGGGCCC 
ACTGCCTATT 
TTGTCCTCTG 
GCCCTGGCTG 
CCCCACCGCC 
TCACCCAAGT 
GACCAGGAGC 
CTCTTCCGAT 
GGTCAGGGAG 
AAGGTCAACA 
AGAGGCTGTG 
AAAGGGAAGG 
CCACACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTACGGGCT 
AGGGGCTACG 
AGAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTATGGGGT 
AGGGGCTATG 
AGAGAGCACG 
GGCCGGGGCA 
TTGTCCCATG 
AACAACAACT 
AAGGTCGTGC 
GGCTACCTGT 
AGTGTGGAGG 



11 
I 

CGGGAGGACT 
CCCCAGCATC 
ATGACCGGGC 
ACGCGCTGCG 
ACGGCCGGGA 
CAGCGCAGGT 
CCCCTGCAGG 
GTCAGGCTTC 
GACTGAGACC 
CGTACACCGA 
AGGTCAACGT 
ACAGAAAGGG 
GCCCTGATGC 
CGCTCAGAGA 
CTGCCTCTCC 
CAGAGGAGGC 
GCTGGAAGGA 
CTGGGGCAGC 
ATTTGGCTGA 
CAGCCCACCC 
CTCAGCTAAT 
GAGCCCCAGG 
AGGCTTTGGG 
GGGAGGAAAG 
GTCCCTGGAG 
CTCCCATTTT 
CACAGGAGTG 
GGGTGGCCAA 
CCCTGGTCCC 
CTGCCCTGCC 
ACCAGATGGA 
GGAAAGCACG 
CCTCAGGCCT 
ACTGTGGGTC 
GCAGTGCCTC 
GGGGCGATGG 
TTCACCTCAA 
CTGGTCCTTC 
ATCATGGTTT 
AAGGCTTGGC 
CACCCTGCCT 
ACATTGTCCT 
AAAGAGTCAA 
ACTTCAACCG 
TCTATCTGCA 
TCTCCATGCC 
TGGAGATCTT 
GCTOCATOCT 
AAGGTTTAAG 
CAGGTCCOCT 
GGAATGCAGG 
GAAATGTGGC 
ACAAAAAGGG 
TACCAGGAAA 
TCACTACCAG 
CCAATCACTA 
GGCTCCAATC 
TACGGGCTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCACAGAA 
TCACTACCAG 
CCAATCACTA 
GGGTCCAATC 
GAGACCCCCT 
CAGGGGGTGT 
GAGAGTCCAT 
GGCTGCGAGT 
TCTACACCAA 
GTGAGATGGA 
TGACGTGGCC 



21 

I 

CCCAGCCCGT 
CCCTCCCCAT 
CCAGAAGCGG 
GGACCGGCAG 
GGAGATCTAC 
CCCTTCTGGG 
CCTCCTGGGT 
TCCGGACAGC 
TACCCATGAA 
CAAGTTGTTC 
GGCCCGTGGT 
CTCTGGACGC 
CCTCATTGAA 
TGTGGCTGCT 
AAGCATTGGT 
AGATGAGGAG 
CGGGCAGTTC 
TGGCGTGCCC 
CAAGAACCTA 
TTTCCCTGCC 
GACACATGGA 
AATGGACCCC 
CGCGTGGCCA 
CAGGCAGAAG 
CCAAGCCACA 
ACAAAGAACA 
CCATCTAGTG 
GCGAGAGATT 
CAACTTCCCC 
TGGGAATCCT 
AAAAGAGGAG 
GGAAGCAGAA 
CAGAGGCAGC 
GATGTCTTTT 
GGATATCTTC 
CACCTTTGTG 
ATATCACCTC 
CTCCTCCTGC 
GTCTATGAGC 
CTCCAGTGCC 
TCTGGCAAGA 
GTGGTCTGCC 
CGTGGGTGTG 
TGATGGCAAA 
AATGAGCACC 
CTCCCCTGTC 
CTTCAACAAC 
GGCTCGTGGC 
AATCAGAAGG 
GATGAAGAAA 
GCAAAGCCTG 
CCAAAGTGTG 
GCTACAGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CGGGCTCCAA 
GCTACAGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCACAG 
CATCGAGGAG 
GGTGACCGAC 
GGCTCAGCCG 
GGTGCCACGC 
GAAGAGTGGG 
GCCCGTGGCA 
AGATGGCAAG 



31 
I 

TGCTCTGGTT 
TCCTCCTCCA 
CTGGTGAACA 
GGGAACGCCA 
TTCCTCAACA 
CTCCACAGAA 
CTGCCTCCAC 
AGGCAGGGAG 
CCAGAACCAT 
AAGTTCCGCA 
GTGGCCAQCC 
TACTCTATCT 
ATGGACCCTG 
GAGGCTGGGG 
GAGATATCTG 
CACAGTGGGG 
AAGGAAGAAG 
AGAGGACGTG 
TTTGGCCCAC 
CGCCAAGCCC 
CGTCTGGCTG 
AAATGTAAGG 
GCGCTCAGCA 
GGGCAGGCCA 
CAGCACCTGC 
GACGGAGATC 
GCCACCATGC 
GGGAGAGAGA 
AGCTGCTTGA 
GGGAACTGGG 
GGGAAGATTC 
TTCCCCCCAG 
CCTGTCCTCC 
CTAGGGGGCC 
TGOGACAATG 
GACGCTGCGG 
TGCAGAGATT 
TGCCCGTGGC 
TTTACAAGGA 
CACCGGAGGA 
GCTCCCTGTG 
ATCCCAGAGA 
GACGACCCCC 
GTGGACATCG 
CATGGGAAGG 
CGCACGGTCA 
ATTGCCTACC 
TCTTCATCCT 
GGAGGGTTCC 
CAGAAAGGAA 
GCCAAGGAGC 
CCCAGAACCC 
CCAATCACTA 
GGGTCCAATC 
TACGGGGTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCAGGAAA 
TCACTACCAG 
CCAATCACTA 
GGCTCCAATC 
TACGGGCTCC 
GGGCTACGGG 
AAAGGGGCTA 
CTCAATCCCG 
TTCGACGGAG 
CTGTCCGTCT 
ACCCGGTTTG 
GCCCACCTGA 
CACTTTGGCC 
ATGGTGAGCC 



41 

I 

GGATGGGACT 
GGTACAATGG 
TCGCGGTCGA 
TCGGGGTCAC 
CCAATAATGC 
ACAGGCCTGT 
TCAGCGGAAG 
AGAGGGTGCC 
TTCTTCTGAG 
ATAACCGGTG 
TCTTTGCCGG 
ACATTGCCAA 
AGGCCAGTGA 
TCAGCAAATA 
GCAGAACCGA 
ATGGAAGCAC 
CAGCAGCTTT 
TTCGAACAGC 
CATGTTACTA 
CCCAACACTA 
GAAAACTAGC 
GCCGCCATGC 
CCACTGTGGT 
TGTCCAGATG 
CTGCTAGAGA 
CAGGGAGGAG 
CAGCTCTCGG 
CTGGGGCAGT 
GGCCTCTTGA 
TTCTGGACAT 
ATGGAGACCA 
GCTCCTCTGA 
AGGTGGGCCT 
GAGGCGTCAG 
AGAATGGGCC 
CCAGTGCTGA 
TTCCTCACTC 
ATGCACGTCT 
CCGGGTCACG 
CACTCAGCCT 
TCCTGGGGTC 
GCCTGATGAC 
ACCAGCATGG 
TCTATGGCAA 
TCCGCTTCCG 
TCACCGCCGA 
GCAGCTCCTC 
rGACAGCTGG 
CAGGGCCAGG 
GGAAGGACGA 
CGGCCTCTGC 
AAGCGCCACA 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CAGGGTCCAA 
GCTACGGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
CTCCAATCAC 
CGGGGTCCAA 
GCGACGCCTT 
ACGGGATGCT 
TCCGGGGCAA 
GGGCCTTTGC 
GGATCATCGA 
TGGGGAAGGA 
GGAACGTGGC 



51
I 

GGGTGGGCCC 
ACCCAACCTG 
TGAGCGCAGC 
AGCCTGCGAC 
CTTCTCGGGC 
GCTGAAGCCT 
GGACTTTTCC 
GGTTCCCTGC 
ACCCAAATCA 
GGAAGACATC 
ACGCTCTGTG 
TTACGCCTAC 
CCTCTCCCGG 
TACAGAAGGC 
GGAGCGGGAA 
CAGCCAACTG 
GGTGGAGGAA 
TCTGCAGACT 
TTCTGTCTGC 
CCCTGTAGCC 
CCGGAGTGTC 
TGAGCCCGGC 
GCCAGGGGGC 
TGCACTCAGG 
GCTGTATGAC 
AAGGGACTCG 
GGGACTCGAG 
AGGAAGACCA 
AGCCGGGACA 
GGCCAAGGCC 
TGAGCCCAGA 
GGAGCCTCTG 
GGGGCTTGCT 
CGTGGGCCCC 
TAACTTCCTT 
ACGTCGTTTA 
CCTGTGCCAC 
TCTTCAGGCT 
GTTCTATTCA 
CCAGGGTTCT 
TCTGATCCCC 
CCACAGCTAT 
GCGAGGTGTC 
CTGGAATGGC 
GGACATCGCC 
CTTTGACAAT 
AGCCAACCGC 
TGGGAGGAAC 
GGGTCAGGCC 
GGACTGGGCA 
TATTGCAGGG 
AGATACAAAG 
GGGCTACGGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCAGGAAA 
TCACTACCAC 
CCAATCACTA 
GGGTCCAATC 
TACGGGGTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGAG 
TACCAGGAAA 
CGTCATCCGT 
GGAGCCTGAG 
GGACCTCATC 
TCAGGGCTTC 
CAGGGGAGCT 
CGGGGGCTCA 
TGAAGCCAGC 
CAGCGGGGAG 



60 
120 
180
240 
300 
360 
420 
480
540 
600 
660 
720 
780
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 




ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 

Seq ID NO: 473 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MACPGGLPAR CSGWMGLGGP SGSSPASPPH SSSRYNGPNL VLKYDRAQKR LVNIAVDERS 60 

SPYYALRDRQ GNAIGVTACD IDGDGREEIY FLNTNNAFSG HSSSAOVPSG LHRNRPVLKP 120 

PPTTPAGLLG LPPLSGRDFS SSLGQASPDS RQGERVPVPC CRGGLRPTHE PEPFLLRPKS 180 

GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLFAGRSV ACVDRKGSGR YSIYIANYAY 240

GNVGPDALIE MDPEASDLSR GILALRDVAA EAGVSKYTEG PSHTASPSIG EISGRTEERE 300 

GGDPEEADEE HSGOGSTSQL CRLGWKDGQF KEEAAALVEE QREAGAAGVP RGRVRTALQT 360 

SKSHLADKNL PGPPCYYSVC APSPAHPPPA RQAPQHYPVA PLVTQLMTHG RLAGKLARSV 420 

PHPRAPGMDP KCKGRHAEPG LMAEALGAWP ALSTTVVPGG LRSWEESRQK GQAMSRCALR 480

ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS PKVTQECHLV ATMPALGGLE 540 

GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLEAGT VPGAALPGNP GNWVLDMAKA 600 

LAWNQMEKEE GKIHGDHEPR PRLRKAREAE FPPGSSEEPL LQFPSGLRGS PVLOVGLGLA 660 

SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFL FHNRGDGTFV DAAASAERRL 720 

AFIVHLKYHL CRDFPHSLCH LAETGPSSSC CPWBARLLQA PHGKHGLSMS FTRTGSRFYS 780 

FLTQGLASSA HRRTLSLQGS QGAPPCLLAR APCVLGSLIP TAYYIVLWSA IPESLMTHSY 840 

LSSERVNVGV DDPHQHGRGV ALADPNRDGK VDIVYGNWNG PHRLYLQMST HGKVRFRDIA 900 

SPKFSMPSPV RTVITADPDM DQELEIFFNN IAYRSSSANR LFRCSILARG SSSLTAGGRN 960 

GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AEEPASAIAG 1020

KGKGNVAQSV PRTQAPQDTK PHYHKKGLQG PITTRKRGYG VQSLPGKGAT GSNHYQEKGL 1080 

RGPITTRKRG YGVQSLPGKG ATGSNHYQEK GLQGPITTRK RGYGLQSLPG KGATGSNHYH 1140

RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI TTRKRGYGLQ SLPGKGATGS 1200 

NHYQEKGLQG PITTRKRGYR VQSLPQKGAT GSNHYQEKGL RGPITTRKRG YGLGSLPGKE 1260 

AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ KGATGSNVIR REHGDPLIEE LNPGDALEPE 1320 

GRGTGGWTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGP NNNWLRWPR TRFGAFARGA 1380 

KWLYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS SVEVTWPDGK MVSRNVASGE 1440 

MNSVLEILYP RDEDTLQDPA PLECGQGFSQ QENGHCMDTN ECIQFPFVCP RDKPVCVNTY 1500 

GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR PKKELQLSQG ICTPVWSFFL 1560
PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQE 

Seq ID NO: 474 DNA sequence

Nucleic Acid Accession #: NM_003661.1 

Coding sequence: 1..1152 

1 11 21 31 41 51 

I I I I I I

ATGAGTGCAC TTTTCCTTGG TGTGGGAGTG AGGGGAGAGG AAGCTGGAGC GAGGGTGCAA 60 

CAAAACGTTC CAAGTGGGAC AGATACTGGA GATCCTCAAA GTAAGCCCCT CGGTGACTGG 120 

GCTGCTGGCA CCATGGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 180 

AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGA CTGATAATGA GGCCTGGAAC 240 

GGATTCGTGG CTGCTGCTGA ACTGCCCAGG AATGAGGCAG ATGAGCTCCG TAAAGCTCTG 300 

GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAGCAG 360 

TACAGAAACT GGTTTCTGAA AGAGTTTCCT CGGTTGAAAA GTGAGCTTGA GGATAACATA 420 

AGAAGGCTCC GTGCCCTTGC AGATGGGGTT CAGAAGGTCC ACAAAGGCAC CACCATCGCC 480

AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CGGCATGGGT 540 

CTGGCACCCT TCACAGAGGG AGGCAGCCTT GTACTCTTGG AACCTGGGAT GGAGTTGGGA 600 

ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGG ACTAGGGAAA GAAGTGGTGG 660 

ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 720 

GAGTTTTTGG GTGAGAACAT ATCCAACTTT CTTTCCTTAG CTGGCAATAC TTACCAACTC 780 

ACACGAGGCA TTGGGAAGGA CATCCGTGCC CTCAGACGAG CCAGAGCCAA TCTTCAGTCA 840 

GTACCGCATG CCTCAGCCTC ACGCCCCCGG GTCACTGAGC CAATCTCAGC TGAAAGCGGT 900 

GAACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

ACGGATGTGG CCCCTGTAAG CTTCTTTCTT GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020 

TCAAAGCACT TACATGAGGG GGCAAAGTCA GAGACAGCTG AGGAGCTGAA GAAGGTGGCT 1080 

CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ATAAGATTCT GCAGGCGGAC 1140 
CAAGAACTGT GA 

Seq ID NO: 475 Protein sequence 
Protein Accession #: NP_003652.1

1 11 21 31 41 51 

I I I I I I

MSALFLGVGV RAEEAGARVQ QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIEDAIKYF 60

KEKVSTQNLL LLLTDNEAWN GFVAAAELPR NEADELRKAL DNLARQMIMK DKNWHDKGQQ 120 

YRNWFLKEFP RLKSELEDNI RRLRALADGV QKVHKGTTIA NVVSGSLSIS SGILTLVGMG 180

LAPFTEGGSL VLLEPGMELG ITAALTGITS STMDYGKKWW TQAQAHDLVI KSLDKLKEVR 240

EFLGENISNF LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISAESG 300

EQVERVNEPS ILEMSRGVKL TDVAPVSFFL VLDVVYLVYE SKHLHEGAKS ETAEELKKVA 360
QELEEKLNIL NNNYKILQAD QEL 

Seq ID NO: 476 DNA sequence 

Nucleic Acid Accession #: NM_014452.1 

Coding sequence: 1..1968 

III! 

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT 
GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG 
GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT 
ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA 
ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG 
AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC 
TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC 
AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT 
ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG 
TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA 
AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC 
ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG 
GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA 
TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA
TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA 
CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC 
GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG 
CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA 
GTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT 
CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGC 
ACCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC 
CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT 
AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG 
GCAGCTCTGC AGCACTGGAC CATCCGGGGC CCCGAGGCCA 
GCCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA 
ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA 
AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG 
TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT 
GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG 
AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT 
GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG 
GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC 



PCT/US02/12476 



Seq ID NO: 477 Protein sequence
Protein Accession #: NP_055267.1



MGTSPSSSTA 
TGQVLTCDKC 
LPCAALTDRE
SSVMKCKAYT 
VPSSTYVPKG 
QQGPHHRHIL 
VLVVIVVCSI
LVAAQVGSQW 
ALRQHRRNDV 
SPQDKNKGFF 
DDMLHFLNPE 



11 
I 

LASCSRIARR 
PAGTYVSEHC 
CTCPPGMFQS 
DCLSQNLVVI
MNSTESNSSA 
KLLPSMEATG 
RKSSRTLKKG 
KDIYQFLCNA
VEKIRGLMED 
VDESEPLLRC 
ELRVIEEIPQ 



21 
1 

ATATMIAGSL 
TNTSLRVCSS 
NATCAPHTVC 
KPGTKETDNV 
SVRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVE
SEREVAAFSN 
TTQLETDKLA 
DSTSSGSSAL 
AEDKLDRLFE 



31 
I 

LLLGFLSTTT 
CPVGTFTRHE 
PVGWGVRKKG 
CGTLPSFSSS 
QEGTVPDNTS 
PKRGHPRQNL 
KAGLKKSMTP 
GYTADHERAY 
LPMSPSPLSP 
SRNGSFITKE 
IIGVKSQEAS 



GCAGCCGCAT 
GATTCCTTAG 
ACCGCCATGT 
CCTATGTCTC 
GGACCTTTAC 
CATGGCCAAT 
CACCTGGCAT 
GGGGTGTGCG 
GTACCTTCTC 
GTCAGAACCT 



AGCACATGGA
CAGAATCCAA 
CAGTCCCTGA 
ACCTTCAGGT 
CGTCCATGGA 
GACATCCTAG 
TTGTGCTTTT 
CGAGGACTCT 
TGAAGAAATC 
ATGGTATCGA 
ATCAGTTTCT 
CCGACCACGA 
GCCTCGCCCA 
TTCGTGGGCT 
GCCCCAGCCC 
CTCTCCTGAC 
CGGAGCCCCT 
GTTCCTTTAT 
GTGACTTGCA 
TGATTGAAGA 
TCAAGAGCCA 
TGCTGTAG 



41 
I 

AQPEQKASNL 
NGIEKCHDCS 
TETEDVRCKQ 
TSPSPGTAIF
SARGKEDVNK
HKHFDINEHL 
TQNREKWIYY 
AALQHWTIRG 
SPIPSPNAKL 
KKDTVLRQVR 
QTLLDSVYSH 





60 




120 


x unu war x uvu 


180 




240 


OVGGCATGAG 


300 


(3 ATTYSAR AA A 


360




420 


ClAACVAAAGGG 


480 




540 


uw X OV1 X VJtrV X w 


600 


CTCCAGCTCC 


660 


A APTVATRA A 


720


viUl xv»XV9V*Sw 


780




840 


AGTCAAnCAC* 


900 




960 


ACAGAAPCTA 


1020 




1080 


nAAAAAftftftft 


1140 


rATRArTCPA 


1200 


TATCCTGAAG 


1260 


TTGCAATGCC 


1320 


f3r , Rfjr5f*rTA c 

WOWWLV X /TW 


1380 


GCTAATTAG C 


1440 


GATGGAAGAC 


1500 


GCTTAGCCCG 


1560 


GGTGGAGCCT 


1620 


TCTCCGCTGT 


1680 


TACCAAAGAA 


1740 


GCCTATCTTT 


1800 


GATTCCCCAG . 


1860 


GGAAGCCAGC 


1920 



Seq ID NO: 478 DNA sequence 
Nucleic Acid Accession #: XM_044533
Coding sequence: 238..2751



GCTCTGCCCA 
AGGGGCTGAG 
GGAGGCGGGG 
AGTCCGGCCG 
CTGCGCACCG 
CGGCCACCGC 
TGGGCGCTCA 
TTCGAAGCTG 
CTGTACGTGG 
GGCGGGGAGT 
TTCAAGGGCA 
AGCGGCAGTC 
AACATGGAGA 
AAGGGCCGTT 
CTCTACACTG 
AGCCTTCGCC 
GCCTCAGCCT 
TTTTTCTTCA 
ATTGCCCGCA 
TCCTTCCTCA 
CTGCAGGATG 
GGGGTCTTCA 
ACAATGAAGG 
CAGCAGTGGT 
AACAGTGCCC 
TTCCTCAAGG 
CCCCAGGCTC 
GATGTCCTCT 
CGGGTGCACA 



11 

I 

AGCCGAGGCT 
TTTGCCAGGG 
GCCCCCGGGG 
AGCCACCTGA 
CGATGGGCCT 
TGCTGCTGCT 
GCCCCCGGAT 
AACACATCTC 
GTGCTCGAGA 
ACCAGGAGCT 
AGGACCCACA 
ACCTGTTCAC 
ACTTCACCCT 
GTCCCTTCGA 
GAACAGTCAG 
CCACCAAGAC 
ACATTCCTGA 
GCGAGACTGG 
TCTGCAAGGG 
AGGCCCAGCT 
TCTTCACGCT 
CTTCCCAGTG 
ATGTGCAGAQ 
ACACCGTGAC 
GGGAAAGGAA 
ACCACTTCCT 
GCTACCAGCG 
TCCTGGGCAC 
TCATTGAGGA 



21 
I 

GCGGGGCCGG 
CCCACTTGAC 
CGACTCGGGG 
GCCCGAGCCG 
GAGGAGCTGG 
CCTGCTGCTG 
CAGCCTGCCT 
CAACTACACA 
GGCCCTCTTT 
GCTTTGGGGT 
GCGCGACTGT 
CTGTGGCACA 
GGCAAGGGAC 
CCCGAATTTC 
CAGCTTCCAA 
CGAGAGCTCC 
GAGCCTGGGC 
CCAGGAATTT 
CGATGAGGGT 
GCTGTGCTCA 
GAGCCCCAGC 
GCACAGGGGA 
AGTCTTCAGC 
CCACCCGGTG 
GATCAACTCA 
GATGGACGGG 
CGTGGCTGTA 
TGGTGACGGC 
GCTGCAGATC 



31 
I 

CGCCGGCGGG 
CCTGTTTCCC 
GCGGACCGCG 
CGGGACACCG 
CTCGCCGCCC 
CTGCTCCTGC 
CTGGGCTCTG 
GCCCTTCTGC 
GCACTCAGTA 
GCAGACGCAG 
CAAAACTACA 
GCAGCCTTCA 
GAGAAGGGGA 
AAGTCCACTG 
GGGAATGACC 
CTCAACTGGC 
AGCTTGCAAG 
GAGTTCTTTG 
GGAGAGCGGG 
CGGCCCGACG 
CCCCAGGACT 
ACTACAGAAG 
GGCCTCTACA 
CCCACACCCC 
TCCCTGCAGC 
CAGGTCCGAA 
CACCGCGTCC 
CGGCTCCACA 
TTCTCATCGG 



41 
I 

AGGACTGCGG 
ACCTCCCGCC 
GGGCGGAGCT 
TCGCTCCTGC 
CATGGGGCGC 
TGCAGCCGCC 
AAGAGCGGCC 
TGAGCAGGGA 
GCAACCTCAG 
AGAAGAAACA 
TCAAGATCCT 
GCCCCATGTG 
ATGTCCTCCT 
CCCTGGTGGT 
CGGCCATCTC 
TGCAAGACCC 
GCGATGATGA 
AGAACACCAT 
TGCTACAGCA 
ATGGCTTCCC 
GGGGTGACAC 
GCTCTGCCGT 
AGGAGGTGAA 
GGCCTGGAGC 
TCCCAGACCG 
GCCGCATGCT 
CTGGCCTGCA 
AGGCAGTGAG 
GACAGCCCGT 



IGTYRHVDRA 


60 


QPCPWPMIEK 


120 


CARGTFSDVP 


180 


PRPEHMETHE 


240 


TLPNLQVVNH


300 


PWMIVLFLLL 


360 


CNGHGIDILK 


420 


PEASLAQLIS 


480 


ENSALLTVEP 


540 


LDPCDLQPIF 


600 


LPDLL 




51 




1 

TGCCCCGCGG 


60 


CCCCAGGTCC 


120 


GCCGCCCGTG. 


180 


TCTCCGAATG 


240 


GCTGCCGCCT 


300 


GCCTCCGACC 


360 


ATTCCTCAGA 


420 


TGGCAGGACC 


480 


CTTCCTGCCA 


540 


GCAGTGCAGC 


600 


CCTGCCGCTC 


660 


TACCTACATC 


720 


GGAAGATGGC 


780 


TGATGGCGAG 


840 


GCGGAGCCAA 


900 


AGCTTTTGTG 


960 


CAAGATCTAC 


1020 


TGTGTCCCGC 


1080 


GCGCTGGACC 


1140 


CTTCAACGTG . 


1200 


CCTTTTCTAT 


1260 


CTGTGTCTTC 


1320 


CCGTGAGACA


1380 


GTGCATCACC 


1440 


CGTGCTGAAC 


1500 


GCTGCTGCAG 


1560 


CCACACCTAC 


1620 


CGTGGGCCCC 


1680 


GCAGAATCTG 


1740 
CTCCTGGACA
CCCATGGCCA 
TACTGTGCTT 
ACCAGGCCGT 
TCGGTTGTGT
CAGCCCAACA 
TGGCTACGCA 
GACCTGCTGC 
GGCTTCCAGC 
CAAACAGATG 
GCTGGTGGCA 
TGCACGCTCT 
AACAGCATGA 
CCTGTGGTGC 
CTCGATCACC 
GAGTCAGAGA 
CCCCGGCCCC 
TCCAGAGGAC 
TCCCCTCCGC 
GGCCAGCTGG 
ACAGCCGTGG 
TGCTCCTTAT 
GAGAGGGAAG 
GGGGTGCTGG 
TGGCCTCTTC 
CAGGACCAGC 
CTGCCGTCGT 
GGTCCTGGGC 
GAGAGGACAG 
CAGGGAAGAG 
CCACCATATC 
GTCCTCTCCC 
TCAACACTGC 
ATGCACTTTA 



CCCACAGGGG 
ACTGCAGCCT 
GGAGCGGCTC 
GGATCCAGGA 
CCCCGTCTTT 
CAGTGAACAC 
ACGGGGCCCC 
TGGTGGGCAC 
AGCTGGTAGC 
AGGGTGGCAG 
AGGCCAGCTG 
TTGTGCTGGC 
AAGTCTTCCT 
TGCCCCCTGA 
GAGGGTACCA 
AGAGGCCACT 
GGGTCCGCCT 
GCTGCCCTGG 
TCTGCTCTTC 
CCTGCTGCTC 
CCCCAGAGGT 
GTAAACTGAG 
AGATAGCATG 
GGATGCATCC 
ACCTTCCACA 
TTGGGCTGCG 
CCCACCACCT 
TCGGACCCAA 
CGCGAGCTCA 
ACTGTCGCCT 
CACCCTCGCT 
CAGTCCCCAG 
CCAGCACAGG 
TGTCATTTTT 



GCTGCTGTAT 
GTACAGGAGC 
CAGCTGCAAG 
CATCGAGGGA 
TGTACCAACA 
TTTGGCCTGC 
CGTCAATGCC 
CCAACAGCTG 
CAGCTACTGC 
TGTACCCGTC 
GGGTGCAGAC 
CGTGCTGCTC 
GAAGCAGGGG 
GACCCGCCCA 
GTCCCTGTCA 
CAGCATCCAA 
TGGCTCGGAG 
CTTCAGGGGC 
GTGGAACACG 
TCCAGTCAAG 
CCTGGCCAAA 
CCCTTTGTTT 
GCATGCAGCA 
AAAGTGGTTG 
TTATCCOGCT 
TGOGTTCTGC 
CAGGGACCAG 
CTCCTGGACC 
GGAGAGATTT 
GCCTTCCTCC 

ccatctttga 
ttcaccctcc 

GGCCCTGAAT 
TAATAAAGTC 



GCGGCCTCAC 
TGTGGGGACT 
CACGTCAGCC 
GCCAGCGCCA 
GGGGAGAAGC 
CCGCTCCTCT 
TCGGCCTCCT 
GGGGAGTTCC 
CCAGAGGTGG 
ATTATCAGCA 
AGGTCCTACT 
CCAGTTTTAT 
GAATGTGCCA 
CTCAACGGCC 
GACAGCCCCC 
GACAGCTTCG 
ATCCGTGACT 
TGTGAATGCT 
ACCGTGGTGC 
TAGCGAAGCT 
TATGGGGGCC 
AAAAAACAAT 
CACACGGCTG 
TCTGAGACAG 
GCCACCGGCT 
CTTGCCAGTC 
AGGGCTAGGT 
TTTCCAGCCT 
CGTGACAATG 
GTTGTTGCGT 
ACTCAAACAC 
ATCCCTCACC 
TTATGTGGTT 
TGAAGAATTA 



ACTCGGGCGT 
GCCTCCTCGC 
TCTACCAGCC 
AGGACCTTTG 
CATGTGAGCA 
CCAACCTGGC 
GCCACGTGCT
AGTGCTGGTC 
TGGAGGACGG 
CATCGCGTGT 
GGAAGGAGTT 
TCTTGCTCTA 
GCGTGCACCC
TAGGGCCCCC 
CGGGGTCCCG 
TGGAGGTATC 
CTGTGGTGTG 
CGGAGAGGGT 
COGGCCCTTO 
CCTACCACCC 
TGCCTAGGTT 
TCCAAATGTG 
CTCCAGTTCA 
AGTTGGAAAC 
GCCCTGTCTC 
AGCCGAGGAT 
TGGCACTGCG 
GTATCAGGCT 
TACGCCTTTC 
GAGAACCCGT 
GAGGAACTAA 
TTCCTCCACT 
TTTATACATT 
CTGTTT 



AGTCCAGGTG 
CCGGGACCCC 
TCAGCTGGCC 
CAGCGCGTCT 
AGTCCAGTTC 
GACCCGACTC 
ACCCACTGGG 
ACTAGAGGAG 
GGTGGCAGAC 
GAGTGCACCA 
CCTGGTGATG 
CCGGCACCGG
CAAGACCTGC 
TAGCACOCOG 
AGTCTTCACT 
CCCAGTGTGC 
AGAGCTGACT 
CAACTGGACC 
GGAGCCTTGG 
AGACACCCAA 
GGTGGAACAG 
AAACTAGAAT 
TGGCCTCCCA 
CCTCACCAAC 
ACTGCAGATT 
GTAGTTGTTG 
GCCCTCACCA 
GTGGCCACAC 
CCTCAGAATT 
GTGCCCCTTC 
CTGCACCCTG 
CTAAGGGATA 
TTTTAATAAG 



Seq ID NO: 479 Protein sequence 
Protein Accession #: XP_044533.3



MLRTAMGLRS 
RFEAEHISNY 
SFKGKDPQRD 
GKGRCPFDPN 
VASAYIPESL 
TSFLKAQLLC 
FTMKDVQRVF 
NFLKDHFLMD 
PRVHIIEELQ 
PYCAWSGSSC 
FQPNTVNTLA 
EGFQQLVASY 
MCTLFVLAVL 
PLDHRGYQSL 



11 
I 

WLAAPWGALP 
TALLLSRDGR 
CQNYIKILLP 
FKSTALVVDG
GSLQGDDDKI 
SRPDDGPPFN 
SGLYKEVNRE 
GQVRSRMLLL 
IFSSGQPVQN 
KHVSLYQPQL 
CPLLSNLATR 
CPEVVEDGVA
LPVLFLLYRH 
SDSPPGSRVF 



21 
I 

PRPPLLLLLL 
TLYVGAREAL 
LSGSHLFTCG 
ELYTGTVSSF 
YFFFSETGQE 
VLQDVFTLSP 
TQQWYTVTHP 
QPQARYQRVA 
LLLDTHRGLL
ATRPWIQDIE
LWLRNGAPVN 
DQTDEGGSVP 
RNSMKVFLKQ
TESEKRPLSI 



31 
I 

LLLLLQPPPP
FALSSNLSFL 
TAAFSPMCTY 
QGNDPAISRS 
PEFFENTIVS 
SPQDWRDTLF 
VPTPRPGACI 
VHRVPGLHHT 
YAASHSGVVQ
GASAKDLCSA 
ASASCHVLPT
VIISTSRVSA 
GECASVHPKT 
QDSFVEVSPV 



41 

I 

TWALSPRISL
PGGEYQELLW 
INMENPTLAR 
QSLRPTKTES 
RIARICKGDE 
YGVFTSQWHR 
TNSARERKIN 
YDVLFLGTGD 
VPMANCSLYR 
SSVVSPSFVP
GDLLLVGTQQ 
PAGGKASWGA 
CPVVLPPETR
CPRPRVRLGS 



51 
I 

PLGSEERPFL 
GADAEKKQQC
DEKGNVLLED 
SLNWLQDPAF 
GGERVLQQRW 
GTTEGSAVCV
SSLQLPDRVL
GRLHKAVSVG 
SCGDCLLARD 
TGEKPCEQVQ 
LGEFQCWSLE 
DRSYWKEFLV 
PLNGLGPPST 
EIRDSVV



Seq ID NO: 480 DNA sequence

Nucleic Acid Accession #: NM_004217.1

Coding sequence: 58..1092



1 

I 

GGCCGGGAGA 
GCCCAGAAGG 
AGCACCCTGC 
ATGAGCCGCT 
AGTGGGACAC 
CCTCTGGGCA 
ATCGTGGCGC 
CTGCGCAGAG 
AACTATTTTT 
CTCTACAAGG 
GAGGAGTTGG 
AAGCCAGAAA 
TCTGTGCATG 
CCAGAGATGA 
CTTTGCTATG 
TATCGCCGCA 
GACCTCATCT 
TCAGCCCACC 
TCTGTCGCCT 
TATAGGGGAA 
TTAATAAAGG 



11 
I 

GTAGCAGTGC 
AGAACTCCTA 
CCCAGCGAGT 
CCAATGTCCA 
CCGACATCTT 
AAGGCAAGTT 
TCAAGGTCCT 
AGATCGAAAT 
ATGACCGGAG 
AGCTGCAGAA 
CAGATGCTCT 
ATCTGCTCTT 
CGCCCTCCCT 
TTGAGGGGCG 
AGCTGCTGGT 
TCGTCAAGGT 
CCAAACTGCT 
CTTGGGTCCG 
GATGGTCCCT 
AGAAGGGATC 
CTGAAGCTTT 



21 
I 

CTTGGACCCC 
CCCCTGGCCC 
CCTCCGGAAA 
GCCCACAGCT 
AACGCGGCAC
TGGAAACGTG 
CTTCAAGTCC 
CCAGGCCCAC 
GAGGATCTAC 
GAGCTGCACA 
AATGTACTGC 
AGGGCTCAAG 
GAGGAGGAAG 
CATGCACAAT 
GGGGAACCCA 
GGACCTAAAG 
CAGGCATAAC 
GGCCAACTCT 
GTCATTCACT 
CCTAACTGTT 
TTGT 



31 
I 

AGCTCTCCTC 
TACGGCCGAC 
GAGCCTGTCA 
GCCCCTGGCC 
TTCACAATTG 
TACTTGGCTC 
CAGATAGAGA 
CTGCACCATC 
TTGATTCTAG 
TTTGACGAGC 
CATGGGAAGA 
GGAGAGCTGA 
ACAATGTGTG 
GAGAAGGTGG 
CCCTTTGAGA 
TTCCCCGCTT 
CCCTCGGAAC 
CGGAGGGTGC 
CGGGTGCGTG 
CCCTTATCTG 



41 

I 

CCCCTTTCTC 
AGACGGCTCC 
CCCCATCTGC 
AGAAGGTGAT 
ATGACTTTGA 
GGGAGAAGAA 
AGGAGGGCGT 
CCAACATCCT 
AGTATGCCCC 
AGCGAACAGC 
AGGTGATTCA 
AGATTGCTGA 
GCACCCTGGA 
ATCTGTGGTG 
GTGCATCACA 
CTGTGCCCAC 
GGCTGCCCCT 
TGCCTCCCTC
TGTTTGTATG 
TTTTCTACCT 



51 

I 

TCTAAGGATG 
ATCTGGCCTG 
ACTTGTCCTC 
GGAGAATAGC 
GATTGGGCGT 
AAGCCATTTC 
GGAGCATCAG 
GCGTCTCTAC 
CCGCGGGGAG 
CACGATCATG 
CAGAGACATA 
CTTCGGCTGG 
CTACCTGCCC 
CATTGGAGTG 
CAACGAGACC 
GGGAGCCCAG 
GGCCCAGGTC 
TGCCCTTCAA 
TCTGTGTATG 
CCTCCTTTGT 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



Seq ID NO: 481 Protein sequence 
Protein Accession #: NP_004208



11 



21 



31 



51 

I I I I 

MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV
SSGTPDILTR HFTIDDFEIG RPLGKGKFGN VYLAREKKSH
QLRREIEIQA HLHHPNILRL YNYFYDRRRI YLILEYAPRG
MEELADALMY CHGKKVIHRD IKPENLLLGL KGELKIADFG
PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE 
QDLISKLLRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL 

Seq ID NO: 482 DNA sequence
Nucleic Acid Accession #: AK055663
Coding sequence: 38..1423



I I. 
LMSRSNVQPT AAPGQKVMEN 
FIVALKVLFK SQIEKEGVEH
ELYKELQKSC TFDEQRTATI
WSVHAPSLRR KTMCGTLDYL 
TYRRIVKVDL KFPASVPTGA 
QSVA 






AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAGCCCGAG 
CCTGTTCACG 
TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAAATG 
TGTTCAAATT 
CAATGTCCTA 
TGATTTGAAC 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA 
TGTTTAATCA 
TATGAAACTA 
GCTTTAAATA 
GTTTTGTAGT 
AGATCTGTCA 
TGCCACTGTG 
CTTAGTTTTT 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA 
ACCTCATGAT 
CCTGGCCGAT 
GGGAAAGGGA 
AATTGCTAAA 
TTTTTAGCAG 
GATTTTTGTT 



11 

I 

CCGGCGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAGTTTAA 
TATTCATTTG 
CAGTTGGGAG 
ATACACACGG 
ATGCTTTCTA 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAGCTA 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTGCTC 
TTCAAGGATG 
AACTTTTCAG 
CCAGTTACAT 
CCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TATTGACTCC 
TTTACTCTAA 
TATTTTTGTA 
GGCTTCCTTT 
TGACTGCAGT 
CATTACTAAG 
CCCGGCCAAT 
GTTTTGTTTT 
ATGATCTCAG 
TCCCGAGTAG 
GTAAAGACGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 
TTTTTCTTTG 
AAATTTTGGA 
AAAGTTTCTC 



21 

I 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTCGGAATAA 
ATGTTGCAGA 
TTCCCCGAAT 
CATATATGCT 
TTGCCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG 
ATCATCACGT
CAACTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
GGGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTCCT 
ATACATTCTA 
TCCTTTAAAA 



31 
I 

CCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC 
ATTAAAAGAA 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTCGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA 
AGTGCATGTA 
CAGGCTGTAC 
GCCTGCCTTA 
AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTTACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATTTATTTAG 
ATAGTAGTCT 
GTGACAGTGA 
TTTCTTTAAA 
CTTACCTTTA 
TCTTTTTTTT 
TAACTTAAGG 
ATGGAGTCTC 
CCTCTGCCTC 
AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
GAATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



41 

I 

GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGGTAACAT 
CTGGCTGTAT 
AGTGCAGAAC 
TTTGTGGCTC 
TATGTCTCAG 
AGCTTGTGTG 
GTTTTGATTG 
AATAATTATT 
ACTATGTATC 
GTTATTGGTC 
GTCCGAAATG 
AGAATTCGAC 
ACTCTAGTGT 
TTGTCTGGGC 
CCTCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCCAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT 
TAAGAGCCAC 
TCCGAGACGG 
CTGTACTTTA 
ACTCTGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTGAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATTTGTAAAT 



51 



Seq ID NO: 483 Protein sequence 
Protein Accession #: BAB70980.1



MGTIHLFRKP 
TAYTYLTIFD 



RSLCGIIPGL 
GTWYPMSVYS 
VRIRRDANEQ 
MPLLKGTDDL 
SSMLNQGLGV 



11 
I 

QRSFFGKLLR 
LFSLMTCLIS 
EIHTGRLLVG
SSIPLPRMNP 
GKVLLQTTPP 
MVLAHVTNRL 
NPVTSTPAKP 
PGIGATQGLR 



21 
1 

EFRLVAADRR 
YWVTLRKPSP
TFVALCFNLF 
FVLIDLAGAF 
HVIGQLDKLI 
YTLVSTLTVQ 
SSPPPEFSFN 
TGPTNIPSRY 



31 

I 

SWKILLFGVI 
VYSFGFERLE
TMLSIRNKPF 
ALCITYMLIE
REVSTLDGVL 
IPKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 
I 

NLICTGFLLM 
VLAVFASTVL
AYVSEAASTS 
INNYFAVDTA 
EVRNEHFWTL 
LLSGPVAANV 
LLNTQTRPYG 
P 



Seq ID NO: 484 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..900 



ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
GGCTGCGGCG 
GGACCCCGGG 
CTTCCTAACT 
CCGGTGCGCA 
CTTTGCTACC 
TTTCAAAACA 
GTGCTGCTGG 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGCTGTGCA 
GAGGAGACTG 
CAGGCTCTCC 
TTGAGCTCTG 
CGGATACCGA 
TCACAGAGAA 
TGGGCACCCA 



21 

I 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGGT 
CCGGGGAGCT 
GAGCAGGCCC 
CCGCCCCGCC 
GGACACAGCG 
TGTCTTCCTG 
ATGGCTGCCC 
GGCCGACCTG 



31 
I 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
ACGTACGTTC 
GGGGCGGGCG 
CGAGGTGGCG 
CCTGCAGTGC 
GGACAGGAGG 
GCGTGCTTCA 
GAGATCCGCA 
AGGGACGATG 



41 
I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
AATCGCCCGT 
TCTCGGCGGG 
CTGGTGCGGC 
AAGTCCTGGT 
ATTTTGACCG 
GCGTGGTGCA 
CGCACAACCC 
TCAACGTACT 



60 
120 
180 
240 
300 



ATCTCTTTCG 
TAGCAGCTGA 



ACCTGACCAT 
TGAGGAAACC 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG 
TTGCCGTAGA 
CCATGAGTGT 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CAGAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAG 
TAACTTATTT 
TTGCATTGAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA 
ATGATTCTCC 
AGCTAATTTT 
TGAACTCCTG 
AGCCACCGCA 
TACACATTTT 
TTCCAAACTT 
AAAGCAAAAA 
TAATTATCAA 



51 
I 

WCSSTNSIAL 
AQLGALFILK 
WLQEHVADLS 
SAIAIALMTF 
GPGSLAGSVH 
LNFSDHHVIP 
PGLNHGHTPY 



51 

1 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC
GCGGCCGCGT 
AGGGCGCAGA 
CCAGGACGCT 
GGATGGAGCT 
ACTTCGTTCC 
GCCCAGCTCC 
CCAGGCGCCT 
AATTCAGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 

GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG
ATCCGAGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACGC AGAAGAACTT GAAGGAAGTA
TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT
GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA

Seq ID NO: 485 Protein sequence 
Protein Accession #: FGENESH predicted






720 
780
840



1 11 

I I 
MPPRELSEAE PPPLRAPTPP 
RPTALDTFSG TYVQSPVRPR 
LPNSGSPRPA PAVQVLVDGA 
FQNITEKWLP EIRTHNPQAP 
IRACCYLECS ALTQKNLKEV 



21 31 41 

I I I 

PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS 
GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP 
PVRIELWDTA GQEDFDRLRS LCYPDTDVFL 
VLLVGTQADL RDDVNVLIQL DQGGREGPVP 
FDSAILSAIE HKARLEKKLN AKGVRTLSRC 



Seq ID NO: 486 DNA sequence 

Nucleic Acid Accession #: XM_063832.2

Coding sequence: 1..711 



ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
ATTGAGCTCT 
CCGGATACCG 
ATCACAGAGA 
GTGGGCACCC 
GGCCGGGAGG 
TGCTGCTACC 
GCTATTCTCA 
GTGCGCACCC 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGACACAGC 
ATGTCTTCCT 
AATGGCTGCC 
AGGCCGACCT 
GCCCCGTGCC 
TTGAGTGCTC 
GTGCCATTGA 
TCTCCCGCTG 



21 
I 

CGAGGCCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGTG 
GGGACAGGAG 
GGCGTGCTTC 
CGAGATCCGC
GAGGGACGAT 
CCAACCCCAG 
AGCCTTGACG 
GCACAAAGCC 
CCGCTGGAAG 



31 
I 

CCGCCCCCGC 
GGCATCAAGT 
TACACCTGCA 
CAAGTCCTGG 
GATTTTGACC 
AGCGTGGTGC 
ACGCACAACC 
GTCAACGTAC 
GCTCAGGGTC 
CAGAAGAACT 
CGGCTGGAGA 
AAGTTCTTCT 



41 

I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
TGGATGGAGC 
GACTTCGTTC 
AGCCCAGCTC 
CCCAGGCGCC 
TAATTCAGCT 
TGGCCGAGAA 
TGAAGGAAGT 
AGAAACTGAA 
GCTTCGTTTG 



51 
I 

YTCNGYPARY 
RGGAGAAQDA 
ACFSVVQPSS
QPQAQGLAEK
RWKKFFCFV



51 

i 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
TCCGGTGCGC 
CCTTTGCTAC 
CTTTCAAAAC 
TGTGCTGCTG 
GGACCAGGGG 
GATCCGAGCC 
ATTTGACTCG 
TGCCAAAGGT 
A 



Seq ID NO: 487 Protein sequence 
Protein Accession #: XP_063832.1

1 11 21 31 41 51

I I I I I I

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 
RPTALDTFSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVFLACF SVVQPSSFQN
ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCFV 

Seq ID NO: 488 DNA sequence

Nucleic Acid Accession #: NM_014398.1

Coding sequence: 64..1314



1 
I 

GGCACCGATT 
ACCATGCCCC 
CACGATGGCA 
ACTGCAGCAG 
CCTCACCAAA 
ACAGTAAAAA 
ATTACCTACA 
GTTACTGAAG 
CCACCAGCTC 
ACTCAACCCA 
ACAACCGGTC 
AATACCACCC 
CCATCGTCAG 
GCAGAGATGG 
TACTTCAACA 
AACCTTCTGT 
TCATATTATA 
CAAGGAATCA 
GTGAGTGAAC 
CTTCAAGCCT 
TACACAATTG 
GGTGTCTATA 
CCCGGGGGGA 
TTGGGAAATT 
AATGAAGTGA 
GTTTATTTTA 
GCCACTCAAA 
AGCCTTCAAA 
ATTTTATTTT 
GCTTTTACTA 
ACTCCTTTTC 
CAGGCTGGAG 
TGATTCTCCT 
GCTAATTTTT 
CTCTTGACCT 
AGCCATTGOG 



11 
I 

CGGGGCCTGC 
GGCAGCTCAG 
GTCAAATGAG 
CAACAGTACA 
CTTTAGCAGC 
TTCCAACAAC 
CCCTGGTCAC 
TTACAGTCGG 
ATACAGCTGG 
GTAACCAGAC 
AGAAGCCTGA 
GCACAGCTGC 
TCAAGACTGG 
GGATACAGCT 
TCGACCCCAA 
TGAATTTTCA 
TCAGTGAAGT 
AACATGCGGT
AGAGCCTCCA 
TTGATTTTGA 
TGCTTCCTGT 
AAATCCGCCT 
ATGAAAATAA 
CCCTCAGAGT 
GTCATGTGTG 
TGAAAGATAT 
GTCAACATTT 
TTATAAACCA 
ACCCTTGATC 
TCTGTGTTTT 
CACTTTAAAT 
TACAGTGGCA 
GCTTCAGCTT 
GTATTTTTAT 
CAGGTGATCC 
CCCGGCCTTA 



21 
I 

CCGGACTTCG 
CGCGGCGGCC 
AGCAAAAGCA 
GGACATAAAA 
AAGATTCATG 
TACCCCAGCA 
AACCCAGGCC 
CCCTAGCTTA 
AACCAGTTCA 
CACCCTTCCA 
TCAACCCACC 
ACCTGCCTCC 
AATTTATCAG 
GATTGTTCAA 
CGCAACGCAA 
GGGCGGATTT 
GGGAGCCTAT 
GGTGATGTTC 
GTTGTCAGCC 
AGATGACCAC 
GATTGGGGCC 
AAGGTGTCAA 
TGGAATTTAG 
GTGGGTCCTT 
ATTTAAGTTC 
AGTGAGCTGT 
GAGATATGTT 
AGGGTCAATT 
TTAACAAAGC 
ATGGTTTCAT 
TTGTTTTTGT 
CGATCTCGGC 
CCCGAGTAGC 
TATAGACGGG 
ACCCACCTCA 
AATGTTTTTT 



31 

I 

CCGCACGCTG 
GCGCTCTTCG 
TTTCCAGAAA 
AAACCTGTCC 
GATGGTCATA 
ACTACAAAAA 
ACACCCAACA 
GCCCCTTATT 
TCAACCGTCA 
GCAACTTTAT 
CATGCCCCAG 
ACGGTTCCTG 
GTTCTAAACG 
GACAAGGAGT 
GCCTCTGGGA 
GTGAATCTCA 
TTGACCGTCT 
CAGACAGCAG 
CACCTGCAGG 
TTTGGAAATG 
ATCGTGGTTG 
TCATCTGGAT 
AGAACTCTTT 
CAAACAATGT 
AGGCAGCACA 
TTATTTTCTA 
GAATTAACAT 
GTAACTAATA 
CTTTGCTTTG 
GTAACATACA 
TTTTTGAGAC 
TTATGGCAAC 
TGGGATTACA 
TTTCACCATG 
GCCTCCCAAA 
TTAATCATCA 



41 
I 

CAGAACCTCG 
CGTCCCTGGC 
CCAGAGATTA 
AGCAACCAGC 
TCACCTTTCA 
ACACTGCAAC 
ACTCACACAC 
CACTGCCACC 
GCCACACAAC 
CGATAGCACT 
GAACAACGGC 
GGCCCACCCT 
GAAGCAGACT 
CGGTTTTTTC 
ACTGTGGCAC 
CATTTACCAA 
CAGATCCAGA 
TCGGGCATTC 
TGAAAACAAC 
TGGATGAGTG 
GTCTCTGCCT 
ACCAGAGAAT 
CATCCCTTCC 
AAACCACCAT 
TCAATTTCTA 
GTTTCCTTTA 
AATATATGTA 
CTACTGTGTG 
TTATCAAATG 
TATTCCTGGT 
GGAGTTTCAC 
CTCCGCCTCC 
GGCACACACT 
TTGGCCAGAC 
GTGCTGGGAT 
AAAAGAACAA 



51 
I 

CCCAGCGCCC 
CGTAATTTTG 
TTCTCAACCT 
TAAGCAAGCA 
AACAGCGGCC 
CACCAGCCCA 
AGCTCCTCCA 
CACCATCACC 
TGGGAACACC 
GCACAAAAGC 
AGCTGCCCAC 
TGCACCTCAG 
CTGTATAAAA 
ACCTCGGAGA 
CCGAAAATCC 
GGATGAAGAA 
GACAGTTTAC 
CTTCAAGTGC 
CGATGTCCAA 
CTCGTCTGAC 
TATGGGTATG 
CTAATTGTTG 
AGGATGGATG 
CTTCTATTCA 
AATACTTTTT 
GAATATTTTA 
AAGTAGAATA 
TGCATTGAAG 
GACTTTCAGT 
GTAGCACTTA 
TCTTGTCACC 
CGGGTTCAAG 
ACCACGCCTG 
TGGTCTTGAA 
TACAGGCATG 
CATATCTCAG 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 

GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA
CTTGATGACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA
CTAAACAATA AGCAAGAGAC AATAATAATG GCCCTTAATT 
TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA 
ACTGAGACTT AAGGGAACTG AATCACTTAA ATGTCACCTG 
GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT 
CCTACAAGAA CAATGACACC ACACTCTGCC TGAAGGCTCA 
TCACCTTACA GGGAAATGGG TTTATCCAGG ATCATGAGAC 
AGCTTTGCAG ATAACAAAAT AGCCTATCCT TAATAAATCC 
TGAGGGGCTT TGTAAAACAT TAGTCAGTTG CTCATTTTTA 
GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA 
ACTTCGCTAA CCAACTGTTC TTTCTTGAGT GTATAGCCCC 
TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT 
GAATTTTATT TCTGCTGTTT TTTTTGCTGC TAAAGAAAGG 
AGAAAAGTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT 
CCATGTTGAC TTTCCTCATG TGTTTCCTTA TGACTCAGTA 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC 

Seq ID NO: 489 Protein sequence 
Protein Accession #: NP_055213.1



AATCAGCTTA 
GGTGGCTACA 
ATTAACAAAG 
ACTTATAAGT 
GCTAACTGAT 
CTTCTCCCTA 
CACCTCATAC 
ATTAGGGTAG 
TCCACTCTCT 
TGGGATTGCT 
ATTTTTTTGA 
ATCTTGTGGT 
TCTGTAGAGC 
AACTAAGTCA 
AATTCAAGTC 
AGTTGGCAAG 
AATAAAAACT 



TATTTTTTAT 
GATGGTAGAA 
TGCCAGAGTC 
GAATGAGTAA 
GGCAGAGCCA 
CACCAAGTTA
CAGCATACGC 
ATGAAAGGAG 
GGAAGGAGAC 
TAGCTGGGCT 
TAATAGAGAA 
AACTTGCTGC 
AGCCTGCCAA 
GGATGTTAAC 
AGCCTAGAGA 
GTCCTGACTT 
TACTTTGG 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 






i 

I 

MPRQLSAAAA
HQTLAARFMD 
TEVTVGPSLA 
TGQKPDQPTH 
EMGIQLIVQD 
YYISEVGAYL 
QAFDFEDDHF 



11 

I 

LFASLAVILH
GHITPQTAAT 
PYSLPPTITP 
APGTTAAAHN 
KESVFSPRRY 
TVSDPETVYQ 
GNVDECSSDY 



21 

I 

DGSQMRAKAF 
VKIPTTTPAT 
PAHTAGTSSS 
TTRTAAPAST 
FNIDPNATQA 
GIKHAVVMFQ
TIVLPVIGAI 



31 
I 

PETRDYSQPT 
TKNTATTSPI 
TVSHTTGNTT 
VPGPTLAPQP 
SGNCGTRKSN
TAVGHSFKCV
VVGLCLMGMG



41 

I 

AAATVQDIKK 
TYTLVTTQAT 
QPSNQTTLPA 
SSVKTGIYQV 
LLLNFQGGFV 
SEQSLQLSAH 
VYKIRLRCQS 



51 

I 

PVQQPAKQAP 
PNNSHTAPPV 
TLSIALHKST 
LNGSRLCIKA 
NLTFTKDEES 
LQVKTTDVQL 
SGYQRI 



Seq ID NO: 490 DNA sequence 

Nucleic Acid Accession #: NM_005409.3 

Coding sequence: 94..378



TTCCTTTCAT 
CAACAGCACC 
TTGGCTGTGA 
TGTCTTTGCA 
ATAATGTACC 
AAAGGACAAC 
GAAAGAAAGA 
AACCTAGAAC 
AGACTTTTCT 
GGGTGAAAGG 
CTGGCCAAAG 
GGTTACCATC 
GCATTTCTAG 
GAGAACATTT 
CTGTGGTTAC 
CATCTATGTG 
CCAAATATCA 
TTTATAACCA 
TGGGATACTG 
GATGTTTTTC 
TGTACTTTTT 
TACAAAATGT 
AATCACTTTT 
TTGTTCATGC 
GTCATTTTTT 



11 

) 

GTTCAGCATT 
AGCAGCAACA 
TATTGTGTGC 
TAGGCCCTGG 
CAAGTAACAA 
GATGCCTAAA 
ATTTTTAAAA 
AAGTTTAACT 
ATGGTTTTGT 
ACCAAAAACA 
GAGTCCAGCA 
GGAGTTTACA 
GCTAGAGAAC 
CTGTCTCTAG 
AGTGGAGACA 
TCGTAAAGCA 
TGTAGCACAT 
ATTCATTAAA 
GCAACAGTGC 
AACTTTTATT 
GTTTTGATCC 
TTTTGTCTAC 
ACTTTTTGTA 
CTATATACTG 
TCTCTAATAA 



21 
1 

TCTACTCCTT 
GCAAAAAACA 
TACAGTTGTT 
GGTAAAAGCA 
CTGTGACAAA 
TCCCAAATCG 
ATATCAAAAC 
GTGACTACTG 
GACTTTCAAC 
GAAATACAGT 
ATTAAATGGA 
AAGTGCTTTC 
CTTCTAGATT 
AAGTTATCTG 
TTGACATTAT 
TTCCTCAAAC 
CAATATGTAG 
TGTAATTCAT 
ACATATTTCA 
CATTGAGATG 
GTTTGTATAA 
CAAAGAAAAA 
ATTCTGTCTC 
TAAAATTTAG 
ACTACCACAA 



31 
I 

CCAAGAAGAG 
AACATGAGTG 
CAAGGCTTCC 
GTGAAAGTGG 
ATAGAAGTGA 
AAGCAAGCAA 
ATATGAAGTC 
AAATGACAAG 
TTTTGTACAG 
CTTCCTGAAT 
TTTCTAGGAA 
AOGTTCTTAC 
TGATGCTTAC 
TCTGTATTGA 
TACTGGAGTC 
ATTTTTTCAT 
GGAAACATTC 
AAAATGTACT 
TAACCAAATT 
TTTTGAAGCA 
ATGATAGCAA 
TGTTGAAAAA 
TTAGAAAAAT 
GTATACTCAA 
CCTTTCTTTT 



41 
I 

CAGCAAAGCT 
TGAAGGGCAT 
CCATGTTCAA 
CAGATATTGA 
TTATTACCCT 
GGCTTATAAT 
CTGGAAAAGG 
AATTCTACAG 
TTATGTGAAG 
GAATGACAAT 
AAGCTACCTT 
TTGTTGTATT 
AACTATTCTG 
TCTTTATGCT 
AAGCCCTTAT 
GCAAATACAC 
TTATGCATCA 
ATGAAAAAAA 
AGCAGCACCG 
ATTAGGATAT 
TATCTTGGAC 
TAAGCAAATG 
ACATAATCTA 
GACTAGTTTA 
TTAAAAAAAA 



Seq ID NO: 491 Protein sequence 
Protein Accession #: NP_005400.1 

1 11 21 31 41 51

I I I I I I

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNP 

Seq ID NO: 492 DNA sequence 

Nucleic Acid Accession #: NM_000577.1

Coding sequence: 41..520



GGCACGAGGG 
CCGACCCTCT 
GAAGACCTTC 
CAATTTAGAA 
CCATGGAGGG 
GGAGGCAGTT 
CATCCGCTCA 
CCTCTGCACA 
CGTCATGGTC 
TCCCATTCTT 



11 
I 

GAAGACCTCC 
GGGAGAAAAT 
TATCTGAGGA 
GAAAAGATAG 
AAGATGTGCC 
AACATCACTG 
GACAGTGGCC 
GCGATGGAAG 
ACCAAATTCT 
GCATGGCAAG 



21 
1 

TGTCCTATCA 
CCAGCAAGAT 
ACAACCAACT 
ATGTGGTACC 
TGTCCTGTGT 
ACCTGAGCGA 
CCACCACCAG 
CTGACCAGCC 
ACTTCCAGGA 
GACTGCAGGG 



31 
I 

GGCCCTCCCC 
GCAAGCCTTC 
AGTTCCCGGA 
CATTGAGCCT 
CAAGTCTGGT 
GAACAGAAAG 
TTTTGAGTCT 
CGTCAGCCTC 
GGACGAGTAG 
ACTGCCAGTC 



41 

i 

ATGGCTTTAG 
AGAATCTGGG 
TACTTGCAAG 
CATGCTCTGT 
GATGAGACCA 
CAGGACAAGC 
GCCGCCTGCC 
ACCAATATGC 
TACTGCCCAG 
CCCCTGCCCC 



60 
120 
180 
240 
300 
360 



51 
I 

GAAGTAGCAG 
GGCTATAGCC 
AAGAGGACGC 
GAAAGCCTCC 
GAAAGAAAAT 
CAAAAAAGTT 
GCATCTGAAA 
TAGGAAACTG 
GATGAAAGGT 
CAGAATTCCA 
AAGAAAGGCT 
ATACATTCAT 
TTGTGACTAT 
ATATTACTAT 
AAGTCAAAAG 
ACTTCTTTCC 
TTTGGTTTGT 
TTATACGCTA
GTCTTAATTT 
GTGTGTTTAC 
ACATTTGAAA 
TATACCTAGC 
ATCAATTTCT 
AAGAATCAAA 
AAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



60 




CTGACGAAGG 
GCCTGCCTGT 
AGGGCTCCCG 







GCTATGGGGQ CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 
CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 
GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 
TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 
CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 
ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 
TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TTTTTTTTTT CAGTCCCCGT 
GAAGGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 
ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 
TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 
ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 
CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 
TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 
TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 
ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 
GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 
CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 
CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 493 Protein sequence 
Protein Accession #: NP_000568.1

PCT/US02/12476

1 11 21 31 41 51

I I I I I I

KALETICRPS GRKSSKMQAF RIWDVNQKTP YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 
HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSPES 
AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE 

Seq ID NO: 494 DNA sequence 

Nucleic Acid Accession #: NM_002081.1

Coding sequence: 222..1898



1 

1 

GGCTGCCCGA 
GGCTTTTGTT 
CGGGACCTTG 
AGAGGCGCGG 
GCTGGTGGCT 
GCAAGAGCCG 
GCGACGTGCC 
CCTGCTGCAC 
CCGCGCTCCG 
TCGATGACCA 
CCGGCGCCTT 
AGCTGCGCCT 
GGGCCCGCCT 
ACTACCTGGA 
GAGAGCTGCG 
TGGGCGTGGC 
CGAGAGCTGT 
CCTGCCCTGA 
ACGCCGAGTG 
CATCGGGTGT 
CCCTCCAGGA 
AGGTCAACCC 
GGGAGAGGCC 
GCGACGTCCA 
TGAGCACTGC 
AGGTCATGGG 
CCAAGCCGGA 
TGCGCAGCGC 
GCTCGGGCAG 
GCTCCAGCTC 
AGAAGACCTC 

GAGGCCAAGG 
TGGAGAGGCC 
GTCCCAGCCC 
CAGGTCAGCT 
TCCGGCTGCC 
CTACAGAGGA 
CGCCTCCTCC 
CCTCCAGAGA 
TCTGAGATGA 
CTGCGCCCTT 
GGAGTCTGAG 
GGGGCCACTG 
GGAGGCAGCG 
GCCTTGCTGG 
CCTGCTCCCA 
CAGGGCTCAG 
CCCCGCACTG 
GCACGGGGAC 
TGTCCTTGTT 
CACCTTGGAC 



11 
I 

GCGAGCGTTC 
GTCTCCGCCT 
GCTCTGCCCT 
GCGGGTGGCC 
GCTATGTGCG 
GAGCTGCGGC 
CCAGGCGGAG 
CAGCGAGATG 
GGACAGCAGC 
CTTCCAGCAC 
CGGAGAGCTG 
GTACTACCGC 
GCTOGAGCGC 
CTGCCIGGGC 
CCTGCGGGCC 
CAGCGACGTG 
CATGAAGCTG 
CTATTGCCGA 
GAGGAACCTC 
GGAGAGTGTC 
CAACAGGGAC 
CCAGGGCCCT 
ACCTTCAGGC 
GGACTTCTGG 
CAGTGATGAC 
TGACGGCCTG 
CATGACCATC 
CTACAACGGC 
CGGTGATGGC 
CCGGACGCCC 
GGCTGCCAGC 
TACAGTAGCC 
ACTGACTTTG 
TGGGGTGGGA 
CAGGCCTGGC 
GGGAGCCAGT 
TAGCCCTCCC 
GGCCTCAAAG 
CACTGGGACT 
AGCCCCGCAC 
TGCATGATGC 
GAGGGGCCCC 
GACTGTCCTC 
ACCCACCTGC 
TGGGCTCTGC 
GGTCCAGGGC 
TCCTCACCCA 
AGTGACCCTC 
CACACGGGAA 
CTGGATAGTT 
CATGGAGAGC 
CCTGGTGACC 



AGCGTCTGCA 
CCACAGACCC 
GCTTCTGCTG 
CAATGTGGGC 
TGTTGGAGGA 
GATCAGGAAC 
GGCTGTCACC 
TGCCTAGGTC 
AAGGGCTTTT 

TCCTGTCACT 



31 
I 

CCCCGCGCGC 
CX3CCGCCTCT 
GAACTGCGCA 
CCGGCCCCGC 
TGGTCGCCTG 
AGATCTACGG 
AGCACCTGCG 
TGGCCAACCG 
AGGCCATGCT 
ACTOGGAGOG 
ACGCGAGGGC 
TGCACCTGGA 
AGCTGCACCC 
AGGCGCTGCG 
TCGTGGCTGC 
TGGCTCAGGT 
CTCACTGCCT 
AGGGCTGCCT 
TGGTGCTCAT 
TGCACACGTG 
CCAAGGTCAT 
AGAAGCGGCG 
AGCTGGTCTC 
CAGGGACACT 
ACGGGATGGC 
TCAACAACCC 
TCATGCAGCT 
ACTTCCAGGA 
ACCTCTGOGG 
CCCTCCCAGG 
CCCCGACCTT 
GGCGGTAACT 
AACACAGACG 
CGGCGGCTCT 
TTTCTGCCTT 
GCCATGTATT 
TGCACCGCCG 
GAGCCCACAG 
CCCACCAGCC 
GGTGTCCGCC 
GCGCAGGCTG 
GGGTGACGOC 
TGCAGTGAGG 
GAGGAGGGGA 



CCCCGAGGGC 
CAGGGCCTCC 
TGCTCACAGG 
CCTTCCOGAC 
CCAAACATGC 
TCCCAGATGG 
CACTGAGGCC 



41 
I 

CCCGCGCCGC 
GGACCGCGAG 
GGACCCGGCC 
CATGGAGCTC 
CGCCCGCGGG 
AGCCAAGGGC 
GATCTGTCCC 
CAGCCATGCC 
TGCCACCCAG 
GACGCTGCAG 
CTTCCGGGAC 
GGAGACGCTG 
CCAGCTGCTG 
GCCCTTCGGG 
TCGCTCCTTT 
CCCCCTGGGC 
GGGAGTCCCC 
TGCCAACCAG 
CACCGACAAG 
GCTGGCGGAG 
CCAGGGCTGC 
CCGGGGCAAG 
TGAAGCCAAG 
GTGCAGTGAG 
CAGAGGCCGG 
CGAGGTGGAG 
GAAGATCATG 
CGCCAGTGAC 
CCGGAAGGTC 
CCTGTCAGAG 
CCTCCTGCCC 
GCCCCAAGGC 
ATATTTAATT 
GAGCAGGGGC 
TTAATTTTGT 
TCAGGGACCT 
CAGAAGCAGC 
CGAGCCTGTG 
AGCCCTGGCC 
ATCCAGGGTC 
CAGAGCCCGG 
TGAGACAGCA 
GGCCCTCCAT 
AGCTGGGCCC 
ACACAGGGCT 
TGAGGAGCAG 
CTGTTCACGG 
GATGCTGGTG 
CCAGCCAGCT 
ATCCATTTAC 
CTTCGGAGGC 
ATCAGGGCCC 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



51 
I 

CGCCGCCGCC 

CCGCGCGCGC 

AGGATCCGAG 

CGGGCCCGAG

GACCCGGCCA 

TTCAGCCTGA 

CAGGGCTACA 

GAGCTGGAGA 

CTGOGCAGCT 

GCCACCTTCC 

CTGTACTCAG 

GCCGAGTTCT 

CTGCCTGATG 

GAGGCCCCGA 

GTGCAGGGCC 

CCGGAGTGCT 

GGCGCCAGGC 

GCCGACCTGG 

TTCTGGGGTA 

GCCATCAACG 

GGGAACCCCA 

CTGGCCCCGC 

GCCCAGCTCC 

AAGATGGCCC 

TACCTCCCCG 

GTGGACATCA 

ACCAACCGGC 

GACGGCAGCG 

AGCAGGAAGA 

CAGGAAGGAC 

CTCCTCCTCT 

CCCAGGGACA 

CACCTCAGCC 

AGGCGCAGAG 

ATGAGGTCCT 

CAGGGGCACC 

CCCTCGAGGC 

CCTTCCTCCC 

CACCCCCCAQ 

TGGCAGAGCC 

CCCCACCTCC 

CCACTGCTGA 

GCGCAGATGA 

AAAGGCCCAG 

CACAGGGCAG 

CCAGGACCCG 

TGACACAGGT 

GCTGGTGAGA 

GCACTGCAGG 

TGACACTTCC 

CCGCAGGGCC 

TGCCCCAGGC 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880
2940 
3000 
3060 
3120 




CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG 
TGTGGTGTTO GGAAGGGGTC CTGCAGGGGG AGGAGGACTT 
TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC 
CGCACAGTGG AOGGAGGTCC CCGGTTGCTG GTCAGGTCCC 
CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG 
CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT 
CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT 
TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC 
CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 495 Protein sequence 
Protein Accession #: NP_002072.1



1 
I 

MELRARGWWL 
ICPQGYTCCT 
TLQATPPGAP 
QLLLPDDYLD 
PLGPECSRAV 
TDKFWGTSGV 
RGKLAPRERP 
RGRYLPEVMG 



11 
I 

LCAAAAItVAC 
SEMEENLANR 
GELYTQNARA 
CLGKQAEALR 
MKLVYCAHCL 
ESVIGSVHTW 
PSGTLEKLVS 
DGLANQINNP 



21 

I 

ARGDPASKSR 
SHAELETALR 
FRDLYSELRL 
PFGEAPRELR 
GVPGARPCPD 
LAEAINALQD 
EAKAQLRDVQ 
EVEVDITKPD 



31 
I 

SCGEVRQIYG 
DSSRVLQAML 
YYRGANLHLE 
LRATRAFVAA 
YCRNVLKGCL 
NRDTLTAKVI 
DFWISLPGTL 
MTIRQQIMQXi 



Seq ID NO: 496 DNA sequence

Nucleic Acid Accession #: NM_001650

Coding sequence: 40..1011



1 
I 

GGGGCAGGCA 
AGGCGGTGGG 
GGGGTCTGGA 
TTTGTTCTCC 
GTCGACATGG 
TTTGGCCATA 
AGGAAGATCA 
ATTGGAGCAG 
ACCATGGTTC 
TTTCAATTGG 
TCAATAGCTT 
ACTGGTGCCA 
GAAAACCATT 
TATGAGTATG 
AAAGCTGCCC 
GAGACGGATG 
GAGGAGAAGA 
CGCACTGAAA 
GAAACAGATT 
GTCTAAACAA 
TCCAAATCTA 
TCTAGTTACC 
CCTGACAGAA 
AGTCAATTCT 



11 

I 

ATGAGAGCTG 
GTAAGTGTGG 
CTCAAGCTTT 
TCAGCCTGGG 
TTCTCATCTC 
TCAGCGGTGG 
GCATCGCCAA 
GAATCCTCTA 
ATGGAAATCT 
TGTTTACTAT 
TAGCAATTGG 
GCATGAATCC 
GGATATATTG 
TCTTCTGTCC 
AGCAAACAAA 
ACCTGATTCT 
AGGGGAAAGA 
GCAGACAAGA 
TGTTATAAAT 
TAAATATTTC 
AAAAAAGAAA 
TTTCATTAAC 
CTCAAAGACA 
TATTTGAATA 



21 

I 

CACTCTGGCT 
ACCTTTGTGT 
CTGGAAAGCA 
ATCCACCATC 
CCTTTGCTTT 
CCACATCAAC 
GTCTGTCTTC 
TCTGGTCACA 
TACOGCTGGT 
CTTTGCCAGC 
ATTTTCTGTT 
CGCCCGATCC 
GGTTGGGCCC 
AGATGTTGAA 
AGGAAGCTAC 
AAAACCTGGA 
CCAATCTGGA 
CTCCTTAGAA 
TAGAAATGTG 
ATAATTTACA 
TATTTTTAAG 
AACCAATTTT 
CGTCTATCAG 
TTTATTCTAT 



31 

! 

GGGGAAGGCA 
ACCAGAGAGA 
GTCACAGCGG 
AACTGGGGTG 
GGACTCAGCA 
CCTGCAGTGA 
TACATCGCAG 
CCTCCCAGTG 
CATGGTCTCC 
TGTGATTCCA 
GCAATTGGAC 
TTTGGACCTG 
ATCATAGGAG 
TTCAAACGTC 
ATGGAGGTGG 
GTGGTGCATG 
GAGGTATTGT 
CTGTCCTCAG 
CAGGTTTGTT 
AAGGAGGAAC 
ATGTTCTTAA 
AACCGTGTGT 
CTTATTCCTT 
TAAACTGAGT 



GTGGCCCTGG 
GGAGGGTCTG 
TTTGCTTTTC 
CATGGCTTGT 
GCAAGTCCAC
GATGCCGGGC 
CAGACCCCAC 
AGTGGCCAAG 
GTGAOGTGTG 



41 
I 

AKGFSLSDVP 
ATQLRSFDDH 
ETIAEFWARL 
RSFVQGLGVA 
ANQADLDAEW 
GGOGNPKVNP 
CSEKMALSTA 
KIMTNRLRSA 



41 
I 

TGAGTGACAG 
ACATCATGGT 
AATTTCTGGC 
GAACAGAAAA 
TTGCAACCAT 
CTGTGGCCAT 
CCCAGTGCCT 
TGGTGGGAGG 
TGGTTGAGTT 
AACGGACTGA 
ATTTATTTGC 
CAGTTATCAT 
CTGTCCTCGC 
GTTTTAAAGA 
AGGACAACAG 
TGATTGACGT 
CTTCAGTATG 
ATTTCCTTCC 
GTTTCATGTC 
GGAAGAAACC 
GCAAATATAT 
CAAGATTTGG 
CTCTACTGGA 
TTAACAATGG 



GGAGGGGTGG 
GGGGCAGCTG 
ATCACCGTCC 
TCTCTGGAAC 
CCCATAATAA 
GCCAGGACAG 
CCTACGCTCA 
CCTGCTGTGT 
TTCTTTTGAG 



51 
I 

QAEISGEHLR 
PQHLLNDSER 
LERLFKQLHP 
SDWRKVAQV 
RNLLDSMVLI 
QGPGPEEKRR 
SDDRCWNGMA 
YNGNDVDPQD 



51 
I 

ACCCACAGCA 
GGCTTTCAAA 
CATGCTTATT 
GCCTTTACCG 
GGTGCAGTGC 
GGTGTGCACC 
GGGGGCCATC 
CCTGGGAGTC 
GATAATCACA 
TGTCACTGGC 
AATCAATTAT 
GGGAAATTGG 
TGGTGGCCTT 
AGCCTTCAGC 
GAGTCAGGTA 
TGACCGGGGA 
ACTAGAAGAT 
ACCCATTAAG 
ATATTACTCA 
TATTGTGAAT 
ACCTATTTTA 
TTAAGTCTTG 
ATATTGGTAT 
C 



Seq ID NO: 497 Protein sequence 
Protein Accession #: NP_001641.1 

11 



41 



51 



I 

CCCCCTTGTC 
TTGGTACCGG 
GACGGTTACC 
TGCTTGCTTT 
CATATATGGC 
CTTTTTCAAT 
CTCATATCCA 
ACTTTATAGA 
GCAGTTTGCT 
CGGGTACATT 
ACTTTGTTTT 
TTTGGTAATT 



11 
I 

ATTAATACAT 
ATTTATACCA 
AGAGGAGAAG 
TATGTTGCTG 
ACATATTTAA 
CATGGAGAGT 
TTTCTTGTTC 
GGAAGCTTGA 
CAGTTTGTAC 
GATATATGTA 
GTTTTGATGT 
ATTTGGGGTA 



21 
I 

TAAAAAGATT 
AAATAATGGA 
GACTCAGTCC 
TAATTTTTAT 
GTGGCAGCCG 
GTACCCGTGT 
TTCAGATGTT 
TTGCACTCTG 
TTCTTACTCA 
AATTACGGAA 
TTGGGAACTC 
TTCTGGCAAT 



31 
I 

CAATCTTTAC 
CTTGATTGGT 
TATTGAAAGC 
TTTAAATGGA 
ATTAGGAGGC 
AATGTGGACA 
GCTAGTGACT 
CATTTCCAAT 
GATTGCATCA 
GATCATTTAT 
AATGTTATTA 
GAAACCACAT 



41 

I 

CCTGAGGTAA 
ATTCAAACCA 
TGTGAAGGAT 
CTAATGATGG 
CTGGTTACAG 
CCACCTCTCC 
CATATTCTCA 
GTATTTTTCA 
TTATTTGCAG 
ATACACATGA 
ACTTCTTATT 
TTCCTGAAAA 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 






60 
120 
180
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



1 11 21 31 

I I I I I I

MSDRPTARRW GKCGPLCTRE NIMVAFKGVN TQAFWKAVTA EFLAMLIFVL LSLGSTINWG 
GTEKPLPVDM VLISLCPGLS IATMVQCPGH ISGGHINPAV TVAMVCTRKI SIAKSVPYIA 
AQCLGAIIGA GILYLVTPPS WGGLGVTMV HGNLTAGHGL LVELIITFQL VFTIPASCDS 
KRTDVTGSIA LAIGFSVAIG HLFAXNYTGA SMNPARSPGP AVIMGNWENH WIYWVGPIIG 
AVLAGGLYBY VPCPDVEFKR RFKEAFSKAA QQTKGSYMEV EDNRSQVETD DLILKPGWH 
VIDVDRGEEK KGKDQSGEVL SSV 

Seq ID NO: 498 DNA sequence 

Nucleic Acid Accession #: AB020684.1

Coding sequence: 1..1744 



60 
120 
180 
240 
300 



51 
I 

TTTTGGCCAG 
AGATATGTTG 
TGGGAGATCC 
CATTATTCTT 
TGTTGTGCTT 
GTGAAAGCTT 
GGGCTACAAA 
TGCTTCCTTG 
TATATGTTGT 
TTTCTCTTGC 
ATGCTTCTTC 
TAAATGTATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 




TGAACTTAGT TTATGGGTTA 
ATACTTGACA TCTAAAATTT 
ATCAAAATTC TTTAGTTATA 
TGACTTTATG GAAAAAGAGA 
TCTTGTAGTG TTTGTTGCTA 
TAAACAACAG ACACATGTAA 
ATTGCAATTG TTAGCATATA 
GACACCACAC ATGTGTGTTA 
CTTTTGCAAA GTACATCCTG 
AGGTTCAGCA AATCTGCAAA 
AGAAGAACTT ATAGAATGGA 
CATGCCCACG ATGGCAAGTG 
TTATGAAGAC GCAGGCTTGA 
AGCAGCCGAA GAAGTGAAGC 
AGAGTCATGG TGTGTAAGAA 
AGAAGATCCT GCCAATGCTG 
ACCTCACTTC ACCACTGTAT 
ATGACTGCTA CATGACCTGC 
GCTAAGTCAT GTGTTGTTCA 
AACGTTTTAT TTGGTCAATT 
CAATTGGTTA CTATTTCAAT 
TACTTGACTT TAATATTTGT 
GGGTTGTGAC AAGGATGACA 
TATAGTGTAA GAACTATTAA 
TGGACATTTC AGTGATTGTA 
ACAGTAGATG GGGCAGACAT 
CTTGTGCGCT TTTTGTTCTC 
CCAAACTCTT GGACCTTGTG 
GTGTCTCTTT CTCTCTCTCT 
AATAAGTACT GTTTACTCAT 
ATTAATGGTA ACTGTATTTT 
CTTTAAACCA CTTTAAAGTT 
ACAACCCCAA ATATAGTGCA 
TGGAACAGTA GACTGTAGTA 
TGACTAATTT TGCTGTCAAA 
CTTTTAAGAT TGCCTGTCTT 
TCTAAAAACC ATCATTTCAG 
TTCTTCCTAT TTATTTTTAT 
ATTTTTGTTT CTTAGAAGTT 
ATAGCTCTGA GAAAAGGTTT 
AGACTACACA TTTAATTATA 
TGGATTGTOC TTCAGACCTT 
GCTGTTCCCT TTCTACCTTC 
CAGCTTGCTG ACATAAGTCA 
CCCCCAGTCT TGTGCCAGCG 
CAGCCCAGGA GAGGGACTCT
CTGGTTGACT GTTTTTATTT 
TTGATTTGTG GATTAAAAGC 
CGGTGGACAC TTGAGGCTGA 
TGCTTATCTG TGATTGTTGC 
ATTTTTAAAA ATTATACTTT 
TCCAAAGAAG TTCACATGTA 
CTATTATAAC TGCTTCATTT 
GAAGGATCCT TTTGTAGCAG 
TAGGGGAGCC AGTTTGGAGC 
TCGCAGCCCA AGCACTGTGG 
TTCCATAGGC GTACAAAACA 
AATATTTTTG CTTTAGTATG 
TTGCTACAAC ATTTTCGAAA 
AAAAATGCTT GGCAAAAAAA 
GAAAGTATGT ATCAGGAATA 
ATGCCAGTTG TTTACATGTA 






TTCAAGGATG 
TTGGTATTGC 
AGGATTTTGA 
CTCCACTGAG 
TTGTTAGAAA 
GAAAACACCA 
CAGCCCTTGG 
TGGCATCACT 
GTGCTATTGT 
CCCAGTGGAA 
TCAAATATAG 
TTAAGCTCTC 
GAGCCAGAAC 
GAGAACTGAT 
GATCCAAGCC 
GGAAAACTCC 
TCCAGAACAG 
TGCCTAOGGA 
TATCCCAAAA 
TGAATGTCAT 
GCACCCTTTA 
GCTAAAGTGA 
TGAAAATACA 
TGCCCCTTGC 
AGTTCTTCGG 
GGAGTGTTTG 

GTTAGGAAAT 



TTAGTTGCTT 
TCTCATTTTT 
TTTTCATGTT 
TCTAGAAACT 
CATGGTAATT 
AATGTAAAGA 
TAATAAGACA 
TATAAGGAAT 
TTTGAAAAAT 
AATTTGTGTG 
TAGGGTTTTA 
CAGCTTCTCT 
AGrCCTCAGG 
TTACCAGAGC 
GCTGCAAAGG 
CCTAGAGCCG 
GGCAGCGTTC 
CTTAGGCTTT 
AACATTTGAG 
GGATGGGAGT 
TCACCTGAGT 
TACATTTATT 
ATAAGTAGAA 
GCTGGGAACC 
TGTTTATGAA 
AGAGGCCTGA 
AGCATCCACA 
GTATTAAAGC 
AGGAAAGTAA 
ACAAAGTTGG 
ATATAGTGTT 
AAGTGATATT 
CTATATATGT 



Seq ID NO: 499 Protein sequence 
Protein Accession #: BAA74900.1



PLVINTLKRP 
ACFYVAVIFI 
SYPPLVLQML 
GYIDICKLRK 
ELSLWVIQGC 
DFMEKETPLR 
XiQIiLAYTALG 
GSANLQTQWN 
YEDAGLRART 
EDPAKAGKTP 



11 
I 

NLYPEVILAS 
LNGLMMALFF 
LVTHILRATK 
1IYIHMISLA 
FWLFGTVILK 
YTKTliLLPW 
ILIMRI*KLFL 
IVGBFSNLPQ 
KIVYSMYSRK 
LCNLLVKDSK 



21 
I 

WVRIYTKIMD 
IYGTYLSGSR 
LYRGSLIALC 
LCFVLMFGNS 
YLTSKIPGIA 
LWFVAIVRK 
TPHMCVMASL 
EELIEWIKYS 
AAEEVKRELI 
PHFTTVFQNS 



TTTTTGGTTA 
AGATGACGCT 
TACTTTATTG 
ATACACAAAG 
GATTATTAGT 
GTTTGATCAT 
TATTTTAATT 
GATCTGCTCA 
GTTTGCTATA 
TATTGTAGGG 
TACTAAACCA 
TGCACTTCGG 
AAAAATAGTA 
AAAGTTAAAA 
TGGTTGCAGT 
CTTATGTAAC 
TGTTTACAAA 
GAACTACATC 
ACTTTTATAG 
TCTAATTATA 
AAATTTGCTA 
GCAAAGCTAC 
GGACAATTCT 
TTCTTTTTTC 
TCATGTCAGC 
CTATATAAAA 
TTAATGAAGC 
TCCCTTAACT 
CTCTTCTCCT 
ATCAAGTACT 
AGCATTATTC 
TAATTATAGT 
AATGTATATT 
TTTCTTTTAC 
ATAATGATAA 
AAGCCTTAAG 
AAGTATATTT 
TTCTACACCT 
AAATGAGATT 
AATTCTAAGC 
TTCTTAACCA 
CATGGTTTCT 
CCAAGGGCAG 
CTGAGGAGTG 
CAGCTCCCAG 
TTCAGATTTG 
TGCTAGTTTT 
CGATGATGCA 
TGACATGAGC 
GTGGCTGATT 
TTATATTTTT 
ATTCTGTATA 
ATTAAAAGTA 
TGTAAOCCCC 
AGGTCCCTGC 
CCTTTGATGG 
TCAGTGTTTT 
GGATGGGCAA 
GGCTGTATTT 
AAAATAGGCC 
GCATAGGAGT 
TAAATTAAAA 



31 
I 

LIGIQTKICW 
LGGLVTVLCP 
ISNVFFMLPW 
MLLTSYYASS 
DDAHIGNLLT 
IISDMWGVLA 
ICSRQLFGWL 
TKPDAVFAGA 
KLKVNYYILE 
VYKVLEWKE 



TTTGGAACTG 
CATATTGGCA 
TATACCTGTG 
ACATTATTGC 
GATATGTGGG 
GGAGAGCTGG 
ATGAGACTAA 
AGACAGCTAT 
TTAGCAGCAA 
GAGTTCAGCA 
GATGCAGTGT 
CCCATTGTGA 
TACTCAATGT 
GTGAACTATT 
ATGCCTGAAA 
CTCTTGGTGA 
GTCCTAGAAG 
TGTAATGGTT 
GTAACTGTTT 
AAAATGACTT 
TGCAAATGAG 
CTGTATAAAG 
GACAATGTAG 
TGCCTCTTGC 
CCCTGTCATC 
CTATCTGTTT 
TTTTCCTGCC 
TCCAGCCATA 
CTTCCCCATA 
TATTCTTGGT 
AAATGTTTAT 
TTTAAGAAAA 
TGATTAGACA 
TATTAAGATA 
ATGGAGTTTT 
CCTTATGTTA 
CGTCCTCCTC 
TCTTTGAATT 
CTTCAAAACG 
AAAGCGTGAC 
CAGGCAGATT 
GGTGCCCACT 
GCCTGGTCCC 
TGCCCTCAGA 
GGATGCTCCT 
TGGCCACTGT 
AGAAAATAGG 
CAACAGTCCA 
AGGGAGAGGG 
GTGTACATCC 
CTCACCCCCA 
GGAAAAAAGC 
ATATAAATTA 
AGCAAAATAT 
TATGCAGCCG 
CAATGCAGAT 
GCATATTGTT 
AGAAGCGATC 
CTTTAAAAAG 
AGTGATATTA 
ATTGTATTTT 
AAAATCATGA 



TCATACTTAA 
ACTTACTAAC 
CAGCGGAGTT 
TTCCAGTTGT 
GTGTCTTAGC 
TTTACCATGC 
AACTCTTCTT 
TTGGATGGCT 
TGTCAATACA 
ATTTGCCCCA 
TTGCGGGTGC 
ATCATCCACA 
ATAGTCGGAA 
ACATTCTAGA 
TTTGGGATGT 
AGGATTCCAA 
TTGTAAAAGA 
TTAATGTTTT 
TCAAATAGAA 
ACACCTTTAT 
TATATGCTTG 
AAAACACAGT 
GGGCTGATTT 
TCTTGTCTTT 
AACTTGAGTT 
GTTTTACTTC 
CATTATTAAT 
TGGCATTATC 
TTTTCTGTCA 
TTTAAAAAAA 
ATTTTAATAC 
ACTATTTTGA 
TCATTTATAG 
CAATAAAACA 
TTATATTTTA 
TAATTTTGGT 
TTTAGTTTTT 
CCTTGTATGA 
ATGAAACCTC 
TATGGCTGAC 
AACCTCATTG 
CCTGGAAGOC 
GGGGAAGCAG 
GAAGCACCGC 
TCCCTGGAGG 
TTCTCATTTG 
GAAGCAGCCC 
GGAAAATGGG 
AGGTGCGCGC 
AGCAGTTACA 
GTAATTTCCT 
ATTAAAAATA 
GCTTTTTCCA 
GGCTATATAT 
TGGCCACAGC 
TGGTAGCAGG 
AGCATTTACA 
AAAATAGCTA 
ATAAGCCTCT 
ATGAGAAAAT 
TATGAATTTT 
GAAATG 



41 
I 

TVTRGEGLSP 
FPNHGECTRV 
QFAQFVLLTQ 
LVIIWGILAM 
SKPPSYKDFD 
KQQTHVRKHQ 
PCKVHPGAIV 
MPTMASVKLS 
ESWCVRRSKP 



51 
I 

IESCEGLGDP 
MWTPPLRBSP 
IASLFAVYW 
KPKFLKINVS 
TLLYTCAAEP 
PDHGEXiVYHA 
FAILAAMSIQ 
ALRPIVNHPH 
GCSMPEIWDV 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 500 DNA sequence 

Nucleic Acid Accession #: NM_001276.1

Coding sequence: 127..1279



51 



1 11 21 31 41 

I I I I I 

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGTCTAGG 
TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 



60 
120 




GCCAGAATGG GTGTGAAGGC GTCTCAAACA GGCTTTGTGG 
TGCTCTGCAT ACAAACTGGT CTGCTACTAC ACCAGCTQGT 
GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA 
GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA 
ATGCTCAACA CACTCAAGAA CAGGAACCCC AACCTQAAGA 
TGGAACTTTG GGTCTCAAAG ATTTTCCAAG ATAGCCTCCA 
TTCATCAAGT CAGTACOGCC ATTCCTGOGC ACCCATGGCT 
TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA 
GCCGAATTTA TAAAGGAAGC CCAGCCAGGG AAAAAGCAGC 
TCTGOGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG 
GATTTCATTA GCATCATGAC CTACGATTTT CATGGAGCCT 
CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG 
TATGCTGTGG GGTACATGTT GAGGCTGGGG GCTCCTGCCA 
CCCACCTTCG GGAGGAGCTT CACTCTGGCT TCTTCTGAGA 
TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG 
ATCTGTGACT TCCTCCGCGG AGCCACAGTC CATAGAACCC 
GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG 
CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT 
TTCCAGGGCT CCTTCTGCGG CCAGGATCTG CGCTTCCCTC 
GCACTCGCTG CAACGTAGCC CTCTGTTCTG CACACAGCAC 
CCCCCTCTGG CTCCAGCTGG COGGGAGCCT GATCACCTGC 
GCCTCAGTCT CCCTCCCTTO GGGCCTATGC AGAGGTCCAC 
GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT 
GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA 
TGGCAAGGGA ATTTCTTCAA CTCCCTGCCC CCTAGCCCTC 
TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA 
TACCCCCTGC AAAOCCAGCT TGAAACCTTC ACTTAGGAAC 
ACTTCCCCTT CCTAATTCCA CAGCTGCTCA ATAAAGTACA 
CGCTTTGCTT TGGTCTATCT TTGAGCGCCC ACTAGACCCA 
TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC 
ATGTT 

Seq ID NO: 501 Protein sequence 
Protein Accession #: NP_001267.1






MGVKASQTGF 
ISNDHIDTWE 
KSVPPFLRTH 
GKVTIDSSYD 
VGYMLRLGAP 
DFLRGATVHR 
GSFCGQDLRF 



11 
I 

WhVhLQCCS 
WNDVTLYGML 
GPDGLDLAWL 
IAKISQHLDF 
ASKLVMGIPT 
TLGQQVPYAT 
PLTMAIKDAli 



21 
I 

AYKLVCYYTS 
NTLKNRNPNL 
YPGRRDKQHF 
ISIMTYDFHG 
FGRSFTLASS 
KGNOWVGYDD 
AAT 



31 
I 

WSQYKEGDQS 
KTLLSVGGWN 
TTLIKEMKAE 
AWRGTTGHHS 
ETGVGAPISG 
QESVKSKVQY 



TCCTGGTGCT 
CCCAGTACCG 
CCCACATCAT 
ATGATGTGAC 
CTCTCTTGTC 
ACACCCAGAG 
TTGATGGGCT 
CCCTAATCAA 
TCCTGCTCAG 
CCAAGATATC 
GGCGTGGGAC 
ACAGATTCAG 
GTAAGCTGGT 

GGACCCTTGC 
TCGGCCAGCA 
AAAGCGTCAA 
GGGCCCTGGA 
TCACCAATGC 
GGGGGCCAAG 
CCTGCTGAGT 
AACACACAGA 
AGTGAGGCAT 
TGTTTACAGA 
CTTATCAAAG 
CACAGTGACC 
GTAATCGTGT 
AGAGTTTAAC 
CTGGACTCAC 
TTGCAGAGAT 



41 
I 

CFPDALDRFL 
FGSQRFSKIA 
FIKEAQPGKK 
PIiFRGQEDAS 
PGIPGRFTKE 
LKDRQLAGAM 



GCTCCAGTGC 
GGAAGGCGAT 
CTACAGCTTT 
GCTCTACGGC 
TGTCGGAGGA 
TCGCCGGACT 
GGACCTTGCC 
GGAAATGAAG 
CGCAGCACTG 
CCAACACCTG 
CACAGGCCAT 
CAACACTGAC 
GATGGGCATC 
AGCCCCAATC 
CTACTATGAG 
GGTCCCCTAT 
AAGCAAGGTG 
CCTGGATGAC 
CATCAAGGAT 
GATGCCCCGT 
CCCAGGCTGA 
TTTGAGCTCA 
CGCAATGTAA 
TCCCCAAGCC 
GACACCATTT 
ATACTAATTA 
CCCCTATCCT 
AGTGTGTTGG 
CTCCCCCATC 
GAAGGCCGCC 



51 
I 

CTHIIYSFAN 
SNTQSRRTFI 
QLLLSAALSA 
PDRFSNTOYA 
AGTLAYYEIC 
VWALDLDDFQ 



Seq ID NO: 502 DNA sequence

Nucleic Acid Accession #: NM_006474.1 

Coding sequence: 181..669



GCTGCCTAGG 
TCCGGCCCCC 
TTCCCCCAGC 
ATGTGGAAGG 
GAAGGAGCCA 
GTTGCCATGC 
AAGTCTGGCT 
GAGGATCTGC 
GCCTCAAACG 
GTTGAGAAAG 
GCCATCGGTT 
TCGCCCTAAA 
TTCTGACTCT 
OGGGCCCATT 
TCACCAGATT 



11 

I 

GTCTGGAAAG 
CCACCGTCGC 
TCAGAATCTT 
TGTCAGCTCT 
GCACAGGCCA 
CAGGTGCCGA 
TGACAACTCT 
CAACTTCAGA 
TGGCCACCAG 
ATGGTTTGTC 
TCATTGGTGG 
GAGCTGAAGG 
GTGGCCCTGT 
CAGATTCCAC 
TGGTTCTTAA 



21 
I 

CTCGGGCACC 
GCTCCTCCAG 
GCTGCTCGGC 
GCTCTTCG7T 
GCCAGAAGAT 
AGATGATGTG 
GGTGGCAACA 
AAGCACAGTC 
TCACTCCACG 
AACAGTGACC 
AATCATCGTT 
GTTACGCCCT 
CCCTGAGCTC 
GGTGACTTTC 
ACTTT 



Seq ID NO: 503 Protein sequence 
Protein Accession #: NP_006465.1 

1 11 21 31 41 51

I I I I I I 

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 
KSGLTTLVAT SVNSVTGIRI EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 
VEKDGLSTVT LVGIIVGVLL AIGFIGGIIV WMRKMSGRY SP 

Seq ID NO: 504 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 62..895



11 



21 



41 



31 

I I ! I ! 

CACTGCTCTG AGAATTTGTG AGCAGCCCCT AACAGGCTGT TACTTCACTA 
TATGATCATC TTAATTTACT TATTTCTCTT GCTATGGGAA GACACTCAAG 
CAAGGATGGA ATTTTTCATA ACTCCATATG GCTTGAACGA GCAGCCGGTG 
AGAAGCACGG TCTGGCAAAT ACAAGCTCAC CTACGCAGAA GCTAAGGCGG 
TGAAGGCGGC CATCTCGCAA CTTACAAGCA GCTAGAGGCA GCCAGAAAAA 



51 
I 

CAACTGACGA 
GATGGGGATT 
TGTACCACAG 
TGTGTGAATT 
TTGGATTTCA 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860 
1920 



60 
120 
180 
240 
300 
360 



31 


41 


51 




1 

CTCCCTCTCC 


1 

GGGGCTCCTG 


1 

CTCCCACCCC 


60 


GCTGGGCCTG 


TGGCCGCGGT 


GCTTTTAATT 


120 


CCCCAGGAGA 


GCAACAACTC 


AACGGGAACG 


180


TTGGGAAGCG 


CGTCGCTCTG 


GGTCCTGGCA 


240 


GACACTGAGA 


CTACAGGTTT 


GGAAGGCGGC 


300 


GTGACTCCAG 


GAACCAGCGA 


AGACCGCTAT 


360 


AGTGTCAACA 


GTGTAACAGG 


CATTCGCATC 


420 


CACGCGCAAG 


AACAAAGTCC 


AAGCGCCACA 


480 


GAGAAAGTGG 


ATGGAGACAC 


ACAGACAACA 


540 


CTGGTTGGAA 


TCATAGTTGG 


GGTCTTACTA 


600 


GTGGTTATGC 


GAAAAATGTC 


GGGAAGGTAC 


660 


GCTTGCCAAC 


GTGCTTTAAA 


AAAAGACCGT 


720 


GTGGGGAGAA 


GATGACCCTG 


GGAACATTTG 


780 


CGTTTGCCAA 


ATTAACCGAG 


GAAAGACCTT 


840 



60 
120 



60 
120 
180 
240 
300 




TGTCTGTGCT GCTGGATGGA TGGCTAAGGG CAGAGTTGGA 
GCCCAACTGT GGATTTGGAA AAACTGGCAT TATTGATTAT 
TGAAAGATGG GATGCCTATT GCTACAACCC ACACGCAAAG
AGATCCAAAG CAAATTTTTA AATCTCCAGG CTTCCCAAAT 
CTGCTACTGO CACATTAGAC TCAAGTATGG TCAGCGTATT 
TGACCTTGAA GATGACCCAG GTTGCTTGGC TGATTATGTT 
TGATGTCCAT GGCTTTGTGG GAAGATACTG TGGAGATGAG 
TACAGGAAAT GTCATGACCT TOAAGTTTCT AAGTGATGCT 
CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC 
TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGGAAGA 
AAAAAAAGGA TGATCAAAAC ACACAGTGTT TATGTTGGAA
CTCACTGTTA TTATTAACAT TTATTTATTA TTTTTCTAAA 
TAGGGAAAAT TGGAAAATAT AGGAAACTTT AAACGAGAAA 
ACTGCATAGA AATAACAAGC GTTAACATTT TCATATTTTT 
TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT 
TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTGAAC 
TCATTGATTA TTCTACAAAA ACATGATTTT AAACAGCTGT 
TGTTTTATGC ATTATTTAAG CCTGTCTCTA TTGTTGGAAT 
ATTGTTGCAA TAAATATCCT TGAACACACA AAAAAAAAAA 

Seq ID NO: 505 Protein sequence 
Protein Accession #: Eos sequence 

1 11 






TACCCCATTG 
GGAATCCGTC 
GAGTGTGGTG 
GAGTACGAAG 
CACCTGAGTT 
GAAATATATG 
CTTCCAGATG 
TCAGTGACAG 
AGTCAAGGAA 
TTTAGCCACT 
TCTTTTGGAA 
TGTGAAAGCA 
ATGAAACCTC 
TTCTTTCAGT 
TTGAAATTTT 
TTTATAAACA 
AAAATATTCT 
TTCAGGTCAT 
AA 



TGAAGCCAGG 
TCAATAGGAG 
GCGTCTTTAC
ATAACCAAAT 
TTTTAGATTT 
ACAGTTACGA 
ACATCATCAG 
CTGGAGGTTT 
AAAATACAAG 
TATAAAAAAA 
CTCCTTTGAT 
ATACATAATT 
TCATAATCCC 
CATTTTTCTA 
GGAATCCTGC 
TTTTCTGAAA 
ATGATATGAA 
TTTCATAAAT 



51 



21 31 41 

I I I I I I

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVKGPVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRPSHL 

Seq ID NO: 506 DNA sequence 

Nucleic Acid Accession #: NM_007115.1

Coding sequence: 69..902



GAATTCGCAC 
CTGACGATAT 
GGGGATTCAA 
ACCACAGAGA 
GTGAATTTGA 
GATTTCATGT 
AGCCAGGGCC 
ATAGGAGTGA 
TCTTTACAGA 
ACCAAATCTG 
TAGATTTTGA 
GTTAOGATGA 
TCATCAGTAC 
GAGGTTTCCA 
ATACAAGTAC 
AAAAAAAAAA 
TTGATCTCAC 
TAATTTAGGG 
ATCCCACTGC 
TTGTATTTGT 
CCTGCTCTAT
TGAAATCATT 
ATGAATGTTT 
TAAATATTGT 



11 
I 

TGCTCTGAGA 
GATCATCTTA 
GGATGGAATT 
AGCACGGTCT 
AGGCGGCCAT 
CTGTGCTGCT 
CAACTGATGA 
AAGATGGGAT 
rCCAAAGCGA 
CTACTGGCAC 
CCTTGAAGAT 
TGTCCATGGC 
AGGAAATGTC 
AATCAAATAT 
TACTTCTACT 
AAGGATGATC 
TGTTATTATT 
AAAATTGGAA 
ATAGAAATAA 
GGTATATGTA 
GTACAGTTTT 
GATTATTCTA 
TATGCATTAT 
TGCAATAAAT 



21 
I 

ATTTGTGAGC 
ATTTACTTAT 
TTTCATAACT 
GGCAAATACA 
CTCGCAACTT 
GGATGGATGG 
TTTGGAAAAA 
GCCTATTGCT 
ATTTTTAAAT 
ATTAGACTCA 
GACCCAGGTT 
TTTGTGGGAA 
ATGACCTTGA 
GTTGCAATGG 
GGAAATAAAA 
AAAACACACA 
AACATTTATT 
AATATAGGAA 
CAAGCGTTAA 
TATATGTACC 
GTATTATACT 
CAAAAACATG 
TTAAGCCTGT 
ATCCTTCGGA 



31 

I

AGCCCCTAAC 
TTCTCTTGCT 
CCATATGGCT 
AGCTCACCTA 
ACAAGCAGCT 
CTAAGGGCAG 
CTGGCATTAT 
ACAACCCACA 
CTCCAGGCTT 
AGTATGGTCA 
GCTTGGCTGA 
GATACTGTGG 
AGTTTCTAAG 
ATCCTGTATC 
ACTTTTTAGC 
GTGTTTATGT 
TATTATTTTT 
ACTTTAAACG 
CATTTTCATA 
TATATGTATT 
TTTTAAATCT 
ATTTTAAACA 
CTCTATTGTT 
ATTC 



TGCATTTGAA 
TGAACTTTAT 
GCTGTAAAAT 
GGAATTTCAG 



Seq ID NO: 507 Protein sequence 
Protein Accession #: NP_009046.1

1 11 21 31 41 51 

I I I I I I

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKRIPKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 



Seq ID NO: 508 DNA sequence 

Nucleic Acid Accession #: NM_001044.1

Coding sequence: 129..1991



ACCGCTCCGG 
AAAGCCCAGG 
GTGTGCCCAT 
CTAAGGAGCC 
ACGGAGTGCA 
AGGATCGGGA 
TGGACCTGGC 



11 
I 

AGCGGGAGGG 
CCCGGGCGGC 
GAGTAAGAGC 
CAATGCCGTG 
GCTCACCAGC 
GACCTGGGGC 
CAACGTCTGG 



21 

! 

GAGGCTTCGC 
CAGACCAAGA 
AAATGCTCCG 
GGCCCGAAGG 
TCCACCCTCA 
AAGAAGATCG 
CGGTTCCCCT 



31 

I 

GGAACGCTCT 
GGGAAGAAGC 
TGGGACTCAT 
AGGTGGAGCT 
CCAACCCGCG 
ACTTTCTCCT 
ACCTGTGCTA 



41 



51 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 




60 
120 
180 
240 



CGGCGCCAGG ACTCGCGTGC 
ACAGAATTCC TCAACTCCCA 
GTCTTCCGTG GTGGCCCCGG 
CATCCTTGTC AAGGAGCAGA 
GCAGAGCCCC GTGGAGGCCC 
GTCCGTCATT GGCTTTGCTG 
CAAAAATGGT GGCGGTGCCT 



60 
120 
180 
240 
300 
360 
420 
TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT
TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG COGCTGGTGT 
TGAAAGGTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA 
TCATCATOGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT 
TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA
GTGGAGACAG CTCGGGCCTC AAOGACACTT TTGGGACCAC 
AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA 
GGCAGCTCAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT 
GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC 
CTGCCCTGCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT 
TGAGCGTTGA CTTCTACCGG CTCTGOGAGG OGTCTGTTTG 
TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC 
TCACCAACAA CTGCTACAGG GACGGGATTG TCACCACCTC 
TCTCCTCCGG CTTCGTCGTC TTCTCCTTCC TGGGGTACAT 
CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT 
TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CGGTGGTCTT 
TGGGTATCGA CAGCGCCATG GGTGGTATGG AGTCAGTGAT 
TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT 
TCCTGTCCCT GTTCTGCGTC ACCAACGGTG GCATCTAOGT 
TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA 
TCTATGGTGT TGGGCAGTTC AGOGACGACA TCCAGCAGAT 
TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT 
TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA 
ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT 
ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT 
AGAAGGACCG TGAGCTGGTG GACAGAGGGG AGGTGCGCCA 
TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT 
ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG 
CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT 
TCCGTGCCGG GAGCGCACCT CGCCGTGTCT TGTGTTGCTG 
GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA 
TCTGTGTGAG GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG 
CACTGTGTTC TCAGGCGGGG ATCACGATCC TTGTAGACGC 
CTCACAGTAG CTTCCTAGAC CATTTACTTT GCCCATATTA 
GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA 
GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTCA 
GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT 
ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA 
GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG 
GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 
AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC 
AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 
TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 
GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 
CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC 
CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG 
TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA 
CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT 
ATCAAAACAA ATTCAAGAAT GCAGTATCCG CGAGCCTGCT 
TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG 
CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA 
TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG 
TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA 
GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGOAT 
GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC 
GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC 
GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT 
AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA 
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG 

Seq ID NO: 509 Protein sequence 
Protein Accession #: NP_001035.1






KSKSKCSVGL 
ETWGKKZDFL 
GQFNRBGAAG 
NNSWNSPNCS 
TACLVLVIVL 
DFYRLCEASV 
GFWFSPLGY 
DSAMGGMBSV 
GTSILFGVLI 
VTFRPPHYGA 
RELVDRGEVR 



11 
I 

MSSWAPAKE 
LSVIGPAVDL 
VWKICPILJCG 
DAHPGDSSGD 
LYFSLWKGVK 
WIDAATQVCP 
MAQKHSVPIG 
ITGLIDEFQL 
EAIGVAWFYG 
YIFPDWANAI* 
QFTLRHWUCV 



21 
I 

PNAVGPKEVE 
ANVWRPPYLC 
VGFTVILISL 
SSGLNDTFGT 
TSGKWWITA 
SLGVGFGVLI 
DVAKDGPGLI 
LHRHRELPTL 
VGQFSDDIQQ 
GWVIATSSMA 



31 
I 

LILVKEQNGV 
YKNGGGAFLV 
YVGFFYNVII 
TPAAEYFERG 
TMPYWLTAL 
AFSSYNKFTN 
PIIYPEAIAT 
PIVLATFLLS 
WTGQRPSLYW 
MVPIYAAYKF 



GCCACTTTTC 
CTGGAAGATC 
TGTCGGCTTC 
CACCACGGAG 
TGCCCATCCT 
ACCTGCTGCC 
CGACCTGGGG 
CTACTTCAGC 
CATGCCATAC 
AGACGGCATC 
GATTGACGCG 
CTTCTCCAGC 
CATCAACTCC 
GGCACAGAAG 
CATCATCTAC 
CTTCATCATG 
CACGGGGCTC 
CATCGTCCTG 
CTTCACGCTC 
AGCCATCGGA 
GACCGGGCAG 
TCTCCTGTTC 
CATCTTCCCC 
GGTGCCCATC 
GGCCTACGCC 
GTTCAOGCTC 
CATCCTGCAA 
GGCAACTTCT 
CTTCTGACTG 
TAATAACGAC 
AAACGTCTAA 
CTCTGAGGCT 
ACCTGCTGAG 
AAAAGCCAAG 
TTCATGCAAA 
GTTGACACAT 
CTCACCAGGA 
GCTCAGGCTA 
AGGAGCGTGT 
GACGCATGCA 
AGGAGCGTGT 
GACGCATGCA 
AGGAGCGTGT 
CCCAGGACGC 
CCCACAGGAG 
ACCAACACTC 
TTTCTCTCAG 
AAAATCATAT 
TGCTGATATT 
CCAAAAGCTG 
GTTTGAATTC 
TGCAGGGCCA 
ATGCTCGGTG 
GGCTTCCCCA 
CGCTGCAGTC 
AGTTTCCCCA 
TCTGGGGAGG 
TTCAGCATCG 
GAATTC 



41 
I 

QLTSSTLTNP 
PYLLFMVIAG 
AWALHYLFSS 
VLHLHQSHGI 
LLRGVTLPGA 
NCYRDAIVTT 
LPLSSAWAW 
LFCVTNGGIY 
RLCKKLVSPC 
CSLPGSFREK 



TACATGGAGC 
TGCCCCATAC 
TTCTACAACG 
CTCCCCTGGA 
GGTGACTOCA 
GAGTACTTTG 
CCTCCGCGGT 
CTCTGGAAGG 
GTGGTCCTCA 
AGAGCATACC 
GCCACCCAGG 
TACAACAAGT 
CTGACGAGCT 
CACAGTGTGC 
CCGGAAGCCA 
CTGCTCACCC 
ATCGATGAGT 
GCGACCTTCC 
CTGGACCATT 
GTGGCCTGGT 
CGGCCCAGCC 
GTGGTCGTGG 
GACTGGGCCA 
TATGCGGCCT 
ATTGCACCCG 
CGCCACTGGC 
TGGGAGAGAC 
ACTCTTCAAC 
TTTACACCTT 
GTAGATCTGT 
CTTCATGCTG 
GCCCCAGGGG 
AATCCCCGTG 
TGTCCTGCTT 
GTCCTTTCCC 
TGCACACACA 
AATTCTGTTT 
CTGCCACTCA 
CCTATCCCCG 
GGGCCCCCAC 
ACTACCCCAG 
GGGCCCCCAC 
CCTATCCCCG 
ATGCAGGGCC 
CGTGTACTAC 
TGCCTGGCCT 
GTGCGTGCCA 
TTACCTGTGA 
GCAGTTTTTG 
CTTTCCATGG 
ATGCCTCAAG 
GTCATGGCTG 
GGAGCCGTCA 
TTGCCTTCTG 
AGCACAGAGA 
TCGCCTTCTG 
GGCTCCGTGT 
TGTGGGTCCC 



51 
I 

RQSPVEAQDR 

MPLFYKELAL 

FTTELPWIHC 

DDLGPPRWQL 

IDGIRAYLSV 

SINSLTSFSS 

FFIMLLTLGI

VFTLLDHFAA 

FLLFWWSI 

IAYAIAPEKD 



Seq ID NO: 510 DNA sequence 

Nucleic Acid Accession #: NM_001216.1

Coding sequence: 43..1422



11 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480
3540 
3600 
3660 
3720 
3780 
3840 
3900 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



21 31 41 51 

i I I I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 
AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 
CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 
TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 



60 
120 
180 
240 
AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG
GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC 
TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG 
AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC 
CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GOGGGCOGCT 
CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC 
CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC 
CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGOGGGAGT 
CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA 
CCTGCCGAGA TCCAOGTGGT TCACCTCAGC ACOGCCTTTG 
GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG 
AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG 
CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG 
TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG 
CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT 
GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC 
GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTOGGG 
AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT 
ACCAGOGTOG CGTTCCTTGT GCAGATGAGA AGGCAGCACA 
GTGAGCTACC GCCCAGCAGA GGTAGCOGAG ACTGGAGCCT 
TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA 
ATGOCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA 



Seq ID NO: 511 Protein sequence 
Protein Accession #: NP_001207.1



l 

r 

MAPLCPSPWL 
GEEDLPSEED 
DPQEPQNNAH 
ELLGFQLPPL 
VEGHRFPAEI 
EEGSETQVPG 
DTLWGPGDSR 
GLLPAVTSVA 



11 

r 

PLLIPAPAPG 
SPREEDPPGE 



PELRLRNNGH 
HWHLSTAFA 
LD IS ALL PSD 
LQLNPRATQP 
PLVQMRRQHR 



21 
I 

LTVQLLLSLL 
EDLPGEEDLP 
WRYGGDPPWP 
SVQLTLPPGL 
RVDEALGRPG 
FSRYPQYEGS 
LNGRVIEASF 
RGTKGGVSYR 



31 
I 

LLMPVHPQRL 
GEEDLPEVKP 
RVSPACAGRF 
EMALGPGREY 
GLAVLAAFLE 
LTTPPCAQGV 
PAGVDSSPRA 
PAEVABTGA 



AGGAGGATCT 
CTAAATCAGA 
GAGATCCTCA 
ATTGGCGCTA 
TCCAGTCCCC 
TGGAACTCCT 
ACAGTGTGCA 
ACCGGGCTCT 
CTGTGGAAGG 
CCAGAGTTGA 
AGGAGGGCCC 
CTGAGGAAGG 
ACTTCAGCCG 
TCATCTGGAC 
CTGACACCCT 
CTTTGAATGG 
CTGCTGAGCC 
TTGGCCTCCT 
GAAGGGGAAC 
AGAGGCTGGA 
ACTGTCCTGT 
AATATTTATA 



41 

1 

PRMQEDSPLG 



ACCTGGAGAG 
AGAAGAGGGC 
AGAACCCCAG 
TGGAGGOGAC 
GGTGGATATC 
GGGCTTCCAG 
ACTGACCCTG 
GCAGCTGCAT 
CCACCGTTTC 
CGAGGCCTTG 
GGAAGAAAAC 
CTCAGAGACT 
CTACTTCCAA 
TGTGTTTAAC 
GTGGGGACCT 
GCGAGTGATT 
AGTCCAGCTG 
TTTTGCTGTC 
CAAAGGGGGT 
TCTTGGAGAA 
CCTGCTCATT 
AT 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500
Seq ID NO: 512 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3978



ATGGTGGGTG 
TTTGCAGAAA 
TTAGCACCCA 
ACGCCGGTGA 
TCGACATATG 
GTAGCAAGGG 
ACACGCGTGT 
CCGACAGTTC 
GTTGGCATTG 
GCCCTTGCCT 
TTGGTTTTTG 
CTCAATATAC 
CCAGCCACCA 
CCCACAGCTC 
GCCAAGCTCA 
ACAATGAATG 
TTTACCAACA 
TTTGTCCAAA 
ACATTATOCT 
ATTGCCATGT 
ATGGCTGAAG 
CCATCTTACA 
TTGACATGGG 
AGGCATTTAT 
GGAGCCACTG 
TTTGTGGTGA 
GCAGTGTTTG 
AAGGATGAAT 
GCCAAATACC 
CTCCTTGCAG 
ACTTTGGCCT 
CTCTTTGGAG 
CAGAAGGACC 
AACCTCTCTG 
CAGCTCTACC 
TTTGAGGAGT 
CTACAGTTCT 
AAGGGAACCC 
CTGCGAGGAT 
TTCAAGGAGA 
CTCTTCACTG 
CTGGGTCTCT 
ATGTGTGAGG 
ACTGCAAGCA 



11 
i 

AAGGACCCTA 
GATATGACCC 
ACCCGGTGGA 
TGGTGAAAGG 
ACTCATCTGA 
TGGGTCCTGA 
TGATGGACAT 
TCATTCACCA 
GACTGTGCAT 
GGGCCATCAA 
AAAACCTAGT 
TGTCAAGTGA 
TCCCGATCCT 
TCATCGGGAT 
ATTCAGCTTT 
AGTTTCTGAC 
CTATCCAAGA 
GTGGAAACTC 
GCCACATCCT 
TTAATGTAAT 
CGAATGTCTC 
TCACCCAACC 
AGCATGAAGC 
GCAAGAAACA 
GCCCAGAGGA 
GAAAGTTATG 
TTGGGAGAAT 
CTAGAAGGCT 
TGGGGAAGAT 
CTCTCCTAGG 
ACGTTTCACA 
AAAAGTATGA 
TGAGCAACCT 
GGGGGCAGAG 
TGCTGGACGA 
GCATTAAGAA 
TAGAGTCTTG 
ACAAGGAGTT 
TGCAGTTCAA 
GCCCTGCTGA 
TGTTCCTCTT 
GGTTGGACAA 
TCGGCGGGGT 
TGGTGTTCAT 



21 
I 

CCTTATCTCA 
CAGCCTGAAG 
TGATGCCGGG 
CTACCGGCAA 
CACCAATGCC 
GAAGGCCTCT 
CGTGGCCAAC 
AATCCTCCAG 
AGCCCTTTTT 
CTACCGCACG 
GTCCTTCAAG 
TAGCTATTCT 
AATGGTCTTT 
ATCAGTGTAT 
CCGAAGGTCA 
CTGCATGAGG 
TATAAGAAGG 
TGCCCTGGCC 
CCTGAGAOGC 
GAAGTTTTCC 
TCTAAGGAGA 
AGAAGACCCA 
CAGCAGGAAA 
GAGGTCAGAG 
GCAAAGTGAC 
TCGTTATCCC 
CATCAGAGGA 
TCTTACTTGG 
CTTGGGAATA 
ACAGATGCAG 
GCAGGCATGG 
TCACCAAAGG 
CCCCTATGGA 
GCAGAGGATT 
CCCCCTGTCG 
GACGCTCAGG 
TGATGAAGTT 
AATGGAGGAG 
GGATCCTGAA 
GAGAGAGGAA 
CCTCCTGATG 
GGGCTCAOGG 
GCTGGCAGAC 
GCTGGTGTTT 



31 
1 

GATCTGGACC 
ACCATGATCC 
CTACTCTCCT 
AGGCTGACCG 
AAAAGATTTC 
CTGAGCCACG 
ATCCTGTGCA 
CAGACTGAGA 
GCCACCGAGT 
GCCATCGGGT 
ACATTGACCC 
TTGTTTGAAG 
TGTGOGGGGT 
GTCATATTCA 
GCAATTTTGG 
CTGATCAAAA 
AGGGAAAGAA 
OCCATCGTGT 
AAACTCACCG 
ATTGCAATCT 
ATGAAGAAAA 
GATACTGTCT 
AGTACCCCAA 
GCATACAGTG 
AGCCTCAAAT 
GAAGCCCAGC 
TACAGGCCTC 
CCCCAAGAAG 
TGTGGGAATG 
CTGCAGAAAG 
ATCTTTCATG 
TATCAGCACA 
GACCTGACTG 
AGCCTGGCCC 
GCCGTGGACG 
GGAAAGACAG 
ATTTTATTAG 
AGAGGGOGCT 
CACCTTTACA 
GATGCTGGTA 
ATTGGCAGOG 
ATGACCTGTG 
ATOGGTCAGC 
GGCGTCACCA 



QSPVDIRPQL 
RALQLHLHWG 
EGPEENSAYE 
IWTVFNQTVM 
AEPVQLNSCL 



41 
I 

AGCGAGGCCG 
CAGTGCGACC 
TCGCCACATT 
TAGACACCCT 
GAGTCCTTTG 
TGGTGTGGAA 
TCATCATGGC 
GGACCTCTGG 
TTACCAAAGT 
TGAAGGTGGC 
ACATCTCTGT 
CTGCCTTGTT 
ACGCCTTTTT 
TACCCGTCCA 
TGACAGACAA 
TGTATGCCTG 
AATTACTGGA 
CCACCATAGC 
CACeOGTGGC 
TGCCCTTCTC 
TTCTCATAGA 
TGCTTTTAGC 
AGAAATTGCA 
AGAGGAGTCC 
CGGTTCTGCA 
TCCTGGCTTG 
ATGGATTTTC 
TGGATAGGAC 
TGGGAAGTGG 
GGGTGGTGGC 
GAAATGTGAG 
CAGTCCGOGT 
AGATTGGGGA 
GOGCTGTCTA 
CCCACGTGGG 
TCGTCCTGGT 
AAGATGGAGA 
ATGCAAAACT 
ATGCAGCAAT 
TAATCGGGTA 
CTGCCTTCAG 
GGCCCCAGGG 
ATGTGTACCA 
AAGGCTTCGT 



51 

I 

GGSSGEDDPL 
EDfcPTVEAPG 
AAPCPALRPL 



QLLSRLEEIA 
LSAKQLHTLS 
AAGDILALVP 




CATTCTGGGG 
GATGTTTATG 
GCGAGTTCAG 
GGAGAAATCT 
AAAAGCTGGA 
CATOGTGCTG 
ATTTAGTGTG 
CATCAAAGCA 
TAAAAGCCCC 
AAATGCCACC 
GAACCAGAAA 
ACCAGCCAAG 
CAGCATAAGC 
GAGGTGGCCA 
TGCTAAAGAC 
TCAAAGGGCA 
AAAGAGCTCC 
AGTCAATGGA 
AGAAAACATA 
CTGTGGCCTC 
GCGGGGCCTC 
CTCCGACCGT 
GAAGCACGTC 
GACCCACCAG 
GATTTGTGAA 
GATTCACAAC 
GGTGGAAGCC 
CCTCCTTTCT 
CAACTGGTGG 
CAACAGGACC 
GTGGGTGTAC 
CTTCACCAAG 



60 
120 
180 
240 
300 
360 
420
300
420
540
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1380
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100
2340
2400 
2460 
2520 
2580 
2640
ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT
CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA 
ATGGACGAGC TGGATGTGAG GCTGCOGTTT CACGCAGAGA 
ATGGTGGTGT TTATTCTCGT GATCTTGOCT GCTGTGTTTC 
GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT 
CTCAAGAAGG TGGAGAATGT CAGCOGGTCA CCCTGGTTCA 
CAGGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA 
TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG 
CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG 
TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG 
CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA 
CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC 
AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT 
AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC 
ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG 
TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC 
ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG 
GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG 
TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG 
GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG 
CTCAACACAG TTCTCAACTG CGATCAOGTC CTGGTTATGG 
TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG 
GCAGAAGTCA GATTGTAG 

Seq ID NO: 513 Protein sequence
Protein Accession #: Eos sequence






MVGEGPYLIS 
TPVMVKGYRQ 
TRVLMDIVAN 
ALAWAINYRT 
PATIPILMVF 
TMNEFLTCIR 
TLSCHILIiRR 
PSYITQPEDP 
GATGPEEQSD 
KDESRRLLTW 
TIAYVSQQAW 
NLSGGQRQRI 
LQFLESCDEV 



MCEVGAVLAD 
PMSFPDTTPT 
ASLAVGPFIL 
SKGLSLSYII 
PSGGEITFRD 
SGTIFIDEVD 
MRDTIMKLPE 
VQNTIKDAFK 
AEVRL 



11
I 

DLDQRGRRRS 
RLTVDTLPPL 
IIiCIIMAAIG 
AIRUCVALST 
CAAYAFFILG 
LIKMYAWEKS 
KLTAPVAFSV 
DTVLLLANAT 
SLKSVLHSIS 
PQEVDRTQRA 
IFHGNVRENI 
SLARAVYSDR 
ILLEDGEICE 
DAGIIGYLLS 
IGQHVYQWVY 
GRLKNRFSKD 
LRIFHRGVQB 
QLSGLLQVCV 
YQMRYRDNTP 
ICILSLEDLR 
KLQAEVTENG 
GCTVLTIAHR 



21 
I 

FAERYDPSLK 
STYDSSDTNA 
PTVLIHQILQ 
LVFENLVSFK 
PTALIGISVY 
FTNTIQDIRR 
IAMFKVMKFS 
LTWEHEASRK 
PWRKLCRYP 
AKYLGKILGI 
LPGEKYDHQR 
QLYLLDDPLS 
KGTHKEXiMEE 
LPTVFIiPIiLM 
TASMVFMLVF 
MDELDVRLPF 
LKKVENVSRS 
RTGTETQAKF 
LVLDSLNLNI 
TKLTVIPQDP 
ENFSVGERQL 
LNTVLNCDHV 



31 
I 

TMIPVRPCAR 
KRFRVLWDEE 
QTERTSGKVW 
TLTHISVGEV 
VIFIPVQMFM 
RERKLLEKAG 
IAILPPSZKA 
STPKKLQNQK 
EAQLLAWRWP 
CGNVGSGKSS 
YQHTVRVCGL 
AVDAHVGKHV 
RGRYAKLIHN 
IGSAAPSNWW 
GVTKGFVFTK 
BAENPLQQFP 
PWPTHITSSM 
TSVELIiREYI 
QSGQTVGIVG 
VLFVGTVRYN 
LCVARALLRN 
LVMENGKVIE 



TTGATAAGAT 
TQAACCGTTT 
ACTTTCTGCA 
CTGCTGTCCT 
TCCACAGAGG 
CCCACATCAC 
GCTGCATCAC 
GACTGCTCCA 
AGCTGCTCAG 
GGACCTGTCC 
GATACAGAGA 
AGACAGTCGG 
TTCGTCTGGT 
TCAGCTTGGA 
TAGGTACAGT 
AGGTTCTGGA 
CAGAAGTCAC 
CCCGAGCTCT 
ACTCCAAGAC 
TGCTGACCAT 
AAAATGGGAA 
CATTTGCGAT 



41 
I 

LAPNPVDDAG 
VARVGPERA8 
VGIGLCIALP 
LNILSSDSYS 
AKLNSAFRRS 
FVQSGNSALA 
MAEANVSLRR 
RHLCKKQRSE 
AVFVGRIIRG 
LLAALLGQMQ 
QKDLSNLPYG 
FEECIKKTLR 
LRGLQFKDPE 
liGLWLDKGSR 
TTLMASSSLH 
MWFILVILA 
QGLGXIHAYG 
STCVPECTHP 
RTGSGKSSLG 
LDPFESHTDE 
SKIILLDEAT 
PDKPEVLAEK 



CTTAAAGAGC 
TTCCAAGGAT 
GCAGTTTTTT 
TTTAGTCGTG 
AGTCCAGGAG 
CTCCTCCATG 
CTATACTTCA 
AGTGTGTGTG 
GGAATACATT 
CAAGGACTGG 
CAACACCCCC 
GATTGTTGGA 
GGAGCCAGCC 
AGACCTCAGA 
AAGGTACAAC 
GAGAACATTC 
AGAAAATGGA 
TCTCCGTAAT 
TGACACCCTG 
CGCCCACCGC 
GGTGATTGAG 
GTTACTAGCA 



2700 
2760 
2820
2880
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



Seq ID NO: 514 DNA sequence
Nucleic Acid Accession #: Z31560
Coding sequence: 1-966



51 
I 

LLSFATFSWL 
LSHWWKFQR 
ATEFTKVFPW 
LFEAALPCPL 
AILVTDKRVQ 
PIVSTIAIVL 
MKKILIDKSP 
AYSERSPPAX 
YRPHGFSAKD 
LQKGWAVNG 
DLTEIGERGL 
GKTWLVTHQ 
HLYNAAMVEA 
MTCGPQGNRT 
DTVPDKILKS 
AVFPAVLLW 
KKESCITYTS 
LKVGTCPKDW 
MALFRLVBPA 
MLWQVLERTF 
ASMDSKTDTL 
PDSAFAMLLA 



51 

I 

CCCGCAGCAA 
CCAGAAAAAC 
CGGGCAGCGG 
GCGCCTGGGC 
GGCTAAGCGG 
GCGGAAAACC 
CCCCGGCGGC 
GAACCAGCGC 
GATGCAGGAC 
GATGCAGCCC 
GCAGACCTAC 
TGGCATGGCT 
TGTGGTTACC 
GATCAGCATG 
CATGTCCCAG 
CCTCTCACAC 
ACGAGGGAAA 
TACGCTCAAA 



Seq ID NO: 515 Protein sequence 
Protein Accession #: CAA83435

1 11 21 31 41 51

| | | | | |

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT

KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180



QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240

LGSMGSVVKS EASSSPPVVT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300
HYQSGPVPGT AINGTLPLSH M

Seq ID NO: 516 DNA sequence
Nucleic Acid Accession #: U91618
Coding sequence: 29..541

1 11 21 31 41 51

| | | | | |

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 

TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420

TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 

GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG 540 

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660 

ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 

Seq ID NO: 517 Protein sequence 
Protein Accession #: AAB50564 

1 11 21 31 41 51

| | | | | |

MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60

VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 120
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY

Seq ID NO: 518 DNA sequence 

Nucleic Acid Accession #: NM_006536.2

Coding sequence: 109..2940

1 11 21 31 41 51

| | | | | |

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 



CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA
ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC 
CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA 
ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT 
CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT 
GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT 
AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA 
TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC 
TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC 
CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT 
TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC 
TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT 
TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT 
TACCTAGGAA A 

Seq ID NO: 519 Protein sequence
Protein Accession #: NP_006527.1






MTQRSIAGPI 
IKEMITEASP 
GDDPYTLQYR 
KPFYINGQNQ 
MFMQSLSSW 
TPSLVQAGDK 
-RAQLHQINSN 
GDDKLLGNCL 
SRISSGTGDI 
FDPDGRKYYT 
AVPPATVEAP 
AGADVIKNDG 
IQMNAPRKSV 
LTLSWTAPGE 
PQISTNGPEH 
LILKGVLTAM 



11 

I 

CNLKFVTLLV 
YLFNATKRRV 
GCGKEGKYIH 
IKVTRCSSDI 
EFCNASTHNQ 
WCLVLDVSS 
DDRKLLVSYL 
PTVLSSGSTI 
PQQHIQLEST 
NNPITNLTFR 
VERDSLHPPH 
IYSRYFFSFA 



DFDQGQATSY 
QPNGETHESH 
GLIGIICLII 



21 

I 

ALSSELPFLG 
FPRNIKIIjIP 
FTPUFLLNDN 
TGIFVCEKGP 
EAPNLQNQMC 
KMAEADRLLQ 
PTTVSAKTDI 
HSIALGSSAA 
GENVKPHHQL 
TASIiWIPGTA 
PVMIYAKVKQ 
ANGRYSLKVH 
PSRVSSGGSF 
EIRMSKSLQM 
RIYVAIRAMD 
WTHHTLSRK 



31 
I 

AGVQLQDNGY 
ATWKANNNSK 
LTAGYGSRGR 
CPQENCIISK 
SLRSAWDVIT 
LQQAAEFYLM 
SICSGLKKGP 
PNXiEELSRLT 
KNTVTVDNTV 
KPGHWTYTLN 
GFYPILNATV 
VNHSPSISTP 
SVLGVPAGPH 
IQDDFNNAIL 
RNSLQSAVSN 
KRADKKENGT 



ATGGAACAAA 
ATGGCCTTCG 
ATGCATTGAG 
TTGGGGGTAG 
TTAAAGTAAT 
TTGTTTTATT 
TAACTGTCTG 
TGTGCAGTAC 
TAATGCAAAG 
GCTAATGCTC 
CTTTGTCTCT 
TCTACTCCCA 
TATAGCCCCN 



41 
I 

NGLLIAINPQ 
IKQESYEKAN 
VFVHEWAHLR 
LFKEGCTFIY 
DSADFHHSFP 
QIVEIHTFVG 
EWEKLNGKA 
GGLKPPVPDI 
GNDTMFLVTW 
NTHHSLQALK 
TATVEPETGD 
AHS IPG SHAM 
PDVFPPCKII 
VNTSKRNPQQ 
IAQAPLFIPP 
KLIt 



ATTATTATAA 
ACTACAAAAA 
TTTTTGTACA 
ATTAGAAAAC 
GTCTTTAAAG 
GAGGTGGAAA 
TGTGAAGCAA 
AGGTTGCTTG 
CTCTTTACCT 
AGAGATCTTT 
TCATACCGGT 
TCAAAGCAGC 
TATAATGCCT 



51 
I 

VPENQNLISN 
VIVTDWYGAH 
WGVFDEYNND 
NSTQNATASI 
MNGTELPPPP 
IASFDSKGEI 
YQSVMILVTS 
SNSNSMIDAP 
QASGPPEIIL 
VTVTSRASNS 
PVTLRLLDDG 
YVPGYTANGN 
DLEAVKVEEE 
AGIREIPTFS 
NSDPVPARDY 



Seq ID NO: 520 DNA sequence 

Nucleic Acid Accession #: NM_000228.

Coding sequence: 82..3600



GCTTTCAGGC 
GGATCACCCC 
CTCCTGCATG 
CTTGTTGGGA 
ACCTACTGCA 
CCTCACAACT 
TGGTGGCAGT 
TTCCAGCTTC 
GAGCGCTCCT 
ACCTCCACCT 
CAGTCCCTGC 
ATGGATTTAG 
ATCACAAACT 
CCTCCCAGCG 
GGCCATGCTG 
CAGGTCCACG 
GCACCCTTCT 
CAAAGGTGCG 
GCCAGCCAGG 
AACTGTGAGC 
GAGACCTGCA 
CCAGTGACCG 
AAGCCGGGCT 
AACATCCTGG 
CTGCCCAACG 
AGTGGCCAGG 
CAACCAGTTC 
GCAGCCATCC 
TGTGACTGTG 
CTCTG CCGCC 
CGCTACCCGG 
GAGCAGGCCC 
GGGCTGGAGG 
ATCCGAGCAG 
GCCATCCTCT 
GAGACGTTGT 
ACTATGTATC 
GCCTTCCGGA 
GACAGCTCGC 
CGGCAGGCGG 
ATGTCTTCGT 
ATGGCTTGCA 
TGTGGCTCCC 
CAGGTGGCTG 



11 

I 

GATCTGGAGA 
ATTGGCTGAA 
CCCAACAAGC 
GGACCCGGTT 
CCCAGTATGG 
ACTACAGTCA 
CCCAGAATGA 
AAGAAGTCAT 
CAGACTTCGG 
TCCCTCGGGT 
CTCAGAGGCC 
TGTCTGGGAT 
TGAGAGTCAA 
CCTACTATGC 
ATCGCTGCGC 
ATGTCTGTGT 
ACAACAACCG 
ACTGCAATGG 
GGGCATATGG 
GGTGTCAGCT 
TCTCCTGCGA 
GGCAGTGTGT 
TCACTGGACT 
GGTCCCGGAG 
TGGTGGGTCC 
GCTGTGAACC 
ACAGGGCAGT 
GCCAGTGTCC 
ATTTCCGGGG 
CTGGCTTGAC 
TGTGCGTGGC 
TGCGCTTTGG 
ACOGTGGCCT 
TTCTCAGCAG 
CCCTCAGGCG 
CCCTTCCGAG 
AGAGGAAGAG 
TGCTGAGCAC 
GCCTTTTGGA 
GAGGAGGAGG 
TGCCTGACCT 
CCCCAATATC 
GCTGCAGGGG 
AGCAGCTGCG 



21 
I 

AAGAACGGCA 
GATGAGACCA 
CTGCTCCCGT 
TCTCCGAGCT 
CGAGTGGCAG 
CCGAGTAGAG 
TGTGAACCCT 
GATGGAGTTC 
TAAGACCTGG 
CCGCCAGGGT 
TAATGCACGC 
TCCAGCAACT 
T7TCACCAGG 
TGTGTCCCAG 
ACCCAAGCCT 
CTGCCAGCAC 
GCCCTGGAGA 
GCACTCAGAG 
AGGTGTGTGT 
GCACTATTTC 
GTGTGATCCG 
GTGCAAGGAG 
CACCTACGCC 
GGACATGCCG 
CAAATGTGAC 
GTGTGCCTGC 
GCCCTGTCGG 
AGACCX3GACC 
AACAGAGGGC 
OGGGCCCCGC 
CTGCCACCCT 
TAGACTCCGC 
GGCCTCCCGG 
CCCCGCAGTC 
AACTCTCCAG 
AGACCTGGAG 
GGAGCAGTTT 
AGCCTACGAG 
CCAGCTCAGG 
AGGCACCGGC 
GACACCCACC 
ATGCCCTGGT 
TGTCCTTCCC 
GGGCTTCAAT 



31 
I 

GAACACACAG 
TTCTTCCTCT 
GGGGCCTGCT 
TCATCTACCT 
ATGAAATGCT 
AATGTGGCTT 
GTCTCTCTGC 
CAGGGGCCCA 
CGAGTGTACC 
CGGCCTCAGA 
CTAAATGGGG 
CAAAGTCAAA 
CTGGCCCCTG 
CTCCGTCTGC 
GGGGCCTCTG 
AACACTGCCG 
CCGGCGGAGG 
ACATGTCACT 
GACAATTGCC 
CGGAACCGGC 
GATGGGGCAG 
CATGTGCAGG 
AACCCGCAGG 
TGTGACGAGG 
CAGTGTGCTC 
GACCCGCACA 
GAAGGCTTTG 
TATGGAGACG 
CCGGGCTGCG 
TGTGACCAGT 
TGCTTCCAGA 
AATGCCACCG 
ATCCTAGATG 
ACAGAGCAGG 
GGCCTGCAGC 
AGTCTTGACA 
GAAAAAATAA 
CAGTCAGCCC 
GACAGCCGGA 
AGCCCCAAGC 
TTCAACAAGC 
GAGCTATGTC 
AGGGCCGGTG 
GCCCAGCTCC 



41 

1 

CAAGGAAAGG 
TGTGTTTTGC 
ATCCACCTGT 
GTGGACTGAC 
GCAAGTGTGA 
CATCCTCCGG 
AGCTGGACCT 
TGCCCGCCGG 
AGTACCTGGC 
GCTGGCAGGA 
GGAAGGTCCA 
AAATTCAAGA 
TGCCCCAAAG 
AGGGGAGCTG 
CAGGCCCCTC 
GCCCAAATTG 
GCCAGGACGC 
TTGACCCCGC 
GGGACCACAC 
GCCCGGGAGC 
TGCGAGGGGC 
GAGAGCGCTG 
GCTGCCACCG 
AGAGTGGGCG 
CCTACCACTG 
ACTCCCCTCA 
GTGGCCTGAT 
TGGCCACAGG 
ACAAGGCATC 
GCCAGCGAGG 
CCTATGATGC 
CCAGCCTGTG 
CAAAGAGTAA 
AGGTGGCTCA 
TGGATCTGCC 
GAAGCTTCAA 
GCAGTGCTGA 
AGGCTGCTCA 
GAGAGGCAGA 
TTGTGGCCCT 
TCTGTGGCAA 
CCCAAGACAA 
GGGCCTTCTT 
AGCGGACCAG 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



51 
I 

TCCTTTCTGG 
CCTGCCTGGC 
TGGGGACCTG 
CAAGCCTGAG 
CTCCAGGCAG 
CCCCATGCGC 
GGACAGGAGA 
CATGCTGATT 
TGCCGACTGC 
TGTTCGGTGC 
ACTTAACCTT 
GGTGGGGGAG 
GGGCTACCAC 
CTTCTGTCAC 
CACCGCTGTG 
TGAGCGCTGT 
CCATGAATGC 
TGTGTTTGCC 
CGAAGGCAAG 
TTCCATTCAG 
TCCCTGTGAC 
TGACCTATGC 
CTGTGACTGC 
CTGCCTTTGT 
GAAGCTGGCC 
GCCCACAGTG
GTGCAGCGCT 
ATGCCGAGCC 
AGGCCGCTGC 
CTACTGCAAT 
GGACCTCCGG 
GTCAGGGCCT 
GATTGAGCAG 
GGTGGCCAGT 
CCTGGAGGAG 
TGGTCTCCTT 
TCCTTCAGGA 
GCAGGTCTCC 
GAGGCTGGTG 
GAGGCTGGAG 
CTCCAGGCAG 
TGGCACAGCC 
GATGGCGGGG 
GCAGATGATT 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 




AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 

CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 

GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 

CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3540

GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3600 

TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3660 

GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 

GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3840 

CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3900 

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3960 
AAAATCTTTG G 

Seq ID NO: 521 Protein sequence 
Protein Accession #: NP_000219.1

1 11 21 31 41 51

| | | | | |

MRPPPLLCFA LPGLLHAQQA CSRGACYPPV GDLLVGRTRP LRASSTCGLT KPETYCTQYG 60 

EWQMKCCKCD SRQPHNYYSH RVENVASSSG PMRWWQSQND VNPVSLQLDL DRRFQLQEVM 120 

NEFQGPMPAG MLIERSSDFG KTWRVYQYLA ADCTSTPPRV RQGRPQSWQD VRCQSLPQRP 180 

NARLNGGKVQ LNLMDLVSGI PATQSQKIQE VGEITNLRVN PTRLAPVPQR GYKPPSAYYA 240 

VSQLRLQGSC FCHGHADRCA PKPGASAGPS TAVQVHDVCV CQHNTAGPNC ERCAPFYNNR 300 

PWRPAEGQDA HECQRCDCNG HSBTCHPDPA VFAASOGAYG GVCDNCRDHT EGKNCEROQL 360 

HYFRNRRPGA SIQETCISCE CDPDGAVPGA PCDPVTGQCV CKEHVQGERC DLCKPGFTGL 420 

TYANPQGCHR CDCNILGSRR DMPCDEESGR CLCLPNWGP KCDQCAPYHW KLASGQGCEP 480 

CACDPHNSPQ PTVQPVHRAV PCREGFGGLM CSAAAIRQCP DRTYGDVATG CRACDCDPRG 540 

TEGPGCDKA6 GRCIiCRPGLT GPRCDQCQRG YCNRYPVCVA CHPCFQTYDA DLREQALRFG 600 

RLRNATASLW SGPGLEORGL ASRILDAKSK IEQIRAVLSS PAVTEQEVAQ VASAILSLRR 660 

TLQGLQLDLP LEEETLSLPR DLESLDRSFN GLLTMYQRKS BQFEKISSAD PSGAFRMLST 720 

AYEQSAQAAQ QVSDSSRLLD QLRDSRREAE RLVRQAGGGG GTGSPKLVAL RLEMSSLPDL 780 

TPTFNKLCGN SRQMACTPIS CPGELCPQDN GTACGSRCRG VLPRAGGAFL MAGQVAEQLR 840 

GFNAQLQRTR QMIRAAEESA SQIQSSAQRL ETQVSASRSQ MEEDVRRTRL LIQQVRDFLT 900 

DPDTDAATIQ EVSEAVLALW LPTDSATVLQ KMNEIQAIAA RLPNVDLVLS QTKQDIARAR 960

RLQAEAEEAR SRAHAVEGQV EDVVGNLRQG TVALQEAQDT MQGTSRSLRL IQDRVAEVQQ 1020

VLRPAEKLVT SMTKQLGDFW TRMEELRHQA RQQGAEAVQA QQLAEGASEQ ALSAQEGFER 1080

IKQKYAELKD RLGQSSMLGE QGARIQSVKT EAEELFGETM EMMDRMKDME LELLRGSQAI 1140 
MLRSADLTGL EKRVEQIRDH INGRVLYYAT CK 

Seq ID NO: 522 DNA sequence

Nucleic Acid Accession #: NM_001944.1

Coding sequence: 84..3083

1 11 21 31 41 51

| | | | | |

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTOGAT ACCGAGTTCA GTCAACOOCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860
GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC
CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG 
TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGQ 
QTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG 
GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA 
GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA 
TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT 
GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC 
CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG 
GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG 
TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCA3GAAGCA 
TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC 
GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT 
TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA 
CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA 
CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT 
CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT 
TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA 
CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCGATT 
CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC 
ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA 
ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG 
TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG 
TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 523 Protein sequence 
Protein Accession #: NP_001935.1






MMGLFPRTTG 
GEDNSKRNPI 
PSFLITCRAL 
ATDADEPNHL 
DKDGEGLSTQ 
WLAVYFFTSG 
SRYRVQSTPV 
SNVKYVMGRN 
VYVRVPDFND 
ITTLNATSAL 
TTSPGTRYGR 
GSBGTIHOWG 
MTTKLGAATE 
DGAISMNFLD 
LDDSFLDSLG 
LSGSQGASAL 
VTERVICPIS 



11 

I 

AIAIFVWIL 
AKITSDYQAT 
NAQGLDVEKP 
NSKIAFKIVS 
CECNIKVKDV 
NEGKWFEIQT 
TIQVINVREG 
DGGYLMIDSX 
NCPTAVLEKD 
LRAQEQIPPG 
PHSGRLGPAA 
IBGAHPEDKE 
SGGAAGFATG 
SYFSQKAFAC 
PKFXKLAEIS 
SASGSVQPAV 
SVPGNLAGPT 



21 
I 

VHGELRIETK 
QKITYRISGV 
LILTVKILDI 
QEPAGTPMFIi 
NDNFPMFRDS 
DPRTNEGILK 
IAFRPASKTF 
TAEIKFVKNM 
AVCSSSPSW 
VYHISLVLTD 
IGLIiLLGLItl) 
ITNICVPPVT 
TVSGAASGFG 
AEEDDGQEAN 
LGVDGEGKEV 
SIPDPLQHGN 
QLRGSHTMLC 



31 

I . 

GQYDEEEMTM 
GIDQPPFGIF 
NDNPPVFSQQ 
LSRNTGEVRT 
QYSARIEENI 
WKALDYEQL 
TVQKGISSKK 
NRDSTFIVNX 
VSARTLNNRY 
SQNNRCEMPR 
LLLAPLLLLT 
ANGADFMESS 
AATGVGICSS 
DCLI/IYDNBG 
QPPSKDSGYG 
YLVTETYSAS 
TEDPCSRLI 



41 

1 

QQAKRRQKRE 
WDKNTGDIN 
IFMGEIEENS 
LTNSLDREQA 
LSSBLLRFQV 
QSVKLSIAVK 
LVDYILGTYQ 
TITAEVLAID 
TGPYTFALED 
SLTLEVCOCD 
CDCGAGSTGG 
EVCTNTYARG 
GQSGTMRTRH 
ADATGSPVGS 
IESCGHPIEV 
GSLVQPSTAG 



AGGCCGCACT 
CTGCTGCTGT 
GGAGTGACAG 
GGAATTGAAG 
ACAGCCAATG 
GGCACAGCGG 
GAATCTGGAG 
GGAGCAGCCA 
CATTCCACTG 
GACTCCTACT 
AATGACTGCT 
TCCGTGGGTT 
GGACCCAAAT 
GTTCAGCCAC 
GTCCAGCAGA 
TTGTCCGCCT 
AACTATTTAG 
GGCTTTGATC 
TCCAGTGTTC 
TGTACAGAGG 
CCAAATCTGG 
GCTAATAATT 
TTTGGGTTCA 
TTGTAGTAAA 



51 
I 

WKFAKPCRE 
ITAIVDREET 
ASNSLVMILN 
SSYRLWSGA 
TDLDEEYTDN 
NKAEFHQSVI 
AIDED TNKAA 
EYTGKTSTGT 
QPVKLPAVWS 
NRGICGTSYP 
VTGGFIPVPD 
TAVEGTSGME 
STGGTNKDYA 
VGCCSFIADD 
QQTGFVKCQT 
FDPLLTQNVI 



Seq ID NO: 524 DNA sequence

Nucleic Acid Accession #: XM_058069.2 

Coding sequence: 1..1413 



1 

I 

ATGAAGTTTC 
AGCTCTACAA 
TATGGCCTTG 
AAGGAAAAAA 
ACATCTACCC 
AGGGAAATGC 
TACACACCTG 
TGGAGTAATG 
GTGGTTTTTG 
CTAGCCCATG 
GAATTCTGGA 
GGCCATTCCT 
AAATATGTTG 
CTGTATGGAG 
CTCTGTGACC 
TTCAAAGACA 
ATTTCTTCCT 
AGAAATCAAG 
GAGCCAAATT 
GATGCAGCTG 
TGGAGGTATG 
AACTTCCAAG 
TATTTCTTCC 
ACACTGAAAA 



11 
I 

TTCTAATACT 
GCCTGGAAAA 
AGATAAACAA 
TCCAAGAAAT 
TGGAGATGAT 
CAGGGGGGCC 
ACATGAACCG 
TTACCCCCTT 
CCCGTGGAGC 
CTTTTGGACC 
CTACACATTC 
TAGGTCTTGG 
ACATCAACAC 
ACCCAAAAGA 
CCAATTTGAG 
GGTTCTTCTG 
TATGGCCAAC 
TTTTTCTTTT 
ATCCCAAGAG 
TTTTTAACCC 
ATGAAAGGAG 
GAATCGGGCC 
AAGGATCTAA 
GCAATAGCTG 



21 

I 

GCTCCTGCAG 
AAATAATGTG 
ACTrCCAGTG 
GCAGCACTTC 
GCACGCACCT 
CGTATGGAGG 
TGAGGATGTT 
GAAATTCAGC 
TCATGGAGAC 
TGGATCTGGC 
AGGAGGCACA 
CCATTCTAGT 
ATTTCGCCTC 
GAACCAACGC 
TTTTGATGCT 
GCTGAAGGTT 
CTTGCCATCT 
TAAAGATGAC 
CATACATTCT 
ACGTTTTTAT 
ACAGATGATG 
TAAAATTGAT 
CCAATTTGAA 
GTTTGGTTGT 



31 

I 

GCCACTGCTT 
CTATTTGGTG 
ACAAAAATGA 
TTGGGTCTGA 
CGATGTGGAG 
AAACATTATA 
GACTACGCAA 
AAGATTAACA 
TTCCATGCTT 
ATTGGAGGGG 
AACTTGTTCC 
GATCCAAAGG 
TCTGCTGATG 
TTGCCAAATC 
GTCACTACCG 
TCTGAGAGAC 
GGCATTGAAG 
AAATACTGGT 
TTTGGTTTTC 
AGGACCTACT 
GACCCTGGTT 
GCAGTCTTCT 
TATGACTTCC 
TGA 



41 
I 

CTGGAGCTCT 
AAAGATACTT 
AATATAGTGG 
AAGTGACCGG 
TCCCCGATGT 
TCACCTACAG 
TCCGGAAAGC 
CAGGCATGGC 
TTGATGGCAA 
ATGCACATTT 
TCACTGCTGT 
CCGTAATGTT 
ACATACGTGG 
CTGACAATTC 
TGGGAAATAA 
CAAAGACCAG 
CTGCTTATGA 
TAATTAGCAA 
CTAACTTTGT 
TCTTTGTAGA 
ATCCCAAACT 
ACTCTAAAAA 
TACTCCAACG 



Seq ID NO: 525 Protein sequence
Protein Accession #: P39900
1920
1980 
2040 
2100 
2160 
2220 
2280
2340
2400
2460
2520
2580
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



60 
120 
180 
240 
300 
360 
420 
480 
540
600 
660 
720 
780 
840 
900 
960 



51 
t 

TCCCCTGAAC 
AGAAAAATTT 
AAACTTAATG 
GCAACTGGAC 
CCATCATTTC 
AATCAATAAT 

TTTCCAAGTA 
TGACATTTTG 
AGGTGGAATC 
CGATGAGGAC 
TCACGAGATT 
CCCCACCTAC 
CATTCAGTCC 
AGAACCAGCT 
GATCTTTTTC 
TGTTAATTTA 
AATTGAAGCC 
TTTAAGACCA 
GAAAAAAATT 
TAACCAGTAT 
GATTACCAAG 
CAAATACTAC 
TATCACCAAA 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



1 11 21 31 41 51 

| | | | | |

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM
KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN



60 
120 




YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL VVFARGAHGD FHAFDGKGGI 180

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GRSLGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLPKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNPVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

Seq ID NO: 526 DNA sequence

Nucleic Acid Accession #: NM_024423.1

Coding sequence: 64..2590

1 11 21 31 41 51 

I I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATCCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 

TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAATCAG AATGAAGACC GCATGCCATC 2640

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 

CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 

ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 2820 

TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 

TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2940 

AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3000 

TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3060 

GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 

ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 

AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 

TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 

GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 

GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440
CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500
CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560
TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620
AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680
CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740
TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800
TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920
GACTAGAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980
TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040
TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100
GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160
TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220
GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280
TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340
CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400
TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460
TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520
AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580
TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640
AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700
TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5760
TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820
CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880
AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940
TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000
TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060
GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 6120
TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180
TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240
GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300
ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 6360
CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420
GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480
GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540
AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TTCCAGTAAC TTCTATTGTA 6600
ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660
ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720
GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780
ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840
ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900
ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960
ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020
CATATATATA ATCCCGAAAC ATG

Seq ID NO: 527 Protein sequence
Protein Accession #: NP_077741.1

1 11 21 31 41 51

I I I I I I

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60
ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120
KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180
EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240
PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300
TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360
EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420
KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480
KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540
KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600
EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660
TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720
KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG

Seq ID NO: 528 DNA sequence

Nucleic Acid Accession #: NM_001941.2

Coding sequence: 64..2754

1 11 21 31 41 51

I I I I I I

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180
CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240
TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 300
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 360
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480
ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600
AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720
GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780
CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840
ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960



AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 1080 

ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580

GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 

CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 

AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 

TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 

ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 

ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 

GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 

TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 

GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 

GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 

TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 

CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800

GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4920

TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980

ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040

CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 5280 

ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460

TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820

TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 

AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000

ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060

GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240
GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360
AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 6420
AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480
CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540
ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600
TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660
TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720
GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780
ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840
GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900
TATATATAAT CCCGAAACAT G

Seq ID NO: 529 Protein sequence
Protein Accession #: NP_001932.1

1 11 21 31 41 51

I I I I I I

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60
ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120
KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180
EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240
PVFTEAIYNF EVLESSRPGT TVGVVCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300
TGVITTVSHY LDREVVDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360
EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSVV 420
KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480
KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540
KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YVVICKPKMG YTDILAVDPD 600
EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660
TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720
KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 840
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR

Seq ID NO: 530 DNA sequence

Nucleic Acid Accession #: NM_016583.2

Coding sequence: 72..842

1 11 21 31 41 51

I I I I I I

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60
TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 120
CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180
ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240
ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300
TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360
CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420
AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480
TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540
TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600
TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 660
CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720
AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780
CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900
GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960
TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA

Seq ID NO: 531 Protein sequence
Protein Accession #: NP_057667.1

1 11 21 31 41 51

I I I I I I

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 60
SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 120
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 180
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 240
DIVNMLIHGL QFVIKV

Seq ID NO: 532 DNA sequence

Nucleic Acid Accession #: NM_004363.

Coding sequence: 115..2223

1 11 21 31 41 51

I I I I I I

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 60
TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120
TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 180
TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 240
TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300
TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 360
GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420
CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 480




CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020

AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 533 Protein sequence 
Protein Accession #: NP_004354.1

1 11 21 31 41 51

I I I I I I

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI IYPNASLLIQ NIIQNDTGFY 120

TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300

AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360

QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 420 

SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN 480

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540

LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL 660 
ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 

Seq ID NO: 534 DNA sequence 

Nucleic Acid Accession #: NM_006952.1

Coding sequence: 11..793

1 11 21 31 41 51

I I I I I I

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 535 Protein sequence 
Protein Accession #: NP_008883.1

1 11 21 31 41 51

I I I I I I




MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120
FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240
LCWTFWVLLG TMFYWSRIEY

Seq ID NO: 536 DNA sequence
Nucleic Acid Accession #: NM_002638.1

Coding sequence: 120..473

1 11 21 31 41 51

I I I I I I

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60
GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120
TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180
AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240
TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300
CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360
TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420
TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 480
CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540
TGCTGCCCTT CCCCTTCCCA CACTGTCCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A









Seq ID NO: 537 Protein sequence
Protein Accession #: NP_002629.1

1 11 21 31 41 51

I I I I I I

MRASSFLIVV VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ

Seq ID NO: 538 DNA sequence

Nucleic Acid Accession #: NM_001793.2

Coding sequence: 71..2560

1 11 21 31 41 51

I I I I I I

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60
CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120
CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180
CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240
CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300
TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360
ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420
TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480
AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540
CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600
GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660
CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720
GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780
GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840
CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900
CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960
CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020
GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080
GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140
CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200
TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260
TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320
GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380
ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440
GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500
CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560
TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620
GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680
ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740
CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800
CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860
GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920
GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980
GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040
CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100
GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160
CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220
GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280
CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340
TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400
CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460
CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520
GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580
GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640
GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700
ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760
AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGOCTGCT GTGACTGACC 2880
TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940







TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 
TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 
GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATOGTGTAT ATGTACTAGA ACTTTTTTAT 
TAAAGAAACT TTTCCCAGAA AAAAA 






3000 
3060 
3120 
3180 



Seq ID NO: 539 Protein sequence
Protein Accession #: NP_001784.2



I 

MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFL 
GAVLALLFLL 
GLEARPBWL 
DYEGSGSDAA 



11 
I 

LLXiLQVCWLQ 
DDFTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEILD 
GDDGDHFTIT 
VEDVNEAPVF 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 
I 

CAASEPCRAV 
VQERRSLKER 
IPYSITGPGA 
ISIIVTDQND 
KPKDPHDLMF 
ANDNAPMFDP 
THPESNQGII* 
VPPSKWEVQ 
LDREDEQFVR 
VRQVItfJTDX 
LSDHGNKEQL 
RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYUl 



31 
I 

FREAEVTLEA 
NPLXIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
BGIPTGBPVC 
NNIYEVMVLA 
DLSPHTSFFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE
EWGSRFKKLA 



Seq ID NO: 540 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 1..672



ATGAGGCTCC 
CGGGGCTCCC 
AAGGGCGGGG 
CTGCTCGCCT 
GCGAGACAAC 
TGTCATGTTT 
ACAGAGCCAT 
AAGCAGTGCT 
CTCCTGGAAG 
TTAGAGGGGC 
AGCTGTGGTG 
AGCCTGTCTT 



11 
I 

AAAGACCCCG 
CCTACCGGCC 
AGGGGGCGCC 
TGCTGCTGGT 
GAGATCCAGA 
GTGAGAGAGA 
ACTGCGTTAT 
CCGCTGGTTG 
AGCCCATGCC 
CACCTATCAA 
GGCTGTGGCT 
GA 



21 
I 

ACAGGCCCCG 
AGACCCGGGG 
GCGCGCTGAC 
CGTGGCCCTA 
GGACTCCCAG 
AAACACTTTC 
AGCGGCCGTG 
TGCAGCGATG 
CTTCTTTTAC 
CTCATCAGTG 
GGCCATCCTC 



31 
I 

GCGGGTGGGA 
AGAGGCGCGC 
CCTCCCTGGG 
CCGCGGGTGT 
CGAACGGACG 
GAGTGCCAGA 
AAAATATTTC 
GAGAGACCCA 
CTCAAGTGTT 
TTCAAAGAAT 
CTGCTGCTGG 



41 
I 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTIiYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY 
HGHVETCPGP 
YGEEGGGEED 
NLKAANTDPT 
DMYGGGEDD 



41 

I 

GGCGCGCGCC 
GGAGGCTGCG 
CACCGCTGGG 
GGACAGACGC 
AGGGTGACAA 
ACCCAAGGAG 
CACGTTTTTT 
AGCCAGAGGA 
GTAAAATTCG 
ATGCTGGGAG 
CCTCCATTGC 



51 

! 

LGKVPMGCPG 
VAPISVPENG 
KPtiDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTLID 
WTAEVNEEGD 
WKGGFIliPVL 
QDYDITQLHR 
APPYDTLLVF 



51 
I 

CCGGGGCGGG 
AAGGTTCCAG 
GACGATGGCG 
CAACCTGACT 
TAGAGTGTGG 
GTGCAAATGG 
CATGGTTGCG 
GAAGCGGTTT 
CTACTGCAAT 
CATGGGTGAG 
AGCCGGCCTC 



Seq ID NO: 541 Protein sequence 
Protein Accession #: Eos sequence 



51 



1 11 21 31 41 

i ) I I I I 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 
LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 

Seq ID NO: 542 DNA sequence 

Nucleic Acid Accession #: XM_035292.2

Coding sequence: 53.. 1576 



GCTCGCTGGG 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATCGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 
CCTCTTOCTQ 
CATCATCCTC 
GTGGCTCCTC 



11 

I 

CCGCGGCTCC 
AAGCGGCGCG 
CTGGCCGCCA 
CAGCGGAACA 
GGCATCTTCG 
GTGGTGTGGG 
GGCACCACCA 
CTGCCCGCCT 
ATCGTGGCCC 
CCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATCAACC 
CTGGTGTACG 
TCGTCCGAGG 
ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCTTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCCGTCT 
AGCGGGCTGC 
CAGGGCATCT 



21 
I 

CGGGTGTCCC 
CGCTAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCCCAC 
CCGCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TQGTCTTCGC 
CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
CCGTGGCCGT 
TCGTGGGCCT 
TCTTCGTGGG 
TCCTCACCCC 
CCAAGGACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 



31 

I 

AGGCCCGGCC 
GCCGGCGGCC 
CGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTGGATCGAG 
CACCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 
GTCCTGCTTC 
GTCCCGGOAA 
CGTGCCGTCC 
CTTCTCCGTC 
CGGCATGATC 
GGCCCTGCCT 
GACACCCGTG 
CTTCGGGGTC 
CGTCCTGTGT 



41 

( 

GGTGCGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GCCATCATCG 
AAGGAGGCAG 
ATCGTGGGCG 
TACGCCTACA 
CTGCTCATCA 
CTCAAGCCGC 
CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT
GCCATCATCA 
TTCACCACCC 
AACTATCACC 
GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGCGCC 
GTGTTCTTCA 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 



51 

I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 
GCTCGCCGGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 
TTGCCGCCGC 
GAAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 




CCCCCAGGAQ ACATAGCCAG GAGGCCQAGT GGCTGCOGGA GGAGCATGC 






Seq ID NO: 543 Protein sequence
Protein Accession #: XP_035292.2 



MAGAGPKRRA 
GTIIGSGIFV 
LEVYGSLPAP 
LLTAVNCYSV 
GNIVLALYSG 
STEQMLSSEA 
SILSMIHPQL 
RKPELERPIK 
KPKWLLQGIF 



11 

I 

LAAPAAEEKE 
TPTGVLKEAG 
LKliWIELLII 
KAATRVQDAF 
LFAYGGWNYL
VAVDFGNYHL 
LTPVPSLVFT 
VNLALPVFFI 
STTVLCQKLM 



21 

t 

EAREKMLAAK 
SPGLALWWA 
RPSSQYIVAI* 
AAAKLLALAL 
NFVTEEMINP 
GVMSWIIPVF 
CVMTLLYAFS 
LACLFLIAVS 
QWPQET 



31 
I 

SADGSAPAGE 
AOGVFSIVGA 
VFATYLLKPL 
IILLGFVQIG 
YRNLPLAIII
VGLSCFGSVN 
KDIFSVINFF 
FWKTPVECGI 



Seq ID NO: 544 DNA sequence 

Nucleic Acid Accession #: NM_005268.1 

Coding sequence: 168.. 989 



TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTCGGTTTC 



11 
I 

AAGAATTCGC 
AAATTCAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG 
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGGCAA 
CCACCTGCCC 
CTTTTCTAGA 



21 
I 

GGCCGCGTCG 
TGCTTGCTGA 
CAGTAGCAGC 
GGGGTCAACA 
CGCGTGCTGG 
GACTGCAATA 
TCCCATGTGC 
GTCATGCACG 
AACAGTGGGC 
GTCTGCAGCC 
TTCTACCCCA 
ATAGTGGACT 
GCCACAGCTG 
AGATGCCACG 
CACGGTACCA 
GGCTCAGACA 
ATCTTGTGAG 
CATCTCTCAT 
GAGAGAGGAT 
CAGCTCGACG 
ATGGAAATAG 



31 
I 

ACACGGGCTT 
GTCCTATTGC 
TGACGCGTGG 
AGTACTCCAC 
TGTACCTGGT 
CTCGCCAGCC 
GCCTCTGGGC 
TGGCCTACCG 
GCCTCTACCT 
TAGTGTTCAA 
AATATATCCT 
GCTTCATCTC 
CCATCTGCAT 
AGTGCCTGGC 
CCTCTTCCTG 
GTCATCCTCC 
GGGCTGCCTG 
AGGTGCAACC 
TCAGACGCTC 
GCACTGGGCC 
TGAGGGCCAA 



41 
I 

GEGVTLQRNI 
LCYAELGTTI 
FPTCPVPEEA 
KGDVSNLDPN 
SLPIVTLVYV 
GSLFTSSRLF 
SFFNWLCVAL 
GFTIILSGXiP 



41 
I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTTCAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCCCTCA 
OCTGCTCAAC 
AGCAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



51 
I 

TLLNGVAIIV 
SKSGGDYAYM 
AKLVACLCVL 
FSPEGTKLDV 
LTNLAYFTTL 
FVGSREGHLP 
AIIGMIWLRH 
VYFFGVWWKN 



51 
I 

CTTCCCCGCT 
GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAGAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
GACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



1 
I 

CGGGCGAAGC 
CGCCCTGCTG 
CCCGGGGAGC 
CGGCGTGGGT 
GCCCTGCAAC 
TGOGTGTGAT 
CAATGCTCAG 
AAAGGCCAAA 
GCCCCTGGTG 
CACCAGTGCC 
ACTCOCCAGC 
TGAGCCTCCC 
ATTACTAAGA 
TAATAT 



Seq ID NO: 547 Protein sequence 
Protein Accession #: NP_002382.1



11 

I 

AGCGCGGGCA 
GCGCTCACCT 
GAGTGCGCTG 
TTCCGCGAGG 
TGGAAGAAGG 
GGGGGCACAG 
TGCCAGGAGA 
GCCAAGAAAG 
TCACATGGGG 
TTCTGTCTGC 
CCCACCCCTA 
CCAAAGCAAT 
AACACATCAA 



21 
I 

GCGAGATGCA 
COGOGGTCGC 
AGTGGGCCTG 
GCACCTGCGG 
AGTTTGGAGC 
GCACCAAAGT 
CCATCCGCGT 
GGAAGGGAAA 
CCTGGCCACG 
TCGTTAGCTT 
AGTGOOCAAA 
GTGAGTCCCA 
ATAAACTGAC 



31 
I 

GCACCGAGGC 
CAAAAAGAAA 
GGGGCCCTGC 
GGCCCAGACC 
CGACTGCAAG 
CCGCCAAGGC 
CACCAAGOCC 
GGACTAGACG 
CCCTCCCTCT 
TAATCAATCA 
GTGGGGAGGG 
GAGCCCGCTT 
TTTTT CCCOC 



41 
I 

TTCCTCCTCC 
GATAAGGTGA 
ACCCCCAGCA 
CAGCGCATCC 
TACAAGTTTG 
ACCCTGAAGA 
TGCACCCCCA 
CCAAGCCTGG 
CCCAGGCCCG 
TGCCCTGCCT 
ACAAGGGATT 
TTGTTCTTCC 
CAATAAAAGC 



31 



41 



51 



1 11 21 

I I I I ! i 

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 
RVTKPCTPKT KAKAKAKKGK GKD 



60
120 
180 
240 
300 
360 
420 
480



60 
120 
180 
240 
300
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



Seq ID NO: 545 Protein sequence 
Protein Accession #: NP_005259.1

1 11 21 31 41 51 

I I I I 1 I 

MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 
SNVCFDEPFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 

Seq ID NO: 546 DNA sequence 

Nucleic Acid Accession #: NM_002391.1

Coding sequence: 26.. 457 



60 
120 
180 
240 



51 

I 

TCACCCTCCT 
AGAAGGGCGG 
GCAAGGATTG 
GGTGCAGGGT 
AGAACTGGGG 
AGGCGCGCTA 
AGACCAAAGC 
ATGCCAAGGA 
AGATGTGACC 
TGTCCCTCTC 
CTGGGAAGCT 
CCACAATTCC 
TCTTCTTTTT 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 



60 
120 



Seq ID NO: 548 DNA sequence 




Nucleic Acid Accession #: NM_006783.1

Coding sequence: 1..786






i 
I 

ATGGATTGGG 
GGGAAGGTGT 
CAGGAAGTGT 
AAAAATGTGT 
CTGATCTTCG 
GAAACCACTC 
ATTAAAAAGC 
TTTTTCCGAA 
TACCACCTGC 
TTTATTTCTA 
ATTTGCATOC 
AGATCAAAGA 
CAGAATGAAA 
AGCTAA 



11 

I 

GGACGCTGCA 
GGATCACAGT 
GGGGTGACGA 
GCTATGACCA 
TCTCCACCCC 
GCAAGTTCAG 
ACAAGGTTCG 
TCATCTTTGA 
CCTGGGTGTT 
GGCCAACAGA 
TGCTTAACGT 
GAGCACAGAC 
TGAATGAGCT 



21 
I 

CACTTTCATC 
CATCTTTATT 
GCAAGAGGAC 
CTTTTTCCCG 
AGCGCTGCTG 
GCGAGGAGAG 
GATAGAGGGG
AGCAGCCTTT 
GAAATGTGGG 
GAAGACOGTG 
GGCAGAGTTG 
GCAAAAAAAT 
GATTTCAGAT 



31 

I 

GGGGGTGTCA 
TTCCGAGTCA 
TTCGTCTGCA 
GTGTCCCACA 
GTGGCCATGC 
AAGAGGAATG 
TCGCTGTGGT 
ATGTATGTGT 
ATTGACCCCT 
TTTACCATTT 
TGCTACCTGC 
CACCCCAATC 
AGTGGTCAAA 



41 
I 

ACAAACACTC 
TGATCCTAGT 
ACACACTGCA 
TCCGGCTGTG 
ATGTGGCCTA 
ATTTCAAAGA 
GGACGTACAC 
TTTACTTCCT 
GCCCCAACCT 
TTATGATTTC 
TGCTGAAAGT 
ATGCCCTAAA 
ATGCAATCAC 



51 
I 

CACCAGCATC 
GGTGGCTGCC 
ACCGGGATGC 
GGCCCTCCAG 
CTACAGGCAC 
CATAGAGGAC 
CAGCAGCATC 
TTACAATGGG 
TGTTGACTGC 
TGCGTCTGTG 
GTGTTTTAGG 
GGAGAGTAAG 
AGGTTTCCCA 



Seq ID NO: 549 Protein sequence 
Protein Accession #: NP_006774.1 

1 11 21 31 41 51 

I I I I I I 

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILWAA QEVWGDEQED FVCNTLQPGC 
KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED
IKKHKVRIEG SLWWTYTSSI PPRIIFEAAP MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 
FISRPTEKTV FTIFMISASV ICMLLKVAEL CYLLLKVCFR RfiKRAQTQKN HPNHALKESK 
QNEMNELISD SGQHAITGFP S 

Seq ID NO: 550 DNA sequence

Nucleic Acid Accession #: NM_002571.1

Coding sequence: 99.. 587 



1 
I 

CATCCCTCTG 
TCACCCTGGG 
AGGACCTGGA 
ACATCTCCCT 
CCACCCCCGA 
AGAAGAAGGT 
TGGCGAACGA 
AGGACACCAC 
AGGACGATGA 
GGTACTTGCT 
CCAGGAAGAC 
TTTCAAAGAA 
TCCTGCTGCA 
GCAGAGGTTA 



11 

I 

GCTCCAGAGC 
CGTGGCCCTG 
GCTCCCAAAG 
CATGGCGACA 
GGACAACCTG 
CCTTGGAGAG 
GGCCACGCTG 
CACCCCCATC 
GATCATGCAG 
GGACTTGAAA 
CAGACTCCCA 
TAACCACAGC 
CACCTGCACC 
TTAATAAACC 



21 

I 

TCAGAGCCAC 
GTCTGTGGTG 
TTGGCAGGGA 
CTGAAGGCCC 
GAGATCGTTC 
AAGACTGGGA 
CTCGATACTG 
CAGAGCATGA 
GGATTCATCA 
CAGATGGAAG 
CCCTTCCACA 
TCAGAAGACG 
ATTGCCATGG 
CTTGGAGCAT 



31 

I 

CCACAGCCGC 
TCCCGGCCAT 
CCTGGCACTC 
CTCTGAGGGT 
TGCACAGATG 
ATCCAAAGAA 
ACTACGACAA 
TGTGCCAGTA 
GGGCTTTCAG 
AGCCGTGCCG 
CCTCCAGAGC 
ATGACGTGGT 
GGAGGCTGCT 
G 



41 

I 

AGCCATGCTG 
GGACATCCCC 
CATGGCCATG 
CCACATCACC 
GGAGAACAAC 
GTTCAAGATC 
TTTCCTGTTT 
CCTGGCCAGA 
GCCCCTGCCC 
TTTCTAGCTC 
AGTGGGACTT 
CATCTGTGTC 
CCCTGGGGGC 



51 
I 

TGCCTCCTGC 
CAGACCAAGC 
GCGACCAACA 
TCACTGTTGC 
AGCTGTGTTG 
AACTATACGG 
CTCTGCCTAC 
GTCCTGGTGG 
AGGCACCTAT 
ACCTCCGCCT 
CCTCCTGCCC 
GCCATCCCCT 
AGAGTCTCTG 



Seq ID NO: 551 Protein sequence 
Protein Accession #: NP_002562.1

1 11 21 31 41 51 

I I I I I 1 

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHTTSLLPTP EDNLEIVLHR 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 552 DNA sequence 

Nucleic Acid Accession #: NM_006500.1

Coding sequence: 27.. 1967 



1 

I 

ACTTGCGTCT 
TCGCCGCCTG 
CGCCTGAGCT 
AGTCCCAAGG 
TCATCTTCCG 
TCAGCCTCCA 
GCATCTTCTT 
TCTACAAAGC 
GTAAGGAGCC 
TCATCTGGTA 
CGTCCCAGAC 
TGGTTAAAGA 
GGAACCACAT 
TGTGGCTGGA 
GTTTGGCTGA 
GGGAGGCAGA 
AGGAACACAG 
TGAGTGAACC 
CCCCTGAGAG 
ACCTCGAGTT 



11 
I 

CGCCCTCCGG 
CTGCTGCTGT 
GGTGGAGGTG 
CAACCTCAGC 
TGTGCGCCAG 
GGACAGAGGG 
GTGCCAGGGC 
TCCGGAGGAG 
TGAGGAGGTC 
CAAGAATGGC 
TGTGGAGTCG 
AGACAAAGAT 
GAAGGAGTCC 
AGTGGAGCCC 
TGGCAACCCT 
GGAAGAGACA 
TGGGCGCTAT 
ACAGGAACTA 
ACAGGAAGGC 
CCAGTGGCTG 



21 

I 

CCAAGCATGG 
CCTCGCGTCG 
GAAGTGGGCA 
CATGTCGACT 
GGCCAGGGCC 
GCTACTCTGG 
AAGCGCCCTC 
CCAAACATCC 
GCTACCTGTG 
CGGCCTCTGA 
AGTGGTTTGT 
GCCCAGTTTT 
AGGGAAGTCA 
GTGGGAATGC 
CCACCACACT 
ACCAAOGACA 
GAATGTCAGG 
CTGGTGAACT 
AGCAGCCTCA 
AGAGAAGAGA 




TGAAGGAAGG 
TCAGCATCAG 
ACGGGGTCCT 
CCTGGAACTT 
ATGTGTCTGA 
CCCTGACCTG 
CAGACCAGGT 



41 
I 

GCTGGTCTGC 
CGGAGAGGCT 
TCTGAAGTGC 
CCACAAGGAG 
TGGGGAGTAC 
AGTCACCCCC 
GTACCGCATC 
CCTGGGCATC 
CGGGTACCCC 
GAACCGGGTC 
GAGTATTCTG 
CAACTACCGG 
TTTCTACCCG 
GGACCGCGTG 
CAAGCAGAAC 
GGTGCTGGAG 
GGACACCATG 
CGTCCGAGTG 
TGAGGCAGAG 
GCTGGAAAGG 



51 

I 

GCCTTCTTGC 
GAGCAGCCTG 
GGCCTCTCCC 
AAGCGGACGC 
GAGCAGCGGC 
CAAGACGAGC 
CAGCTCCGCG 
CCTGTGAACA 
ATTCCTCAAG 
CACATTCAGT 
AAGGCACAGC 
CTGCCCAGTG 
ACAGAAAAAG 
GAAATCAGGT 
CCCAGCACCA 
CCTGCCCGGA 
ATATCGCTGC 
AGTCCCGCAG 
AGTAGCCAGG 
GGGCCTGTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 




TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA 
CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA 
GTGAAGCGTC AGGGCACCCC OGGCCCACCA TCTCCTGGAA 
AACAAGACCA AGATCCACAG OGAGTCCTGA GGACCCTGAA 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAAOGACCT 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC 
GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC 
CCGAGCGGGT AGOAGAGTTT CTTGCAGAAC GTGTTTTTTC 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT 
TCACAAAGTC AGGAOGAGAC CATCCTGGCT AACACGGTGA 
TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT 
CTGAAGCAGG AGAATGGTA? GAATCCAGGA GGTGGAGCTT 
CACTGCACTC CAGOCTGGGC AACACAGCGA GACTCCGTCT 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC 
GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC 
AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT 
TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA 
AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA 
CTACCCTACT TTTCAGCAGC AAAAOGTCCC GTATGACGCA 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA 



TCGCTQCGTG 
GGCCATTTTT 
TATGGTGTTG 
CGTCAACGGC 
TGTCCTCGTG 
GGGCAAAAAC 
CTCCAACACA 
CACCTCCACA 
GATTGTGTGC 
GGGCAAGCTG 
GACCGAACTT 
GGGCAGCAGC 
GCATTAGCCC 
CACTCTTCTC 
CTGCACACCC 
AAGCCGCTTT 
AAGCAAGGAG 
TTTACACACA 
TCTGAGCTGG 
AGGACACACC 
AGAGCACCCC 
CCTCTTCAAA 
CCTTAAAAGA 
GGGAGGCCGA 
AACCCTGTCT 
AGTCCCAGCT 
GCAGTGAGCC 
OGAGGAAAAA 
TTCAGGTGAA 
ACGTAAAACT 
GGATTAAAGC 
CCTAGAAGGG 
CTTOGGGTGT 
CAGGTGTGTA 
TATATATATA 
TTCTACATGG 
AACCGTTTCC 
AGCTCTACCA 
GCACGAAGGG 
CTT 



GCGTCTGTGC 
GGCCCCCCTT 
AATCTGTCTT 
ACGGCAAGTG 
ACCCCGGAGC 
ACCAGCATCC 
ACCACTGGCC 
GAGAGAAAGC 
ATCCTGGTCC 
CCGTGCAGGC 
GTAGTTGAAG 
GGTGACAAGA 
CGAATCACTT 
TCAGCCAAAG 
CCTTTCAGAG 
TCAGGGACCA 
CCCCAGTCTC 
TTATGGCTGT 
TTTCCTGCCC 
GGAGCCAGGC 
AGCGGCATCC 
GTCTCCTGTG 
TAOGTGCCGG 
GGGGGGCGGA 
CTACTAAAAA 
ACTOGGAAGG 
GAGACOGTGC 
AAAAGAAAAG 
TTAGCCTCAA 
GAAAGGCAGC 
TATGGTTATA 
CCCAAATGAG 
GTGTGTCTGT 
AATTTGCAAA 
TATGAAAAAT 
GTACCACAGG 
AGTTGGCAGC 
GAGCAGACAG 
CCTGGCAGGC 



Seq ID NO: 553 Protein sequence 
Protein Accession #: NP_006491.1



1 
I 

GLPRLVCAPL 
WFSVHKBKRT 
RSQEYRIQLR 
KEEKNRVHTQ 
TVPVFYPTEK 
NGVLVLEPAR 
TLTCEAESSQ 
LVKLAIFGPP 
STLNVLVTPE 
RANSTSTERK 
PSRKTELWE 



11 
I 

LAACCCCPRV 
LIFRVRQGQG 
VYKAPEEPNI 
SSQTVESSGL 
VHLEVEPVGM 
KEHSGRYECQ 
DLEFQWLREE 
WMAFKERKVW 
LLETGVECTA 
LPEPESRGW 
VKSDKLPEEM 



21 

I 

AGVPGEAEQP 
QSEPGEYEQR 
QVNPLGIPVN 
YTLQSILKAQ 
LKEGDRVEIR 
AHNLDTMISL 
TDQVLERGPV 
VKENMVLNLS 
SNDLGKNTSI 
IVAVIVCILV 
GLLQGSSQDK 



31 
I 

APELVEVEVG 
LSIiQDRGATL 
SKEPEBVATC 
LVKEDKDAQF 
CLADGNPPPH 
LSEPQELLVN 
LQLHDLKREA 
CEASGHPRPT 
LFLELVNLTT 
LAVIfGAVLYF 
RAPGDQGEKY 



Seq ID NO: 554 DNA sequence 

Nucleic Acid Accession #: NM_003183.3 

Coding sequence: 165.. 2639 



TCGAGCCTGG 
GCTAGGCCGG 
CGAAGGCTGC 
TATTCCTGAC 
TCGGCCCCCA 
CTTTATCTAA 
TAGAAACACT 
GTACTGAACG
AGTACACTGC 
GGGTTCTAGC 
AATATAACAT 
TTTATAAATC 
ATTTAAAAGT 
AAGAGCTTGT 
AATTATTGGT 
CAACTACAAA 
CATGGGATAA 
AGTCTCCACA 
ATGAAGAAAA 
CTGAGGAAGC 
GAACTCTTGG 
CAAAGGCTTA 
GCACAAAGAA 



11 
I 

CGGTAGAATC 
GCGGATCCCG 
CCAGAGAGGT 
CAGCGTGGTT 
CCAGAGACTC 
TATCCAGCAG 
ACTAACTTTT 
TTTTTCACAA 
AAAATGGCAG 
CCACATAAGA 
AGAGCCACTT 
TGAAGATATC 
GGATAATGAA 
TCATCGAGTG 
GGTAGCAGAT 
TTACTTAATA 
TGCAGGTTTT 
AGAGGTAAAA 
GGATGCTTGG 
ATCTAAAGTT 
ATTAGCTTAT 
TTATAGCCCA 
TTATGGTAAA 



21 
I 

TTCCCAGTAG 
TCCTCCCCCG 
GGAGTCGGTA 
CCTTTCGTGC 
GAGAAGCTTG 
CATTCGGTAA 
TCAGCTTTGA 
AATTTCAAGG 
GACTTCTTCA 
GATGATGATG 
TGGAGATTTG 
AAGAATGTTT 
GAGTTGCTCC 
AAAAGAAGAG 
CATCGCTTCT 
GAGCTAATTG 
AAAGGCTATG 
CCTGGTGAAA 
GATGTGAAGA 
TGCTTGGCAC 
GTTGGCTCTC 
GTTGGGAAGA 
ACCATCCTTA 



31 

I 

GCGGCGCGGG 
ATGTGAGCAG 
GCGGGGCCGG 
TGGCGCCGCG 
ATTCTTTGCT 
GAAAAAGAGA 
AAAGGCATTT 
TCGTGGTGGT 
CTGGACACGT 
TTATAATCAG 
TTAATGATAC 
CACGTTTGCA 
CAAAAGGGTT 
CTGACCCAGA 
ACAGATACAT 
ACAGAGTTGA 
GAATACAGAT 
AGCACTACAA 
TGTTGCTAGA 
ACCTTTTCAC 
CCAGAGCAAA 
AAAATATCTA 
CAAAGGAAGC 



41 
I 

AGGGAAAAGA 
TTTTCCGAAA 
GAACATGAGG 
ACCTCCGGAT 
CTCAGACTAC 
TCTACAGACT 
TAAATTATAC 
GGATGGTAAA 
GGTTGGTGAG 
AATCAACACA 
CAAAGACAAA 
GTCTCCAAAA 
AGTAGACAGA 
TCCCATGAAG 
GGGCAGAGGG 
TGACATCTAT 
AGAGCAGATT 
CATGGCAAAA 
GCAATTTAGC 
ATACCAAGAT 
CAGCCATGGA 
TTTGAATAGT 
TGACCTGGTT 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820
2880 
2940 
3000 
3060 
3120 
3180
3240 
3300 
3360 
3420 
3480 
3540 
41


51 




1 

STALLKCGLS 


1 

QSQGNLSHVD 


60 


ALTQVTPQDE 


RIFLCQGKRP 


120 


VGRNGYPIPQ 


VIWYKNGRPL 


180 


YCELNYRLPS 


GNHMKESREV 


240 


FSISKQNPST 


REAEEETTND 


300 


YVSDVRVSPA 


APERQBGSSL 


360 


GGGYRCVASV 


PSIPGLNRTQ 


420 


ISWNVNGTAS 


EQDQDPQRVL 


480 


LTPDSNTTTG 


LSTSTASPHT 


540 


LYKKGKLPCR 


RSGKQEITLP 


600 


IDLRH 







51 
I 

GGATTGAGGG 
CCCCGTCAGG 
CAGTCTCTCC 
GACCGGGGCT 
GATATTCTCT 
TCAACACATG 
CTGACATCAA 
AACGAAAGCG 
CCTGACTCTA 
GATGGGGCCG 
AGAATGTTAG 
GTGTGTGGTT 
GAACCACCTG 
AACACGTGTA 
GAAGAGAGTA 
CGGAACACTT 
CGCATTCTCA 
AGTTACCCAA 
TTTGATATAG 
TTTGATATGG 
GGTGTTTGTC 
GGTTTGACGA 
ACAACTCATG 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 

AATTGGGACA TAATTTTGGA GCAGAACATG ATCCGGATGG
ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC 
ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAATCTA 
CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG 
GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAACGA 
GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG 
AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGCGATTAA 
CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG 
TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT 
AGCAGCTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC 
ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA 
AAGGAAAGCC CTGTACAGTA GGATTTTGTG ACATGAATGG 
AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT 
AGTTTTTAGC AGACAACATC GTTGGGTCTG TCCTGGTTTT 
CTTTCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA 
CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT 
TTATCAAACC CTTTCCTGCG CCCCAGACTC CAGGCCGCCT
CTTCGGCGCC AGCAGCTCCA AAACTGGACC ACCAGAGAAT 
CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA 
GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC 
CTGCCTCCTT TAAACTGCAG CGTCAGAATC GTGTTAACAG 
TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT 
TCAATCACAG CTTGTATTTT GTGAAGACTG GGAAGTGACT 
TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG 
TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT 
TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG 
TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG 
ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT 
ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA 
GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT 
CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT 
TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA 
TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT 
TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA 
CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC 
CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC 

Seq ID NO: 555 Protein sequence 
Protein Accession #: NP_003174.2 
MRQSLLFLTS
QTSTHVETLL 
GEPDSRVLAH 
PKVCGYLKVD 
RGEESTTTNY 
AKSYPNEEKD 
HGGVCPKAYY 
AECAPNKDQG 
SRVDEGEECD 
TCKGVSYCTG 
KVCCRDLSGR 
INTFGKPLAD 
SASVRIIKPP 
PFPNSSTAAK 



11 
I 

WPFVIAPRP 
TFSALKRHFK 
IRDDDVIIRI 
NEELLPKGLV 
LIELIDRVDD 
AWDVKMLLEQ 
SPVGKKNIYL 
GKYVMYPIAV 
PGIMYLNNDT 
NSSECPPPGK 
CVPYVDAEQK 
NIVGSVLVFS 
PAPQTPGRLQ 
SPEDL.TDHPV 



21 

I 

PDDPGFGPHQ 
LYLTSSTERF 
NTDGAEYNIE 
DREPPEELVH 
IYRNTSWDNA 
FSFDIAEEAS 
NSGLTSTKNY 



CCNSDCTLKE 
AENDTVCLDL 
NLFLRKGKPC 
LIFWIPFSIL 
PAPVIPSAPA 
ARSEKAASFK 



31 
I 

RLEKLDSLLS 
SQNFKWWD 
PLWRFVNDTK 
RVKRRADPDP 
GFKGYGIQIB 
KVCLAHLFTY 
GKTILTKEAD 
SNCSKQSIYK 
GVQCSDRNSP 
GKCKDGKCIP 
TVGFCDMNGK 
VHCVDKKLDK 
APKLDHQRMD 
LQRQNRVNSK 



TCTAGCAGAA 
TGTGAGTGGC 
TAAGACCATT 
GAACTCGAGG 
CACCTGCTGC 

TGCTACTTGC 
AAATGCTGAA 
CCCTTTCTGC 
CTGCAAGGTG 
AAAGAACTTA 
CAAATGTGAG 
GAGCATCAAT 
CTCCTTGATA 
TAAACAGTAT 
GGATTCTGCA 
GCAGCCTGCC 
GGACACCATC 
GGACCCCTTC 
GGTCGCCAGA 
CAAAGAAACA 
TTATAGATTT 
TAGCAGATGC 
CCCTTCTCCT 
TTTAGTTTTT 
GTTATGAATA 
GAAATGATCA 
TTATTTGTGA 
AAACAAAGGA 
TACCCAGAGT 
AATATGGCTC 
GGGCTATACA 
TGCTGGGCAG 
TACATGTCTT 
ACTTGGAGAQ 
CTC 



41 
I 

DYDILSLSNI 
GKNESEYTAK 
DKRMLVYKSE 
MKNTCKLLW 
QIRILKSPQE 
QDFDMGTLGL 
LVlTHEIiGHN 
TIESKAQECP 
CCKNCQFETA 



Seq ID NO: 556 DNA sequence 

Nucleic Acid Accession #: NM_021832.

Coding sequence: 164.. 2248 



1 
I 

TCGAGCCTGG 
CTAGGCCGGG 
GAAGGCTGCC 
ATTCCTGACC 
CGGCCCCCAC 
TTTATCTAAT 
AGAAACACTA 
TACTGAACGT 
GTACACTGTA 
GGTTCTAOCC 
ATATAACATA 
TTATAAATCT 
TTTAAAAGTG 
AGAGCTTGTT 
ATTATTGGTG 
AACTACAAAT 
ATGGGATAAT 
GTCTCCACAA 
TGAAGAAAAG 
TGAGGAAGCA 
AACTCTTGGA 
AAAGGCTTAT 
CACAAAGAAT 
ATTGGGACAT 



11 
I 

CGGTAGAATC 
CGGATCCCGT 
CAGAGAGGTG 
AGCGTGGTTC 
CAGAGACTCG 
ATCCAGCAGC 
CTAACTTTTT 
TTTTCACAAA 
AAATGGCAGG 
CACATAAGAG 
GAGCCACTTT
GAAGATATCA 
GATAATGAAG 
CATCGAGTGA 
GTAGCAGATC 
TACTTAATAG 
GCAGGTTTTA 
GAGGTAAAAC 
GATGCTTGGG 
TCTAAAGTTT 
TTAGCTTATG 
TATAGCCCAG 
TATGGTAAAA 
AATTTTGGAG 



21 

I 

TTCCCAGTAG 
CCTCCCCCGA 
GAGTCGGTAG 
CTTTCGTGCT 
AGAAGCTTGA 
ATTCGGTAAG 
CAGCTTTGAA 
ATTTCAAGGT 
ACTTCTTCAC 
ATGATGATGT 
GGAGATTTGT 
AGAATGTTTC 
AGTTGCTCCC 
AAAGAAGAGC 
ATCGCTTCTA 
AGCTAATTGA 
AAGGCTATGG 
CTGGTGAAAA 
ATGTGAAGAT 
GCTTGGCACA 
TTGGCTCTCC 
TTGGGAAGAA 
CCATCCTTAC 
CAGAACATGA 



31 

I 

GCGGCGCGGG 
TGTGAGCAGT 
CGGGGCCGGG 
GGCGCCGCGA 
TTCTTTGCTC 
AAAAAGAGAT 
AAGGCATTTT 
CGTGGTGGTG 
TGGACACGTG 
TATAATCAGA 
TAATGATACC
ACGTTTGCAG 
AAAAGGGTTA 
TGACCCAGAT 
CAGATACATG 
CAGAGTTGAT 
AATACAGATA 
GCACTACAAC 
GTTGCTAGAG 
CCTTTTCACA 
CAGAGCAAAC 
AAATATCTAT 
AAAGGAAGCT 
TCCGGATGGT 



CEKRVQDVIE 
QYESLSLFHP 
TIQEDPSTDS 
ETEC 



41 

I 

AGGAAAAGAG 
TTTCCGAAAC 
AACATGAGGC 
CCTCCGGATG 
TCAGACTACG 
CTACAGACTT 
AAATTATACC 
GATGGTAAAA 
GTTGGTGAGC 
ATCAACACAG 
AAAGACAAAA 
TCTCCAAAAG 
GTAGACAGAG 
CCCATGAAGA 
GGCAGAGGGG 
GACATCTATC 
GAGCAGATTC 
ATGGCAAAAA 
CAATTTAGCT 
TACCAAGATT 
AGCCATGGAG 
TTGAATAGTG 
GACCTGGTTA 
CTAGCAGAAT 



TGTGCCCCGA 


1440 


GATCACGAGA 


1500 


GAAAGTAAGG 


1560 


GTGGATGAAG 


1620 


AACAGCGACT 


1680 


AAAAACTGTC 


1740 


AAAGGCGTGT 


1800 


AATGACACTG 


1860 


GAGAGGGAAC 


1920 


TGCTGCAGGG 


1980 


TTTTTGAGGA 


2040 


AAACGAGTAC 


2100 


ACTTTTGGAA 


2160 


TTTTGGATTC 


2220 


GAATCTCTGT 


2280 


TCGGTTCGCA 


2340 


CCTGTGATCC 


2400 


CAGGAAGACC 


2460 




2520 


AGTGAAAAGG 


2580 


GAGTGCTAAT 


2640 


GACCTACAAA 


2700 


TGGTCATGTG 


2760 


TTTGAAAAGG 


2820 


AAAATATCTT 


2880 


TTTACGTTTT 


2940 




3000 


GAAAAGTGGA


3060 


GATAAATTTA 


3120 


TTTTATGTAG 


3180 


TTCATAATTC 


3240 


TGGTAGCCAG 


3300 


TTTTTCTGTA 


3360 


AGAAAATTCA 


3420 


GCTGAGGTTG 


3480 


51 




1 

QQHSVRKRDL 


60 


WQDFFTGHW 


120 


DIKKVSRliQS 


180 


ADHRFYRYMG 


240 


VKPGEKHYNM 


300 


AYVGSPRANS 


360 


FGAEHDPDGL 


420 


QERSNKVCGN 


480 


QKKCQEAINA 


540 


CACNETDNSC 


600 


RFWDFIDQLS 


660 


SNVEMLSSMD 


720 


HMDEDGFEKD 


780 



51 
I 

GATTGAGGGG 
CCCGTCAGGC 
AGTCTCTCCT 
ACCCGGGCTT 
ATATTCTCTC 
CAACACATGT 
TGACATCAAG 
ACGAAAGCGA 
CTGACTCTAG 
ATGGGGCCGA 
GAATGTTAGT 
TGTGTGGTTA 
AACCACCTGA 
ACACGTGTAA 
AAGAGAGTAC 
GGAACACTTC 
GCATTCTCAA 
GTTACCCAAA 
TTGATATAGC 
TTGATATGGG 
GTGTTTGTCC 
GTTTGACGAG 
CAACTCATGA 
GTGCCCCGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 




TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1500 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 

AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 

CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT 1860 

TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGOG AGAGGGAACA 1920 

GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980 

CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2040 

AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 2220 

TTTCAGCATT CTTGTCCATT GTGTGTAACG TOGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTO CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA OCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 

AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTGCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2520 

GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGATGCTG 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760 

TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820 

AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGO TTATGAGTAT 2880 

TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 

GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3000 

GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA 3060

GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120 

TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240

TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300 

TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 3360 

AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTC GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 557 Protein sequence 
Protein Accession #: NP_068604.1

1 11 21 31 41 51 

I I I I I I 

MRQSLLFLTS WPFVWVPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 60 

QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKVWVD GKNESEYTVK WQDFPTGHW 120 

GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180

PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLW ADHRFYRYMG 240

RGEESTTTNY LIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 300 

AKSYPNEEKD AWDVKMLLEQ PSPDIABEAS KVCIAHLFTY QDFDMGTLGL AYVGSPRANS 360 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 420 

AECAPNEDQG GKYVMYPIAV SGDHENNKMF SNCSKQSIYK TIESKAQECP QERSNKVCGK 480 

SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540 

TCKGVSYCTG NSSECPPPGN AEDDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC 600 

KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 660 
INTFGKPLAD NIVGSVLVFS LIFWIPFSIL VHCV 

Seq ID NO: 558 DNA sequence 

Nucleic Acid Accession #: NM_004994.1 

Coding sequence: 20.. 2143 

1 11 21 31 41 51 

I I I I I I 

AGACACCTCT GCCCTCACCA TGAGCCTCTO GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 300 

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACTCGGAAG ACTTGCCGCG 420 

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540 

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660 

GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC TCCGACGGCT ACCGCTGGTG 960 

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGA3C CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 

CCTTATCGCC GACAAGTGGC COGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 
GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 1800 
GGTGCTGGGC CCGAGOCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860 
CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 1920 
GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 
CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 
CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 
GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 
GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 
CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG 2280 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 

Seq ID NO: 559 Protein sequence 
Protein Accession #: NP_004985.1 






i 
I 

MSLWQPLVLV 
RGESKSLGPA 
ITYWIQNYSB 
PDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSD3 
CTSEGRGDGR 
PMYRFTEGPP 
PTAGPTGPPS 



LDKLGLGADV 
THDVFQYREK 



11 
I 

LLVLGCCFAA 
LLLLQKQIiSL 
DLPRAVIDDA 
APPPGPGIQG 
DGLPWCSTTA 
YRWCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPFLIADKW 
AQVTGALRSG 
AYPCQDRFYW 



21 

I 

PRQRQSTLVL 
PETGELDSAT 
FARAFALWSA 
DAHPDDDELW 
NYDTDDRFGF 
DRDKLFGFCP 
SDKKWGFCPD 
HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 
RVSSRSBLNQ 



31 
I 

FPGDliRTNLT 
LKAMRTPRCG 
VTPLTFTRVY 
SLGKGVWPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLFLVAA 
PRPPTTTTPQ 
PVDDACNVNI 



41 



PXWRFDVKAQ 
VDQVGYVTYD 



I 

DRQLAEEYLY 
VPDLGRFQTP 
SRDADIVIQF 
RFGNADGAAC 
GNADGKPCQP 
NSAGELCVFP 
HEFGHALGLD 
PTAPPTVCPT 
FDAIAEIGNQ 
FFSGRQVWVY 
MVDPRSASBV 
ILQCPED 



Seq ID NO: 560 DNA sequence 

Nucleic Acid Accession #: NM_000213.1 

Coding sequence: 127.. 5385 



1 11 21 31 41 51 

I I I I I I 

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60 
CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 120 
AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC            AGCCTTGATC 180 
AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 
ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300 
CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360 
GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 
AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 
GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 
TCCAACTCCA TGTCOGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 
GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 
AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 
CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 
AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 
ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 
CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 
GGCATCATGA GCCGCAACGA TGAAOGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 
TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 
ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 
TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 1200 
CTGGAGGAGG CCTTCAATCG GATCOGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260 
CGAGGCCTTC GGACAGAGGT CACCTOCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320 
CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380 
GATGGGACGC ACGTGTGCCA GCTQCCGGAG GACCAGAAGG GCAACATCCA TCTGAAACCT 1440 
TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 1500 
CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 
TGTGTGTGCA GOGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 
GACATTCAGC CCTGCCTGOG GGAGGGCGAG GACAAGCCGT GCTGCGGCCG TGGGGAGTGC 1680 
CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 
GACAACTTCC AGTGTCCCOG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 
ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 
AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920 
GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 
TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 
GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 
GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 
GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 2220 
CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 
CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 
TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 
GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 
CTGOGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 
CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 
GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 
GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 
TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 
CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 
CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACQACCTCAA GGTGGCCCCC 2880 
GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 
GTGGAGCTGG TGGAOGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 
CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCGGCCTG 3060 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



51 
I 

RYGYTRVAEM 

EGDLKWHHHN 

GVAEHGDGYP 

HFPFIPEGRS 

PFIPQGQSYS 

FTFLGKEYST 

HSSVPEALMY 

GPPTVHPSER 

LYIiPKDGKYW 

TGASVLGPRR 

DRMPPGVPLD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 




GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 
TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC OGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 
GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 
TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 
ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 
TCACAGCCAC CCCCTCACGG OGACCTGGGC GCCCOGCAGA ACCCCAATGC TAAGGCCGCT 3540 
GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 
GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660 
CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 
TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 
GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 
AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 
CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960 
AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 
AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080 
CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 
AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGOGATG ACGTTCTACG CTCTCCATCG 4200 
GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260 
TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 
TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380 
ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 
GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 
GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 
CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 
GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 
GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 
GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 
TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 
TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 
GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 
GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 
AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 
GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 
GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 
AGOGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 
GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 
GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 
CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 
CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 
TCCTGGGAGG CATGAAGGGG GCAAGGTCOG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 
AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
TACTG 

Seq ID NO: 561 Protein sequence 
Protein Accession #: NP_000204.1 






i 
I 

MAGPRPSPWA 
CNTQAELLAA 
EVFEPLESPV 
PQTDMRPEKL 
QTAVCTRDIG 
TQDYPSVPTL 
EAFNRIRSNL 
THVCQLPEDQ 
CSEGWSGQTC 
FQCPRTSGFL 
CHCHQQSLYT 
LKRAEEVWR 
LLPLLALLLL 
SGNLKGRDW 
AQLRQEVEEN 
LTEKQVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQLSNPKFGA 
RKIHFNWLPP 
AQGEGPYSSL 
NDDNRPIGPM 
KRPMSIPIIP 
FPGSTNSLHR 
STLTSVSSHD 
ELHRLNIPNP 
LPGSAFTLST 
RVDGDSPBSR 
LPQHPLQSEY 
LSTHMDQQPF 



11 
I 

RLLLAALISV 
GCQRES1WM 
DLYILMDFSN 
KEPWPKSDPP 
WRPDSTHLLV 
VRLLAKHNII 
DIRALDSPRG 
KGNIHLKPSF 
NCSTGSLSDI 
CNDRGRCSMG 
DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTNNMQR 
LNEVYRQISG 
FHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPMGYRVK 
VSCRTHQBVP 
KKVLVDNPKN 
DIPIVDAQSG 
MTTTSAAAYG 
SRLTAGVPDT 
AQTSWVEDL 
PSAPGPLVFT 
LTVPGLSENV 
SSITTTHTSA 
QT 



21 

[ 

SLSGTLANRC 
ESSFQITEET 
SMSDDLDNLK 
FSFKNVISLT 
FSTESAFHYE 
PIFAVTNYSY 
LRTEVTSKMF 
SDGLKKDAGI 
QPCLREGEDK 
QCVCEPGWTG 
IKPGLCEDLR 
TYSYTMEGDG 
AdtAIiLPCCN 
PGFATHAASI 
VHKLQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVBGELLFQP 
IIRDPDELDR 
YWZQGDSESE 
SEPGRLAFNV 
RMbLIENLRE 
EDYDSFLMYS 
THLSPHVPHR 
PTRLVFSALG 
LPNHSYVFRV 
ALSPDSLQLS 
PYKFKVQART 
TEPFLVDGPT 



31 

I* 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVL 
EDVDEFRNKL 
ADGANVLAGI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGRGECQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPKSTVLV 
RGHMVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEFQEGVE 
WSFEQPEFS 
GEAWKELQVK 
SFTSQMLSSQ 
AHLLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
VIiSTSSTLTR 
PTSLRVSWQE 
RAQSQEGWGR 
WERPRRPNGD 
" TEGFGPEREG 
LGAQHLEAGG 



41 
I 

CVRVDKDCAY 
MSPQGLRVRL 
SQLTSDYTIG 
QGERISGNLD 
MSRNDERCHL 
PVSSLGVLQE 
RRGEVGIYQV 
KEVRSARCSF 
GHCVCYGEGR 
TCIDSNGGIC 
GEKKGRTCEE 
HKKKDCPPGS 
HYMLRENLMA 
SLRLARLCTE 
HTIVDTVLMA 
LVDVRVPLFI 
VSRGDQVARI 
LLELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
AEPAETNGEI 
RNGAGWGPER 
QRPSVSDDTE 
DYNSLTRSEH 
PRCERPLQGY 
EREGVITIES 
IVGYLVTCEM 
IITIESQDGG 
8LTRHVTQEF 



51 



FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QLRALEHVDG 
NGDFVCGQCV 
YEGQFCEYDN 



CNFKVKMVDE 
FWWLIPLLLL 
SDHLDTPMIjR 
NLLKPDTRBC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYEVCYGIiV 
EAIINLATQP 
HLVNGRMDFA 
SHSTTLPRDY 
SVEYQLLNGG 
QVHPQSPLCP 
AQGGGPATAF 
PFPQLGSRAG 
VSRTLTTSGT 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 



CTDEMFRDRR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 562 DNA sequence 

Nucleic Acid Accession #: NM_013332.1 

Coding sequence: 1..63 

GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 
AGTAACCGAC TTTCCTCOGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 120 
CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 180 
GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 240 
GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300 
GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 360 
CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420 
ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 480 
TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 540 
CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGGGGTG GCTCATGCCT GTAATCCTAG 600 
CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 660 
CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 720 
GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG 780 
GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 840 
CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 900 
TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 960 
TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 1020 
TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 1080 
TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCAGAAGA GCTCCTCCAT GTTTGGATAG 1140 
CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 1200 
TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 1260 
GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 563 Protein sequence 
Protein Accession #: NP_037464.1 



1 11 21 31 41 51 

I I I I I I 

MKHVLNLYLL GVVLTLLSIF VRVMESLEGL LESPSPGTSW TTRSQLANTE PTKGLPDHPS 60 
RSM 

Seq ID NO: 564 DNA sequence 

Nucleic Acid Accession #: NM_023915.1 

Coding sequence: 250.. 1326 



1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 
TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 
GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 
COCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 
AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 
TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 
TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 
GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 
ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 
AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020 
GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080 
AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140 
ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 
TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 
ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320 
GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 1380 
TTCATTATCC TTAAAAAAAA AA 

Seq ID NO: 565 Protein sequence 
Protein Accession #: NP_076404 

1 11 21 31 41 51 

I I I I I I 

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60 
GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 
FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 
NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 
ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 

Seq ID NO: 566 DNA sequence 

Nucleic Acid Accession #: NM_005365.1 

Coding sequence: 1..948 

1 11 21 31 41 51 

I I I I I I 

ATGTCTCTCG AGCAGAGGAG TCCGCACTGC AAGCCTGATG AAGACCTTGA AGCCCAAGGA 60 
GAGGACTTGG GCCTGATGGG TGCACAGGAA CCCACAGGCG AGGAGGAGGA GACTACCTCC 120 
TCCTCTGACA GCAAGGAGGA GGAGGTGTCT GCTGCTGGGT CATCAAGTCC TCCCCAGAGT 180 
CCTCAGGGAG GCGCTTCCTC CTCCATTTCC GTCTACTACA CTTTATGGAG CCAATTCGAT 240 
GAGGGCTCCA GCAGTCAAGA AGAGGAAGAG CCAAGCTCCT CGGTCGACCC AGCTCAGCTG 300 
GAGTTCATGT TCCAAGAAGC ACTGAAATTG AAGGTGGCTG AGTTGGTTCA TTTCCTGCTC 360 
CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAGAAA TGCTGGAGAG CGTCATCAAA 420 
AATTACAAGC GCTACTTTCC TGTGATCTTC GGCAAAGCCT CCGAGTTCAT GCAGGTGATC 480 
TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT CCTACATCCT TGTCACTGCT 540 
CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA GCATGCCCAA GGCCGCCCTC 600 
CTGATCATTG TCCTGGGTGT GATCCTAACC AAAGACAACT GCGCCCCTGA AGAGGTTATC 660 
TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG AGCACATGTT CTACGGGGAG 720 
CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT ACCTGGAGTA CCGGCAGGTG 780 
CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT CCAAGGCCCA CGCTGAAACC 840 
AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG CAAGAGAGCC CATCTGCTAC 900 
CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG GAGTCTGA 

Seq ID NO: 567 Protein sequence 
Protein Accession #: NP_005356.1 

1 11 21 31 41 51 

I I I I I I 

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60 
PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKL KVAELVHFLL 120 
HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180 
LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240 
PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 300 
PSLYEEVLGE EQEGV 



Seq ID NO: 568 DNA sequence 
Nucleic Acid Accession #: NM_014400 
Coding sequence: 86.. 1126 



GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGAT 
CTGCAAOGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 
TGGCTCCTGT 
CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTO 
CTTCTGCTGC 
GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 
TTGTATAGTG 



11 

I 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGCGTGCAGA 
CCGGGCGTGG 
TCGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT 
GATGAATTCT 
TGCCAGGGGT 
CCACCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 
GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CATATGTCTT 
TGTGTGATCA 



21 

1 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 

CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 



GCCCAGCCCC 
GGCTTTGGGA 
GACAGCTCCT 
AGAGTGAGAG 
CACTTTCTCC 
CTCTAAGCAC 
CCTTACTAGA 
GTTTCTGGCA 



31 

I 

CCCGAGCTGG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTCGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 
GTATCCTTCT 
AAGTCAGCTG 
TAGCCAGCCT 

CTGTGAGCTC 
CATAAATGCC 



41 
I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 
CATCCTTGTC 
TCACGGGGAA 
GGACTTTGGA 
ACTCCCCGCA 
CTCGAGGGCA 
TCAATAAAGA 



51 

I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 
TCTCCGCTTG 
GGTGAGAGAG 
GCGTGGGGTG 
TCTTTGGGGA 
GGGACCGTGC 
TTTAATTACT 



Seq ID NO: 569 Protein sequence 
Protein Accession #: NP_055215 



1 

i 

MDPARKAGAQ 
CTEAVGAVET 
SRALDPAGNE 
AANVTVSLPV 
LPPPEPTTVA 



11 
I 

AMIWTAGWLL 
IHGQPSLAVX 
SAYPPNGVBC 
RGCVQDEPCT 
STTSVTTSTS 
QYPAKGGPQQ 



21 



31 



41 

I 



51 



Seq ID NO: 570 DNA sequence 

Nucleic Acid Accession #: NM_005329.1 

Coding sequence: 1..1662 



ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGCGCAAGT 
GTGGTGGATG 
GGCGGCACCG 
GGTGAGACGG 
AGCACCTTCT 




GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 



21 
I 

AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 



31 
I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGAOGTGCCG 
ATTGCCGCGT 
ATCTCCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACCGTG 
GGAGGCAAGC 



41 
I 

CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGTCAT 



420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



LIiUoRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 
GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 
YSCVGItSREA CQGTSPPWS CYNASDHVYK GCFDGNVTLT 
RDGVTGPGPT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 
APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 
PHNKGCVAPT AGLAALLLAV AAGVLL 



60 
120 
180 
240 
300 



51 
I 

TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 




TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA COGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATOGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA 

Seq ID NO: 571 Protein sequence 
Protein Accession #: NP_005320.1 

1 11 21 31 41 51 

I I I I I I 

MPVQLTTALR VVGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60 

LFAPLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKVVM 120 

VVDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDVVRA 180 

STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300 

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHLWMT YESVVTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420 

IIKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIVVNFIG 480 

LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540 
KKPEQYSLAF AEV 

Seq ID NO: 572 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-7095 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTOGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGOGCT ATTGTGAATC CT6GTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

rCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 




AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 
Seq ID NO: 573 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KPQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCPDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDXYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLKDY LQNNPREQQY KFSRQVFSSY 300 

TGXEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEPS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSPSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG KGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR 1680

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

AKRHAVGPVV VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100

HTIKDFWRMI WDHNAQLVVM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160

NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220 

VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280
SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 
Seq ID NO: 574 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 148-4518 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 575 Protein sequence: 
Protein Accession #: Eos sequence

1 11 21 31 41 51

I I I I I I

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780

PLVIVSALTF ICLVVLVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020

GSQKGRPSGR VVTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV VVHCSAGVGR 1080

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLVV 1260

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCL SNEEKLIIQD FILEATQDDY 1320

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 
Seq ID NO: 576 DNA sequence
Nucleic Acid Accession #: EOS sequence
Coding sequence: 148-4494

1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGGGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720

CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780

AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320

AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 



AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100

CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220

AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340

Seq ID NO: 577 Protein sequence:
Protein Accession #: EOS sequence

1 11 21 31 41 51

I I I I I I

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900

KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020

GRVVTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVVVHCSAGV GRTGTYIVLD 1080

SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL VVMIPDGQNM 1260

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320

CPKWPNPDSP ISKTFELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440
AESLESLV

Seq ID NO: 578 DNA sequence

Nucleic Acid Accession #: EOS sequence

Coding sequence: 501-4514

1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480

TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260

GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500

ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280
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ATTATTCTAC CTTTGCCTAC TTCCCAACTG A6GTAACACC 
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA 
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT 
AATCOGAGAA GAAGGCAGTT ATACCCCTTO TGATCGTGTC 
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG 
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC 
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC 
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA 
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG 
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC 
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA 
TAAGAAACAC AAAAATAAAA AAGGGCTGCC AGAAAGGAAG 
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT 
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT 
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT 
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG 
•TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT 
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA 
CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT 
CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA 
CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA 
ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT 
ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC 
CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG 
AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA 
TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC 
ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA 
TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA 
ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA 
TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT 
CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC 
AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC 
GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC 
TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT 
TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA 
TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT 
AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA 
ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 579 Protein sequence: 
Protein Accession #: BOS sequence 



PCT/US02/12476 



TCATGCTTTT 
CTCGCAGACA 
TGGTCTAGCT 
AGCCCTGACT 
CTTCCAGACT 
TCCAACACCT 
AAAGCATGTT 
AGAGTTTTAC 
CAACCACCCA 
CAGGGTTAAG 
CAATTATGTT 
ATCCACAGCT 
GATAACAAAC 
GAGTGAGGAG 
TACTGTGAGG 
ACCCAGTGGA 
AGAGTACTCC 
AGTGGGGCCT 
GCTAGACAGT 
AAAACACATC 
TCATGATACA 
TCATGCCTAT 
ACAATTCCAG 
GCAATGCAAC 
TGGCATTTCA 
CTATTACCAG 
TTTCTGGAGG 
AAACATGGCA 
GAGCTTTAAG 
TATAATTCAG 
CTTTCAGTGT 
AAGTGTTATA 
TGGAGGAGTG 
AGAAAATTCC 
CTTTGCTGAC 
AAGGCAGGAA 
AAATATAGCT 
TCTGAGCATT 
TGTTGATTTC 
TATCATTAAC 
AAATGATTGA 
TTTTAACAGA 
TTTTAGTGTC 
TTTTAATACA 
TGCAGTATTC 
TCCATGGACC 
TTTCTAGTTC 
CCAGTTTTCT 
TTAACTTTTG 
CCTTACCAAA 
TTGCCATTAA 



1 
I 

MVFKASKITF 
LSILFEVGTE 
PCTDTVDWIV 
SYTGKEEIHE 
TKHEFLTDGY 
PELXGTEEII 
TNRSPTRGSE 
NDGSKTVLRS 
ISENISQGYI 
DITAQPDVGS 
FPTEVTPHAF 
IPLVIVSALT 
IPIKHFPKHV 
IVAYDHSRVK 
NVEVIVMITN 
KGSQKGRPSG 
RTGTYIVLDS 
EVLDSHIHAY 
PVERSRVGIS 
VMIPDGQNMA 
YVLEVRHFQC 
LMHQLEKENS 
GAALPDGNIA 



11 
I 



ENLDFKAIID 
FKDTVSISES 
AVCSSEPENV 
QDLGAILNNL 



FSGKGDVPNT 
PHMNLSGTAE 
FSSENPETIT 
GRESFIjQTNY 
TPSSRQQDLV 
FICLWLVGI 
ADLHASSGFT 
LAQLAEKDGK 
LVEKGRRKCD 
RWTQYHYTQ 
MLQQIQHEGT 
VNALLIPGPA 
SLSGEGTDYI 
EDEFVYWPNtf 
PKWPNPDSPI 
VDVYQVAKMI 
ESLESXjV 



31 

I 

FPLEWQIYCF 
OAALDPFILL 
MQQSGYVMLM 
LVTWERPRW 
VAICTNGLYG 
SATNQIRKKE 
KLATEKDISL 
EEESLLTSFK 



21 
I 

GSEHSLEGQK 
GVESVSRFGK 
QLAVFCEVLT 
QADPENYTSL 
LPNMSYVLQI 
EGAIVNPGRD 
SLMSTSQPVT 
SLNTVSITBY 
YDVLIPESAR 
TBIRVDESEK 
STVNWYSQT 
LIYWRKCFQT 
EEPETLKEFY 
LTDYINANYV 
QYWPADGSEB 
WPDMGVPEYS 
VNIFGFLKHI 
GKTKLEKQPQ 
NASYIMGYYQ 
DEPINCESFK VTLMAEEHKC 
SKTFELISVI KEEAANRDGP 
NLMRPGVFAD IEQYQFLYKV 



41 

I 

DADRFSSFEE 
NliLPNSTDKY 
DYLQNNFREQ 
YDTMIEKFAV 
KYSDQIiIVDM 
PQISTTTHYN 
TSQTVTELPP 
LDTGAEDSSG 



TTKSFSAGPV 
TQPVYNEASN 
AHFYLEDSTS 
QEVQSCTVDL 
DGYNRPKAYI 
YGNFLVTQKS 
LPVLTFVRKA 
RSQRNYLVQT 
LLSQSNIQQS 



MSOGPSVTDL 
SSHESRIGLA 
PRVISTPPTP 
GITADSSNHP 
AAQGPIiKSTA 
VQVLAYYTVR 
AYAKRHAVGP 
EEQYVFIHDT 
DYSAALKQCN 
LLHTIKDFWR 
LSNEEKLIIQ 
MIVHDEHGGV 
ILSLVSTRQB 



ACCCCATCCT 
ACCCAACCGG 
GAGGGGTTGG 
TTTATCTGTC 
GCACACTTTT 
ATCTTTCCAA 
GCAGATTTAC 
CAGGAAGTGC 
GACAACAAGC 
CTAGCACAGC 
GATGGCTACA 
GAAGATTTCT 
CTOGTGGAGA 
TACGGGAACT 
AATTTTACTC 
CGTGTGGTCA 
CTGCCAGTGC 
GTTGTCGTCC 
ATGTTGCAGC 
CGTTCACAAA 
CTGGTTGAGG 
GTTAATGCAC 
CTCCTGAGCC 
AGGGAAAAGA 
TCCCTGAGTG 
AGCAATGAAT 
ATGATATGGG 
GAAGATGAAT 
GTCACTCTTA 
GACTTTATCT 
CCTAAATGGC 
AAAGAAGAAG 
ACGGCAGGAA 
GTGGATGTTT 
ATTGAGCAGT 
GAGAATCCAT 
GAGAGCTTAG 
GTTTTCCTCT 
CCATCACCTG 
AATGTGTGCC 
ATTTTACAGT 
AAATTTCAAT 
AAATTTTTAG 
GTAGCCTGTA 
ACCTAAAGTA 
AAATTTATAT 
TCTGTAATTG 
GACATTGTAT 
TGGAAAATAG 
CATTGTTCAA 
AAAAAAAAAA 



51 
I 

AVKGKGKLRA 
YIYNGSLTSP 
QYKFSRQVFS 
LYQQLDGEDQ 
PTDNPELDLF 
RIGTKYNEAK 
HTVEGTSASL 
SSPATSAIPF 
EGNVWFPSST 
EMPHYSTPAY 
EGLESEKKAV 
IFPISDDVGA 
DNKHKNRYIN 
EDFWRMIWEB 
NFTLRNTKIK 
VWHCSAGVG 
LVEAILSKET 
REKNRTSSXI 
MIWDHNAQLV 
DFILEATQDD 
TAGTFCAtiTT 
ENPSTSLDSN 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2B20 
2860 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3640 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4360 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 



Seq ID NO: 580 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4632 



1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460
AAAAAAAAAA AAAAAAAAAA A

Seq ID NO: 581 Protein sequence:
Protein Accession #: EOS sequence



1 11 21 31 41 51

I I I I I I

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780

LVIVSALTFI CLVVLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPVV VHCSAGVGRT 1080

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP 1320

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV

Seq ID NO: 582 DNA sequence

Nucleic Acid Accession #: NM_002851.1

Coding sequence: 148..7092

1 11 21 31 41 51

I I I I I I

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 

GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 4500 

GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 

ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620

GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 4680 

CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740 

CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800 

CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860

ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920

TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 

GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 

TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100

GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160

TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220

TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 5280 

GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 

AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 

AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460

GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 5520

AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580 

AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 5640 

GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 

CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 

AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820

CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 

ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940 

TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 

ATTCAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 6060

AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120

ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 

CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 6600 

GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 6660

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 6720

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 6960 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 7380 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 

GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 7500 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 7680 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 583 Protein sequence 
Protein Accession #: NP_002842.1 

1 11 21 31 41 51

I I I I I I

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRVVYD TMIEKFAVLY QQLDGEDQTK 360

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720

TEVTPHAFTP SSRQQDLVST VNVVYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QVVPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK 1440 

CMSCSSYRES QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500 

PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLNE 1560

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSH 1620

ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LVVLVGILIY WRKCFQTAHF YLEDSTSPRV 1680

ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEEF ETLKEFYQEV QSCTVDLGIT 1740 

ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800 

GPLKSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYW PADGSEEYGN FLVTQKSVQV 1860

LAYYTVRNFT LRNTKIKKGS QKGRPSGRVV TQYHYTQWPD MGVPEYSLPV LTFVRKAAYA 1920

KRHAVGPVVV HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RNYLVQTEEQ 1980

YVFIHDTLVE AILSKETEVL DSHIHAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 2040

AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100

TIKDFWRMIW DHNAQLVVMI PDGQNMAEDE FVYWPNKDEP INCESFKVTL MAEEHKCLSN 2160

EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT FELISVIKEE AANRDGPMIV 2220

HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280
LVSTRQEENP STSLDSNGAA LPDGNIAESL ESLV 

Seq ID NO: 584 DNA sequence 

Nucleic Acid Accession #: NM_005688.1

Coding sequence: 126..4439

1 11 21 31 41 51 

I I I I I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 
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ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTG6TTGG 
GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT 
TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC 
TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA 
ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG 
GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG 
TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT 
TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT 
. TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA 
GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA 
CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT 
AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC 
AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA 
GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATGCAGGC 
TCCTGGTTAT TATGGCCCTT TTCATOCTGA ATGTAGGCAG 
GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC 
CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA 
CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG 
GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT 
CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT 
TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT 
TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC 
GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC 
TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC 
AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA 
AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC 
CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC 
TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC 
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA 
CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT 
AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA 
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA 
CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG 
CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA 
GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT 
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT 
TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT 
GCATAGCTAO AGCCCTGCTC CGCCACTGTA AGATTCTGAT 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT 
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC 
GTTCCOGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA 
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG 
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA 
GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA 
TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA 
TA2TAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT 
TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG 
CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT 
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC 
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA 
CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC 
TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAOACATT 
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA 
TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA 
GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG 
ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT 
CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC 
CAGCTCTTGC TAATCAGTGT CTCACACTGG GGTAGAAGTT 
ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC 
GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC 
ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 565 Protein sequence 
Protein Accession NP 005679.1 
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MKDIDIGKEY 
LDASMHSQLR 
VAHKKGELSM 
LSIVCLMITQ 
ALNYRTGVRL 
PWAILGMIY 
VLTYIKFIKM 
EMTLGFDLTA 
NKPASPHIKI 
VLAEQKGHLL 
KTSLISAILG 
CCLRPDIAIL 
NHIFNSAIRX 
FNNXiLZiGETP 



I 

IIPSPGYRSV 
ILDEEHPKGK 
EDVWSLSKHE 
LAGFSGPAFM 
RGAILTMAFK 
NVIILGPTGF 
YAWVKAFSQS 
AQAFTWTVF 
EMKNATLAWD 



QMTLLEGSIA 
PSSDLTEIGE 
HLKSKTVLFV 
PVEINSKKET 



21 
I 

RERTSTSGTH 
YHHGLSALKP 
SSDVNCRRLE 
VKHLLEYTQA 
KILKLKNIKB 
LGSAVFILFY 
VQKIREBERR 
NSMTFALKVT 
SSHSSIQNSP 
EEEGKHXHLG 
ISGTFAWAQ 
RGANLSGGQR 
THQLQYLVDC 



31 
I 



IRTTSKHQHP 
RLWQEELNEV 
TESNLQYSLL 
KSLGELINIC 
PAMMFASRLT 
ILEKAGYFQG 
PFSVKSLSEA 
KLTPKMKKDK 
HLRLQRTLHS 
QAWILNATLR 
QRISLARALY 
DEVIFMKEGC 
GPKTGSVKXB 



AATCTGCGGC 
GACGCTTCTA 
CTGGATCCTC 
AAGATACAAC 
CAGCGACCTG 
GATCAGCCTT 
CAGTGCCTTA 
CAAGTCCAAG 
AGTGATCTTC 
TTTAAATGGT 
TGAGATCAAT 
TAAAACAGGA 
GCTGGAAGAG 
TGCTGGGGGC 
CACCGCCTTC 
TGTGACTCGA 
GTACTATGCC 
AGGAGTTGTC 
CCGAAGGATC 
CAACAGGTTT 
GTTCATCCAG 
GTGGTTCCTT 
CAGGGTCCTG 
CCACATCACG 
GTTTCTGCAC 
GTGTGOGATG 
CACGGGGCTG 
CATCTCTTAT 
GACAGAAGCT 
GGAAGCACCT 
GGTGACCTTT 
AGTATCCTTC 
GAAGTCCTCG 
GATTGATGGA 
CATTCCTCAA 
CCAGTACACT 
TGCTCAGCTA 
GGGGGAACGG 
TTTAGATGAA 
CCGAGAAGCA 
AGGCTCCGAT 
GGTCCTTCTG 
GGTCGCTGTC 
CATTCCCTGC 
ATTTTATCTT 
ATATTTTGAT 
ATTGCACTCT 
GCTTTATAOG 
AATGTAAGCT 
TTCTATCATT 
AAGAGTAGCA 
CCAAAGGAAG 
AGCCGCTCCA 
GTTCTCAGGG 
GGGGCGAAGC 
CCTCCGAGCC 
ATCAGTCTAT 
GCTCTCCAGC 
TGGCACTTTT 
TCGTGGGTCT 
TCAAAGTCTG 
TTTGTACTGT 
GCAAACCCCC 
AGTTGAATGG 
TGCTGAACAC 
TAAAAAAAAA 



41 
I 

TRPLBCQDAL 
VDNAGLFSCM 
GPDAASLRRV 
LVLGliLLTEI 
SNDGQRMFEA 
AYFRRKCVAA 
ITVGVAPIW 
SVAVDRFKSL 
RASRGKKEKV 
IDLEIQEGKL 
DNILFGKEYD 
SDRSIYILDD 
ITERGTHEEL 
KAVKPEEGQL 



AGTGTGGGAA 
GAGGGCAGCA 
AATGCTACTC 
TCTGTGCTGA 
ACGGAGATTG 
GCCCGGGCCT 
GATGCCCATG 
ACAGTTCTGT 
ATGAAAGAGG 
GACTATGCTA 
TCAAAAAAGG 
TCAGTAAAGA 
AAAGGGCAGG 
CCCTTGGCAT 
AGCACCTGGT 
GGGAACGAGA 
AGCATCTACG 
TTTGTCAAGG 
CTTCGAAGCC 
TCCAAAGACA 
AACGTTATCC 
GTGGCAGTGG 
ATTCGGGAGC 
TCCAGCATAC 
AGATACCAGG 
CGGTGGCTGG 
ATGATCGTTC 
GCTGTCCAGT 
CGATTCACCT 
GCCAGAATTA 
GAGAACGCAG 
ACGATCAAAC 
CTGGGGATGG 
GTGAGAATCA 
GAGCCGGTGC 
GAAGACCAGA 
CCTCTGAAAC 
CAGCTCTTGT 
GCCACAGCTG 
TTTGCAGACT 
AGGATTATGG 
TCCAACGACA 
AAGGGCTGAC 
CTGGGGCGGG 
TCGCACAGCA 
TATTGTATTT 
AAAAGGTTCA 
TGTAGCTATA 
GTTTATTTTA 
TTTGTACAGT 
TTTCATTCTT 
ACGTGTGGCA 
GGGGTGGCTG 
CTCCTGOCTT 
CCAGGCCCCT 
GGGGAGTTTC 
CCACAGAGAG 
TCCAAGACCT 
TCATTTGCCT 
GTTTTCCTTT 
CAACTTTAAG 
AAAGAGACCT 
TTTGTGCTGT 
TCAGCGTTGC 
CTTGTGGAAG 
AAAAAAAAAA 



51 
I 

ETAARAEGL5 
TFSWLSSLAR 
VWIFCRTRLI 
VRSWSLALTW 
AAVGSLLAGG 
TDERVQKMNE 
VIASWTFSV 
FLMEEVHMIK 
RQLQRTEHQA 
VGICGSVGSG 
EERYNSVUIS 
PLSALDAHVG 
MNLNGDYATI 
VQLEEKGQGS 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220
5280

5340 
5400 
5460
5520 
5580 
5640 
5700 
5760 
5820 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
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VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFPDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FPCVGMIAGV PPWPLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPP LSHITSSIQG IATIHAVNRG QEPLHRYQEL 1080 

LDDNQAPFFL PTCAMRWLAV RLDLISIALI TTTGtiMIVLM HGQIPPAYAG LAISYAVQLT 1140

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNP SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQWEPDT PSVLLSNDSS RPYAMFAAAE KKVAVKG






Seq ID NO: 586 DNA sequence

Nucleic Acid Accession #: NM_001327.1

Coding sequence: 89..631



1 11 21 31 41 51 

I I I I I I 

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 

GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA

Seq ID NO: 587 Protein sequence 
Protein Accession #: NP_001318.1

1 11 21 31 41 51

| | | | | |

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR



Seq ID NO: 588 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 52..459



1 11 21 31 41 51

| | | | | |

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60 

GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120 

CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 

GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240

CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 

GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600
CTGAGCTA 

Seq ID NO: 589 Protein sequence

Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 

PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120
FLPVFLAQAP SGQRR 

Seq ID NO: 590 DNA sequence 
Nucleic Acid Accession #: NM_005562.1
Coding sequence: 90..3671

1 11 21 31 41 51 

I I I I I I 

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 



TTGTGGCTCC
TTGACTACCG 
GTGCTGGTCT 
TCACCAAGAC 
TGAGTTACTT 
CATATGGAGA 
TCTCTGGACC
AATTCTGCCA
GCACCTGTAT 
GTTATTCAGG 
ACGATCCGCA 
CAGTGATGCC 
CCCGCTGTGA
TGAGGCCTTG 
GTGACCGGCT 
ACCAGTGCAA 
GAGCTTGCAA 
GTGTTTGCAA 
CTTGCTATAA 
AGGCCCTGAT 
GCAGGATGCA 
AAGGTGCTAG 
ACCAGAGCCG 
AGTACCAGAA 
CAGAAAGTGA 
CAAATGGCTT 
CAGCCAGTAA 
CACTGGTGCG 
CTGTGGTGCA 
CAAGGGAGGC 
TCCTGGATTC 
CAAAGAGGAT 
AGTTCAAGCG 
AGAATGGAAA 
AAAGCAGAGC 
TCCTTAAAAA 
AAGCCATGAA 
AGCAAGCAGA 
CCGGGGAGGC 
AAGCCAATGT 
GTGAGATGAG 
TGGATGCAGT 
CTGGGGTTAC 
AGCCTCTCAG 
AGACCCAGAT 
AGCAGAGGGG 
AGAACTTGGA 
AGCAACAGTG 
GGCTCGGGAG 
TATGCTCAGG 
TGCACCATAC 
ATGATCAAGG 
ATGTTTGCCT 
ATAGTCAACT 
ATGAAATTCT 
ACTATTGCCT 
ACCCAGGGTG 
AGGACCTGTA 
GTTCTGGACC 
ATTTTTATTA 
GTTTCAAAGT 
ATTAGTCCTA 
ATCTTATTTT 
CACACTTCAG 
TTACCTCCAT 
GTGGGACAGT 
AGCATTTTTA 
GCAATAACCG 
CATGGGGGCA 
CATTCCAGCT 
TAACACCAGT 
TGGTGCTGCC 
CAATTGTTAG 



TGCCAAATTT 
TGTGGACAGA 
ACGGATCACA 
TTACACATTC 
TGAGTATCGA 
ATACAGTACT 
CCCAGCACCC 
GGATTGTGCT 
TCCTTGTAAC 
GGATGAGAAT 
CGACCCCCGC 
GGAGACGGAG 
GCTCTGTGCT 
TCAGCCCTGT 
GACAGGCAGG 
AGCAGGCTAC 
CTGTAACCCC 
GCCAGGATTT 
TCAAGTGAAG 
TTCAAAGGCT 
GCAGGCTGAG 
CAGATCCCTT 
CCTGGATGAC 
CCGAGTTCGG 
AGCTTCCTTG 
TAAAAGTCTG 
CATGGAGCAA 
CAAGGCCCTG 
AGGGCTTGTG 
CACTCAAGCG 
AGTGTCTCGG 
CAAACAAAAA 
TACACAAAAG 
AAGTGGGAGA 
ACAAGAAGCA 
CCTCAGAGAG 
GAGACTCTCC 
AAGAGCCCTG 
CCTGGAAATC 
GACAGCAGAT 
GGAAGTGGAA 
ACAGATGGTG 
AATCCAAGAC 
TGTAGATGAA 
CAACAGCCAA 
CCACCTCCAT 
GAACATTAGG 
AAGCTGCCAT 
CCATGTCATG 
TCAACTGACC 
TCCTTGCTTC 
ATCTGGACCC 
CATAATAGTC 
TATTCTTTGA 
TCCTAATGTC 
CATATTGTCC 
TGAACATGTT 
AGGCAGGCCC 
TGGGCATGAC 
AAGCATTTCC 
GATAGAAAAG 
ATTCAATCCT 
CTCAATCTCC 
CTGGGTCACA 
CCATCCTTCC 
GGTGACATAG 
AAAAATAAAT 
CTTGGTTTGC 
CTTGAGTTTT 
GTCACTCTGT 
GGGAATTGCT 
TTGCTTCTGT 
ATGCC 



CTTGGGAATC 
GGAGGCAGAC 
GCTCCCTTGA 
AGGTTAAATG 
AGGTTACTGC 
GGGTACATTG 
TGGGTTGAAC 
TCTGGCTACA 
TGTCAAGGGG 
CCTGACATTG 
AGCTGCAAGC 
GAGGTGGTGT 
GATGGCTACT 
CAATGCAACA 
TGTTTGAAGT 
TTCGGGGACC 
ATGGGCTCAG 
GGTGGCCCCA 
ATTCAGATGG 
CAGGGTGGTG 
CAGGCCCTTC 
GGTCTCCAGT 
CTCAAGATGA 
GATACTCACA 
GGAAACACTA 
GCTCAGGAGG 
CTGACAAGGG 
CATGAAGGAG 
GAAAAATTGG 
GAAATTGAAG 
CTTCAGGGAG 
GCGGATTCAC 
AATCTGGGAA 
GAGAAATCAG 
CTGAGTATGG 
TTTGACCTGC 
TACATCAGCC
GGGAGCGCTG 
TCCAGTGAGA 
GGAGCCTTGG 
GGAGAGCTGG 
ATTACAGAAG 
ACACTCAACA 
GAGGGGCTGG 
CTGCGGCCCA 
TTGCTGGAGA 
GACAACCTGC 
AAATATTTCT 
TGAGTGGGTG 
TGACCCCATT 
CTGATGCTGG 
CAAAGAATAG 
GTAAGTGGAG 
GTAATGTGAC 
AGAACAGAGT 
TCTGCAAGCT 
CTCCATTTTC 
ATTCAGAGCT 
ATCCTTTCTT 
TACCAGCAAA 
TGTGGCTTGG 
ACTTTTCGAA
TCTCTCTTTC 
TCCATCCCTC 
AACATATATT 

TTAAACTTAC 
AACCTCTTTG 
GGCAAGGCTG 
GCCTTTCTAC 
GGAGGAACCA 
ATTTCCTTGG 



AACAGGTGAG 
ACCCATCTGC 
TGCCACTTGG 
AGCATCCAAG 
GGAATCTCAC 
ACAATGTGAC 
AGTGTATATG 
AGAGAGATTC 
GAGGGGCCTG 
AGTGTGCTGA 
CATGTCCCTG 
GCAATAACTG 
TTGGGGACCC 
ACAATGTGGA 
GTATCCACAA 
CATTGGCTCC 
AGCCTGTAGG 
ACTGTGAGCA 
ATCAGTTTAT 
ATGGAGTAGT 
AGGACATTCT 
TGGCCAAGGT 
CTGTGGAAAG 
GGCTCATCAC 
ACATTCCTGC 
CCACAAGATT 
AAACTGAGGA 
TCGGAAGCGG
AGAAAACCAA 
CAGATAGGTC 
TCAGTGATCA 
TCTCAACGCT
ACTGGAAAGA 
ATCAGCTGCT 
GCAATGCCAC 
AGGTGGACAA 
AGAAGGTTTC 
CTGCTGATGC 
TTGAACAGGA 
CCATGGAAAA 
AAAGGAAGGA 
CCCAGAAGGT 
CATTAGACGG 
TCTTACTGGA 
TGATGTCAGA 
CAAGCATAGA 
CCCCAGGCTG 
CAACTGAGGT 
GGATGGGGAC 
CCTGATCCCA 
GCAATGAGGC 
ACTGGATGGA 
TCCTGGAATT 
TAAAGGAAAA 
GCAACCCAGT 
TCTTGCTGAT 
AAGCTGGAAG 
ATGGTGCTTG 
TTAATGATGC 
GCAAATGTTG 
GCATTGAAAG 
CACCAAAAAT 
CTCCACCCAT 
CATTCATCCT 
TATTGAGTAC 
CATAGAGTTG 
AAACTTTGTT 
CTCAACAGAA 
ACAGAGCTCT 
AACTGATTGC 
GAGGCACTTC 
ATTTTCCTGA 



CTATGGGCAA 
CCATGATGTG 
CAAGACACTG 
CAATAATTGG 
AGCCCTCCGC 
CCTGATTTCA 
TCCTGTTGGG 
AGCGAGACTG 
TGATCCAGAC 
CTGCCCAATT 
TCATAACGGG 
CCCTCCCGGG 
CTTTGGTGAA 
CCCCAGTGCC 
CACAGCCGGC 
CAACCCAGCA 
ATGTCGAAGT 
TGGAGCATTC 
GCAGCAGCTT 
ACCTGATACA 
GAGAGATGCC 
GAGGAGCCAA 
AGTTCGGGCT 
TCAGATGCAG 
CTCAGACCAC 
AGCAGAAAGC 
CTATTCCAAA 
AAGCGGTAGC 
GTCCCTGGCC 
TTATCAGCAC 
GTCCTTTCAG 
GGTAACCAGG 
AGAAGCACAG 
TTCCCGTGCC 
TTTTTATGAA 
CAGAAAAGCA 
AGATGCCAGT 
ACAGAGGGCA 
GATTGGGAGT 
GGGACTGGCC 
GCTGGAGTTT 
TGATACCAGA 
CCTCCTGCAT 
GCAGAAGCTT 
GCTGGAAGAG 
TGGGATTCTG 
CTACAATACC 
TCTTGGGATA 
ATTTGAACAT 
TGGCCAGGTG 
AGATAGCACT 
AAGACAAACT 
TGGACAAGTG 
AACTTTGACT 
CACACTGTGG 
CAGAGTTCCT 
AAGTGAGCAG 
CTGGTGCCTG 
CATGGCAACT 
GGAAAGTATT 
AGGTAAAATT
GATGCGCATC 
AATAAGAGAA 
TCCATCCATC 
CTACTGTGTG 
ATTGTCTAGT 
TGTCACAAGT 
CATATGTTGC 
GGGTTGTGCA 
AACAGACTGT 
CACCTTGGCT 
AAGTGTTTTT 



AGCCTGTCCT 
ATTCTGGAAG 
CCTTGTGGGC 
AGCCCCCAGC 
ATCCGAGCTA 
GCCCGCCCTG 
TACAAGGGGC 
GGGCCTTTTG 
ACAGGAGATT 
GGTTTCTACA 
TTCAGCTGCT 
GTCACCGGTG
CATGGCCCAG 
TCTGGGAATT 
ATCTACTGCG 
GACAAGTGTC 
GATGGCACCT 
AGCTGTCCAG 
CAGAGAATGG 
GAGCTGGAAG 
CAGATTTCAG 
GAGAACAGCT 
CTGGGAAGTC 
CTGAGCCTGG 
TACGTGGGGC 
CACGTTGAGT 
CAAGCCCTCT 
CCGGACGGTG
CAGCAGTTGA 
AGTCTCCGCC 
GTGGAAGAAG 
CATATGGATG 
CAGCTCTTAC 
AATCTTGCTA 
GTTGAGAGCA 
GAAGCTGAAG 
GACAAGACCC 
AAGAATGGGG 
CTGAACTTGG 
TCTCTGAAGA 
GACACGAATA 
GCCAAGAACG 
CTGATGGACC 
TCCCGAGCCA 
AGGGCACGTC 
GCTGATGTGA 
CAGGCTCTTG 
CAGATCTCAG 
GTTTAATGGG 
GTTGTCTTAT 
GGGTGTGAGA 
GCACAGGCAG 
CTGTTGGGAT 
TTGCCCAGGC 
CCAGTAAAAT 
CCTACTTACA 
TGTTGGAGTG 
CCACCTTCAA 
TAGAGATTGC 
TACTTTTTCG 
CTCTAGATTT 
AATGTATTTT 
TGTTCCTACT 
TTTCCATCCA 
CCAGGGGCTG 
GAGGAAGACA 
GGTGTTTATT 
AAGACCCTCC 
CATTTCTTTG 
TGAGTTATGA 
GGGAAGACTA 
AAATAAAGAA 



Seq ID NO: 591 Protein sequence 
Protein Accession #: NP_005553.1



900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 



1 
I 

MPALWLGCCL 
DGIHCEKCKN 
HHLTDAGCTQ 
PEGCTQCFCY 
SSAQRLDPVY 
MPLGKTLPCG 
DNVTLISARP 
GGACDPDTGD 



11 
I 

CPSLLLPAAR 
GFYRHRERDR 
DQRLLDSKCD 
GHSASCRSSA 
FVAPAKFLGN 
LTKTVTPRLN 
VSGAPAPWVB 
CYSGDENPDI 



21 

I 

ATSRREVCDC 
CLPCNCNSKG 
CDPAGIAGPC 
EYSVHKITST 
QQVSYGQSLS 
EHPSNNWSPQ 
QCICPVGYKG 
ECADCPIGPY 



31 
I 

NGKSRQCIPD 
SLSAHCDNSG 
DAGRCVCKPA 
FHQDVDGWKA 
FDYRVDRGGR 
LSYPEYRRLL 
QFCQDCASGY 
NDPHDPRSCK 



41 
I 

RELHRQTGNG 
RCSCKPGVTG 
VTGERCDRCR 
VQRNGSPAKL 
HPSAHDVILE 
RNLTALRIRA 
KRDSARLGPF 
PCPCHNGPSC 



51 
I 

FRCLNCNDNT 
ARCDRCLPGP 
SGYYNLDGGN 
QWSQRHQDVF 
GAGLRITAPL 
TYGEYSTGYI 
GTCIPCNCQG 
SVMPETEEW 



60 
120 
180 
240 
300 
360 
420 
480 
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN 
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS 
NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG 
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM 
RLITQMQLSL AESEASDGNT NIPASDHYVG PNGFKSLAQE 
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL 
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS 
NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM 
QVDNRKABAB EAMKRLSYIS QKVSDASDKT QQAERALGSA 
IEQEIGSLNL EANVTADGAL AMEKGLASLX SEMREVEGEL 
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL 

Seq ID NO: 592 DNA sequence 
Nucleic Acid Accession #: AP101051.1
Coding sequence: 221..856






NNVDPSASGN 
EPVGCRSDGT 
DGWPDTELE 
TVERVRALGS 
ATRLAESHVE 
EKTKSLAQQL 
LSTLVTRHMD 
GNATFYEVES 
AADAQRAKNG 
ERKELEFDTN 
VLLEQKLSRA 
PPGCYNTQAL 



1 
I 

GAGCAACCTC 
CGACCCAGAG 
GCGGGGCCCA 
ACCTGCCACC 
GCTGTTGGGC 
GCCCCAGTGG 
CGAGGGGCTG 
TGACTCCTTG 
CATCCTCCTG 
CTTGGAAGAC 
TCTTGCAGGT 
ATTCTATGAC 
TGGCTGGGCT 
CCGAAAAACA 
GAAAGACTAC 
GGACATTGAG 
GTATGGTATT 
AAACATGGCT 
TTGTATTACT 
TATATATAGA 
CTCATTATGT 
CCATATTGAT 
CAGTCAAATA 
CTAATTTACC 
TTATTTTTTA 
TTTCATTGGT 
AGCCAAGAAG 
GTGATAAATT 
TTTGCTTTGA 
CACAACTTTA 
ACCTTTTTGT 
TATATCTTCC 
GATAATCTGG 
TCTTTTTTCT 
AATATTAATT 
TTTATTTGCT 
CTTCATGTGA 
ACACATACCT 
AAACCTACGC 
ATTCTTTCAG 
TTTCCAGTCT 
GCACTGGTGT 
AGCAAGGCAT 
CTGATCTTCC 
GTGGTTTTGT 
CGTTTTGGTG 
GCCTTAACCA
AAGATTCTGA 
ACAGATGTAA 
TTTTGAATCA 
TGTTAGCTGG 
CTACACAAGG 
TGCCTTCCAA 
ATACATAGAT 
ACAAAAAAAT 
TTTGATCTTT 
TTATAATGGG 
AAAAGGAAAA 



11 
I 

AGCTTCTAGT 
CTTCTCCAGC 
GCCACCTTCG 
CCTGAGCCAG 
TTCATTCTCG 
AGGATTTACT 
TGGATGTCCT 
CTGAATCTGA 
GGAGTGATAG 
GATGAGGTGC 
CTGGCTATTT 
CCTATGACCC 
GCTGCTTCTC 
ACCTCTTACC 
GTGTGACACA 
ATACTATCAT 
ACAAAACAAA 
TAATCTTATT 
GCTTCCCATT 
TATGTATATA 
TGATACTAGC 
GAAGATGTTT 
TCATTTACTC 
AAGGATGAAT 
CCATAATCTT 
CTCTATCTCC 
AATTTATTAC 
CCTGTTGACC 
AAATATTTGT 
TTGATTGAAT 
TCCCCATTCC 
TAATAAGGTG 
TGACAAATAT 
ATCTGCCAAA 
AGTTTATATT 
CAGCTGGCTG 
TTCACTGCCT 
TCATGTGGTT 
ACATACCTTC 
CTGTGTCTGA 
GTACAGAATG 
CTGGAGACCT 
TTGGCTGCTG 
CACCTCACAG 
AATTTGAAAA 
TTGCTTTTCA 
GTCTCTCAAG 
GGAAGTCTTA 
TGGGAAGAAA 
TAATAACTCA 
CAGCTGACGC 
AAAGTCAGCC 
ACCTGAGAAT 
CTTCATGATG
TTTATGGCCC 
TTATATTCTT 
AATTTGTATA 
AAAAAAAAAA 



21 
I 

ATCCAGACTC 
GGCGGCGCAG 
GGAGTCCGGG 
CGCGGGCGCC 
CCTTCCTGGG 
CCTATGCCGG 
GCGTGTCGCA 
GCAGCACATT 
CAATCTTTGT 
AGAAGATGAG 
TAGTTGCCAC 
CAGTCAATGC 
TCTGCCTTCT 
CAACACCAAG 
GAGGCAAAAG 
TAACATTAGG 
CAAACAAACA 
TTATCTTCTT 
GAGTAATCAT 
TACATGTTTT 
ATACTTAAAA 
ATTGGTATAT 
TTCTTCATTA 
TCTTTCAATT 
ATAGCACTTG 
TGAATCTAAC 
AAATCAGAAC 
TTCCCACACA 
CCAATTGAGT 
TTTTAAGCTA 
TTAATTGTAT 
TGGTCTGTTT 
TCTCTCTGTA 
TTGAGATAAT 
ACTCTCATTC 
AGACACTGAA 
TCCTCTCTCT 
CAGTGCCTTC 
ATGTGGCTCA 
CATGTTTGTG 
CTATTTCACT 
GGATTTGAGT 
TAAGCTTATT 
TGATGTTGTG 
GTGCTATACT 
AATGTTTGAA 
TGATGAGACA 
TCTTCTGCAG 
TAAAAGCCTA 
TAAGGTGCTA 
TGCTAGGATA 
ACCGTGTCTT 
ATATGCTTTT 
TGTGAGTGTA 
AAAATGACCA 
CTACCACACC 
AAGCATTACT 



31 

I

CAGCGCCGCC 
CGAGCAGGGC 
TTGCCCACCT 
CGAGCGAGTC 
ATGGATCGGC 
CGACAACATC 
GAGCACCGGG 
GCAAGCAACC 
GGCCACCGTT 
GATGGCTGTC 
AGCATGGTAT 
CAGGTACGAA 
GGGAGGTGCC 
GCCCTATCCA 
GAGAAAATCA 
ACCTTAGAAT 
AAAAACCCAT 
TCCTCAATAT 
ACTCAAATGG
TCTATTAAAA 
TATCTCTAAA 
TTTCTTTTTC 
GCTTTGGGTG 
CTTCATGCGT 
CATCGTTATT 
ACATTTCATA 
TTTGGAGGCA 
ATCCCTGTAC 
AGCTGCATGC 
CTTATTCATA 
TGTTTTCCCA 
GTCTGAACAA 
GCTGTAAGCA 
GATACTTAAC 
TTTGAACATG 
GAAGTCACTG 
ACCAGTCTAT 
CTCTCTCTAC 
GTGCCTTCCT 
CTCTGTTCCA 
TGAGCAAGAT 
CTTGGTGCTA 
GCTTCATCTG 
GGGATCCAGT 
AAGGGAAAGA 
AATAAAAAAA 
GTGAAGTAAA 
TGAGTATGGC 
CGTGTTGGTA 
TCTGTTCAGT 
GTTAGTTTGG 
ATGAGGAATT
GGAAGTTAAA 
ATTCCATGTG 
ACGAAATTGT 
TGGAAACAGA 
CTTTTTCAAT 



41 
I 

CCGGGCGCGG 
TCCCCGCCTT 
GCAAACTCTC 
ATGGCCAACG 
GCCATCGTCA 
GTGACCGCCC 
GAGATCCAGT 
CGTGCCTTGA 
GGCATGAAGT 
ATTGGGGGTG 
GGCAATAGAA 
TTTGGTCAGG 
CTACTTTGCT 
AAACCTGCAC 
TGTTGAAACA 
TTTGGGTATT 
GTGTTAAAAT 
AGGAGGGAAG 
GGGAAGGGGT 
ATAGACAGTA 
ATAGGTAAAT 
GTCCTTATAT 
CCTTTGCCAC 
GCCCTTTTCA 
AAGCCCTTAT 
GCCTACATTT 
AATCTTTCTG 
TCTGACCCAT 
TGTTCCCCCA 
GTTTTATATC 
AGTGTAATTA 
AGTGCTAGAC 
AGTCACTTAA 
CAGTTAGAAG 
AACTATGCCT 
AACAAAACCT 
TTCCACTGAA 
CAGTCTATTT 
CTCTCTACCA 
TTTTAACAAC 
GATGTATGGA 
TCAATCACCG 
TAAGCGGTGG 
GAGATAGAAT 
ATTGAGGAAT 
TGTTAAGAAA 
ATTGAGTGCA 
CCAATGCTTT 
AATCCAACAG 
GATGCCCTCA 
AAATGGTACT 
GGACCTAATA 
ATTTAAATGG 
GATATCAGTT 
TACAATAGAA 
CCAATAGACA 
AAATTGTTTT 



CDRLTGRCLK 
CVCKPGPGGP 
GRMQQAEQAL 
QYQNRVRDTH 
SASNMEQLTR 
TREATQAEIE 
EFKRTQKNLG 
ILKNLREFDL 
AGEALEISSB 
MDAVQMVITE 
KTQINSQLRP 
EQQ 



51 
I 

ACCCCAACCC 
AACTTCCTCC 
CGCCTTCTGC 
CGGGGCTGCA 
GCACTGCCCT 
AGGCCATGTA 
GCAAAGTCTT 
TGGTGGTTGG 
GTATGAAGTG 
CGATATTTCT 
TCGTTCAAGA 
CTCTCTTCAC 
GTTCCTGTCC 
CTTCCAGCGG 
AACCGAAAAT 
GTAATCTGAA 
ACTCAGTGCT 
ATTTTACCAT 
GCTCCTTAAA 
AAATACTATT 
GTATTTAATT 
ACATATGTAA 
AAGACCTAGC 
TATACTTATT 
TTGTTTTGTG 
TAGTTTCTAA 
CATGACCAAA 
AGCACTCTTG 
GGTGTTGTAA 
CCCCTAAACT 
TCATGCGTTT
TTTCTGGAGT 
TCTTTCTACC 
AGGTAGTGTG 
ATGTAGTGTC 
ACACACGTAC 
CAAAACCTAC 
CCACTGAACA 
GTCTATTTCC 
TGCTCTTACT 
AAGGGTGTTG 
TCTGTGTTTG 
TTTGTAATTC 
ACATGTAAGT 
TAACTGCATA 
TGGGTTTCTT 
CTAAACGAAT 
CTGTGGCTAA 
CAAGGGAGAT 
GAGCTCTTGC 
TCATAATAAA 
AATTTTAGTG 
CTTTTGCCAC 
ACCAAACATT 
TTTATCCAAT 
TTTTGGGGTT 
TTAATTTAAA 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



Seq ID NO: 593 Protein sequence
Protein Accession #: AAD16433.1

1 11 21 31 41 51

| | | | | |

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60
QIQCKVFDSL LNLSSTLQAT RALMVVGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120
IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALPTGWA AASLCLLGGA 180
LLCCSCPRKT TSYPTPRPYP KPAESSGKDY V

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Seq ID NO: 594 DNA sequence 

Nucleic Acid Accession #: NM_006180.1

Coding sequence: 352..2820

1 11 21 31 41 51 

I I I I I I 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGGGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGAGGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTGGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO: 595 Protein sequence
Protein Accession #: NP_006171.1

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420

DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 540

KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600

VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720
PPESIMYRKF TTESDVWSLG VVLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 

Seq ID NO: 596 DNA sequence
Nucleic Acid Accession #: AF410899
Coding sequence: 483..2999

1 11 21 31 41 51 

I I I I I I 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 

GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCCGGACG 180 

CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 300 

CTGCCGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 

CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 

GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540

GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 

CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 

CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 

AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 

ATTCTGGATT AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 

TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 

TGTCTGAACT GATCCTGGTG GGCAATCCAT TTACATGCTC CTGTGACATT ATGTGGATCA 960

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200 

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680

ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA CTCCATCACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040 

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 

AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 2280 

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 

ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2520 

TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 

TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2640 

CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700 

GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820 

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060 

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120 

TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240

TTCTTTTTTT AAATTTTCTT TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 

CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3780

CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 597 Protein sequence
Protein Accession #: AAL67965.1

1 11 21 31 41 51

| | | | | |

MSSWIRWHGP AMARLWGFCW LVVGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420

DVTDKTGREH LSVYAVVVIA SVVGFCLLVM LFLLKLARHS KFGMKDFSWF GFGKVKSRQG 480

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG MTKIPVIENP QYFGITNSQL 540

KPDTFVQHIK RHNIVLKREL GEGAFGKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600

FHREAELLTN LQHEHIVKFY GVCVEGDPLI MVFEYMKHGD LNKFLRAHGP DAVLMAEGNP 660

PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV GENLLVKIGD FGMSRDVYST 720

DYYRVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGVVLW EIFTYGKQPW YQLSNNEVIE 780

CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH TLLQNLAKAS PVYLDILG

Seq ID NO: 598 DNA sequence
Nucleic Acid Accession #: AB053906
Coding sequence: 74..814



1 
I 

AAAACCTTGA 
CTCTGGGTCC 
GCTCCTGCTG 
CATCACCGTC 
GGATGAAAAG 
CCTGGGGAAG 
GGTGGTGGAC 
GGAACCCCTC 
TGGATCTTGG 
AATGTGGACA 
GGTTGTGGCC 
CTTCTTGATG 
CTCAGGCACA 
CATCCTCCCC
AAGCTGATAC 
CCAGCTGCCC 
TGGACCCAAT 
TACCTAACAT 
TTCTGGCTGA 
GTACTTCTTT 
TAGACTTCAG 
ATAAGAAAAA 
TTTAAATAAA 



11 
I 

GGTGATTCAT 
TTAATGGCAG 
TCCGGCTGGT
ATCCCTAAGT 
ACTTTTCTTC 
AAACTAAATG 
ATACTTACAG 
ACCCTGCAGG 
CAGTTCAGTT 
ACGGTTCATC 
ATGTCCTTCC 
GGCATGGACA 
ACCCAACTCA 
TGCTTCATCC 
CAAAAGGCTC 
ACGACCTACG 
AGCTCATTCA 
ATTATGCAAT 
CTAAACAAGA 
GAATGATGAT 
ACCTCTGGGG 
ATTTATATTA 
GAGTTCTATT 



ATTCTTTCCG
ATGATTGTTT 
TCCCAAAAAA 



Seq ID NO: 599 Protein sequence
Protein Accession #: BAB61048.1

1 11 21 31 41 51
| | | | | |
MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 60
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREVVDI LTEQLRDIQL ENYTPKEPLT 120
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL PDSEKHMWTT VHPGARKMKE KWENDKWAM 180
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240
PILPGI

Seq ID NO: 600 DNA sequence
Nucleic Acid Accession #: NM_001898.1
Coding sequence: 57..482

1 11 21 31 41 51
| | | | | |
GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60
CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120
GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180
AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240
ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300
ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360
ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420
TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480
AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540
CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600
GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720
AAACAGTAGC ATCGCC

Seq ID NO: 601 Protein sequence
Protein Accession #: NP_001889.1

1 11 21 31 41 51
| | | | | |
MAQYLSTLLL LLATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 60
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 120
EIYEVPWENR RSLVKSRCQE S

Seq ID NO: 602 DNA sequence 

Nucleic Acid Accession #t NM_003976.2 

Coding sequence: 299.961 



60 
120 



11 

I 



21 
I 



31 
I 



51 
I 



ctctgagctt ctctgagcct tgtttgctca tctggaaaaa
catggagttg tgaaagaata gctgcaaagc acctaacaca 
ctacttctgc tgggttgagt ctagctgtgt aggccccttg 
gggtggcagg ccggtccccc acaaaagata actcatctct 
caggagggtg ggggaacagc tcaacaatgg ctgatgggcg 
ggaacttgga cttggaggcc tctccacgct gtcccactgc 
tgccctgtgg cccaccctgg ccgctctggc tctgctgagc 
gggctccgcg ccccgcagcc ctgccccccg cgaaggcccc
cgccggccac ctgccggggg gacgcacggc ccgctggtgc 
gccgccgcag ccttctcggc ccgcgccccc gccgcctgca 
cgggggccgc gcggcgcggg ctgggggccc gggcagccgc 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG 
CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC 
CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG 
GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA 
CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC 
AGGGCTCGCT CCAGGGCTTT GCAGACTGGA CCCTTACCGG 
CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG 
AGGCCCCTAC CGGTGGGTGA TGGATATCAT CCCCGAACAG 
CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG 
GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG 
CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCATCAGCC 
ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA 
CCTGTACTCA CTCATGGGAG CTGGCCCC 

Seq ID NO: 603 Protein sequence 
Protein Accession #: NP_003967.1






GGGGATTAAA 
TAGTAAGGTT 
TTCCTCACCT 
TAATTTGCAA 
CTCCTGGTGT 
CCCTGGCCTA 
AGCGTCGCAG 
CCGCCTGTCC 
AGTGGAAGAG 
CCCCCATCTG 
GCTCGGGCAG 
CTCGGCCTGG
CGCCGCGCGC 
CGACCGCCCC 
GCGGTCTCCT 
GCCTGCGGCT 
TGGCTCTTCC 
ACGAAGGCCT
GTGAAGGGAC 
ACACCAGAGA 
CCAGGCCTCG
CCCGCCCAGG 
AGTGCCTGTG 



CCATTTACCT 
CCCAGTGCAG 
GGAGAAACTG 
GCTGCCTCAA 
TGATAGAGAT 
GGCGGCAGCC 
AGGCCTCCCT 
TGGCGTCCCC 
CCCGGCGGCC 
CTCTTCCCCG 
CGGGGGCGCG
GCCACCGCTC 
GCTCTCCACA 
CGGGCTCCCG
TCATGGACGT 
GCCTGGGCTG 
TGCCTGGGAC 
CAAAGCTGAG 
AACTGACTAG 
CCTCAGCTAT 
AACCTGGGAC 
CCCTGTAGGG 
CTGGAACTGG 



11 



21 



31 



ACTGGCCGCT 
GGACCCCCAA 
TCGCTCCCCG 
CGCGTGTCTA 
CTCCATATCC 
CAAGCTAGGG 
CGGGGCAGGG 
CACCGGACGG 
CAGACAAGGC 
CCCGGGCCTG 
TAAAAGAGGC 
GCCCAGCACT 
TCAACAGGAG 
AGATGGAACT 
AGCCTGCCCT 
CCCTGGGCTC 
CCCCCGCCGG 
GGCCGCCGCC 
CCCGCGGGGG 
CGCGGGGCTG 
GCTCCGACGA
CACACGACCT 
CCCGGCCCGT 
ACGTCAACAG 
GCTGAGGGCT 
GGACCCTCCC 
TGAGAGGCCC
CTAGCAGCCC 
CTATGGAGCC 
GGACCCCTCC 
AGGGACAGCA 
CTGGCCTGTA 



11 
I 

GAGAGAAGAA 
ATCTGCACGT 
CCCTCACTCA 
CAAACTCAAC 
GAGGGGCCCC 
GGGACTGGAT 
GCGCTCCCAG 
CTGCGGCGGC 
CCGGGGGCTC 
GAGCCCCACA 
ACTGCCAGGT 
GGTCCCCGGA 
GGTGGGGGAA 
TGGACTTGGA 
GTGGCCCACC 
CGCGCCCCGC 
CCACCTGCCG 
GCAGCCTTCT 
CCGCGCGGCG 
CCGCCTGCGC 
GCTGGTGCGT 
CAGCCTGGCC 
CAGCCAGCCC 
CACCTGGAGA 
CGCTCCAGGG 
GCAGAGTCCC 
CTACCGGTGG 
CAGAGCCCTC 
CTTCGGACCC 
TCTGATGAAC 
TTTGAAGGAC 
CTCACTCATG 



21 

I 

TCGGGTGGAG 
ACCAGCAGTC 
CTTTCTCCCG 
TCCCGGTTTC 
TCCCAGCATC 
CCGACGGGTG 
CCCCACCCCG 
GGGCAGGAGG 
CGCCAGCAGC 
CCCGAGGGTG 
GTACAGTCCT 
AAGGTGCCTA 
CAGCTCAACA 
GGCCTCTCCA 
CTGGCCGCTC 
AGCCCTGCCC 
GGGGGACGCA 
CGGCCCGCGC 
CGGGCTGGGG 
TCGCAGCTGG 
TTCCGCTTCT 
AGCCTACTGG 
TGCTGCCGAC 
ACCGTGGACC 
CTTTGCAGAC 
ACTAGCCAGC 
GTGATGGATA 
ACCCTGCGGA 
ACTTCTCACA 
ACTACAGTGG 
ACATATTGCA 
GGAGCTGGCC 



31 

I 

CAGAGAGCAG 
AGCCGCCCCA 
CCCTCGGCCC 
CGTGCCTCTC 
TACCCCCCTC 
GAGCAGCCAG 
GGATCTGGTG 
CTGCTGAGGG 
AGGTCCCTCG 
CAGACTGGCT 
GGGCATGCGC 
GAAGAACAAG 
ATGGCTGATG 
CGCTGTCCCA 
TGGCTCTGCT 
CCCGCGAAGG 
CGGCCCGCTG 
CCCCGCCGCC
GCCCGGGCAG 
TGCCGGTGCG
GCAGCGGCTC 
GCGCCGGGGC 
CCACGCGCTA 
GCCTCTCCGC 
TGGACCCTTA 
GGCCTCAGCC 
TCATCCCCGA 
TCCCAGCCTA 
GACTCTGGCA 
CTGAGGCATC 
GTTGCTTGGT 
CC 



41 
I 

CTGCTGCAGG 
CGCAGGGACC 
GGCCTCCCAG 
CACOGCTCGA 
CCAACCTCGG 
GTGAGCCCCG 
ACGCTGGGGC 
ATGGAGTTGG 
GGCCCCAGCC 
GCCAAGGCCA 
TGTTTGAGCT 
GTGCAGGACC 
GGCGCTCCTG 
CTGCCCCTGG 
GAGCAGCGTC 
CCCCCCGCCT 
GTGCAGTGGA 
TGCACCCCCA 
CCGCGCTCGG 
CGCGCTCGGC 
CTGCCGCCGC 
CCTGCGACGG 
CGAAGCGGTC 
CACCGCCTGC 
CCGGTGGCTC 
AGGGACGAAG 
ACAGGTGAAG 
AAAGACACCA 
CTGGCCAGGC 
AGCCCCCGCC 
TGAAAGTGCC 



Seq ID NO: 605 Protein sequence 
Protein Accession #: NP_003967.1



11 



21 



31 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



41 51 

I I i 111 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 604 DNA sequence 
Nucleic Acid Accession #: NM_057091.1 
Coding sequence: 783..1445



41 51 

I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



Seq ID NO: 606 DNA sequence 

Nucleic Acid Accession #: NM_057160.1



60 
120 
180 



51 

I 

GCAGACAGCC 
GGCTTACCCC 
CTCTCTACTT 
GTTCTCTACT 
GGGACCTAGC 
AAAGGTGGGG 
TGGAATTTGA 
GCCCGGCCCC 
CTCGCTGCCA 
CACTTTTGGC 
TCGGGGGAGA 
CCGTGCTGCC 
GTGTTGATAG 
CCTAGGCGGC 
GCAGAGGCCT 
GTCCTGGCGT 
AGAGCCCGGC 
TCTGCTCTTC 
GCAGCGGGGG 
CTGGGCCACC 
GCGCGCTCTC
CCCCCGGGCT 
TCCTTCATGG 
GGCTGCCTGG 
TTCCTGCCTG 
GCCTCAAAGC 
GGACAACTGA 
GAGACCTCAG 
CTCGAACCTG 
CAGGCCCTGT 
TGTGCTGGAA 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800
1860 



60 
120 
180 




Coding sequence: 1..714 

1 11 21 31 41 51 

I I I I I I

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300 

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 
TCACTCATGG GAGCTGGCCC C 




Seq ID NO: 607 Protein sequence
Protein Accession #: NP_476501.1



1 11 21 31 41 51

| | | | | |

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60

APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120

RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG



Seq ID NO: 608 DNA sequence

Nucleic Acid Accession #: NM_057090.1

Coding sequence: 29..715



1 11 21 31 41 51 

I I I I I I 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 360

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140
CTCACTCATG GGAGCTGGCC CC 



Seq ID NO: 609 Protein sequence
Protein Accession #: NP_476431.1



1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 60
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 180
ALRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG



Seq ID NO: 610 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..174S 



1 11 21 31 41 51 

I I I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60

GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 

TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 




GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780

CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960

GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020 

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260

TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 

TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 

CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCOGTAG AGAAGCAGGT 1920 

CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 

GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 

AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 

CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 

ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 

TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCOGCT CAGTTCCTGG 2400 

AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

CCCCAGCACA GCAAGCTCAG CCTTTTAGAO AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 

TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 

AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 

CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 

GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 

CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820 

GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 

TCCCCCCACA GTCAGCCTGT GCAAAGGCCC OGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 

TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 

AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAQ 3060 

GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 

TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 

TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 

GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 

TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 

CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC AOGTCTGCCC 3600 

AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660

TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720 

TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 

CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 

CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3900 

TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 

AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 

ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 

CCAGAGCATG GCACATGAGC ATCACCOGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 

GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 

GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACOCTTA 4320 

GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 

GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 

TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 

GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

CCTCTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 

GTGTTGGAGA AGAAACAACA AAAGCCAATT AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800 

TGCACAGATA CTCTTCAAGC ACTGGACGTG GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4860

GGTAGGAGTG CCGCCTCTAC CCACTTGTGA TGGGGTACAG AGGCACTTGC TCTTCTGCAT 4920 

GGTGTTCAAT AGGCTGGGAG TTTTATTTAT CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980 

TTGTCTTGGG CTTTCGTCAT TAAACCAAAG GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040 

TTAGTCTTGG TCATCAGAAC CTCACTTGGT ACCATATAGA TCAAAAGCTT TGTAACCACA 5100 

GGAAAAAATA AACTCTTCCA TCCCTTAAAG AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160 

TGGGCTGTAT GTATATTGTT CTTCCTCCTT AGAATTTAGA GATACAAGAG TTCTACTTAG 5220 

AACTTTTCAT GGACACAATT TCCACAACCT TTCAGATGCT GATGTAGAGC TATTGGGAAA 5280 

GAACTTCCAA ACTCAGGAAG TTTGCAGAGA GCAGACAGCT AGAGATAACT CGGGACCCAG 5340 

AGTTGGTCGA CAGATGTTAG ATGTATCCTA GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400 

GCCCCCAGAT CCCACAGTCA GAACTGAATC TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460 

GGAAGGAAGC CATGGCTGTG GTTCAGAGAG GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520 

CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580 

CTTCATGCTG CCTTCAAAGC TAGATCATGT TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640 

GCCCCAGTGC TTGGGGATGC ATTTACAGAT TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700 

AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760 
GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT

Seq ID NO: 611 Protein sequence 
Protein Accession #: BAB84587.1






MPLKHYLLLL 
LNTHITELNE 
FQGLDSLESL 
GKNSLTHISP 



YDNHISSLPD 
FRMLANLQNI 
YDNPWRCDSD 
VPSYPETPWY 
IAAIVIGIVA



11 

I 

VGCQAWGAGL
SPFLNISALI 
LLSSNQLLQI 
RVFQHLGNLQ 
YLSNNHISQL 
NVFSNLRQLQ 
SLQNNRLRQL 
ILPLRNWLLL 
PDTPSYPDTT 
LACSLAACVG 



21 
I 

AYHGCPSECT 
ALRIEKNELS 
QPAHFSQCSN 
VLRLYENRLT 
PPSIFMQLPQ 
VLILSRNQIS 
PGNIFANVNG
NQPRLGTDTV 
SVSSTTELTS 
CCCCKKRSQA 



31 
I 

CSRASQVECT
RITPGAFRNL 
LKELQLHGNH 
DIPMGTFDGL 
LNRLTLFGNS 
FISPGAFNGL
LMAIQLQNNQ 
PVCFSPANVR 
PVEDYTDLTT 
VLMQMKAPNE 



41 
I 

GARIVAVPTP 
GSLRYLSLAN
LEYIPDGAFD 
VNLQELALQQ 
LKELSLGIFG 
TELRELSLHT 
LENLPLGIFD 
GQSLIIINVN 
IQVTDDRSVW 
C 



51 

I 

LPWNAMSLQI
NKLQVLPIGL 
HLVGLTKLNL 
NQIGLLSPGL 
PMPNLRELWL 
NALQDLDGNV 
HLGKLCELRL 
VAVPSVHVPE 
GMTQAQSGLA 



Seq ID NO: 612 DNA sequence
Nucleic Acid Accession #: XM_098151
Coding sequence: 1..447 



ATGATGCATT 
AGTGGGGTAC 
TCTGGAGTGG 
CTATTTTTAA 
TCTCTAGCCC 
CAGAGGCACT 
AACTTTGTAC 
AAGCCATTCC 



11 

I 

TGCTCAATTC 
AGAGTTCAGT 
GAGCTGGGAG 
AAAGTGCTTA 
TCAGCACCCC 
TGCTCTTCTG 
AAGAGCTCAT 
CCTGTTGCTC 



21 
I 

TCAGGGCTGG 
TTTCCTCTCT 
TCAGTGTTGG 
CTGTGCACAG 
TGCGGTAGGA 
CATGGTGTTC 
GGCTTGTCTT 
TCCTTAG 



31 
I 

AATGAGCCGG 
GTTTACAGCT 
AGAAGAAACA 
ATACTCTTCA 
GTGCCGCCTC 
AATAGGCTGG 
GGGCTTTGGT 



CTGGTCCCCC 
CCTTGACAGT 



AGGACTGGAC 
TACCCACTTG 
GAGTTTTATT 
CATTAAACCA 



51
I 

AGAAAGCTGG 
CCCACGCCCA 
ATTAGAACCA 
GTGGATTCTC 
TGATGGGGTA 
TATCTCTTCA 
AAGGAAATGG 



Seq ID NO: 613 Protein sequence 
Protein Accession #: XP_098151

1 11 21 31 41 51 

I I I I I I 

MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 
LFLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS 
NFVQELMACL GLSSLNQRKM KPFPCCSP 

Seq ID NO: 614 DNA sequence 

Nucleic Acid Accession #: NM_002658.1

Coding sequence: 77..1372



1 

I 

GTCCCOGCAG 
CCCCGACCTC 
GAGCGACTCC 
TGGAGGAACA 
GAAATTCGGA 
TCACTTTTAC 
CTCTGCCACT 
CCTGGGGAAA 
GCAGGTGGGC 
AAAGCCCTCC 
CCGCTTTAAG 
CATCTACAGG 
CCCTTGCTGG 
CATCGTCTAC 
GGTGGAAAAC 
CATTGCCTTG 
ACAGACCATC 
CACTGGCTTT 
TGTTGTGAAG 
CACCACCAAA 
CTCAGGGGGA 
CTGGGGCCGT 
CTTACCCTGG 
AGGGAGGAAA 
TCCATCAGCT 
CACCACCAGG 
CAGACCCTCT 
TGTCTTTTTC 
GGCTCGAAGG 
AATGAATAAT 
AATGTGGGAG 
ATTCCATGAA 
GCTGTGAGTG 
AAACTGTGTG 
CTGGGGCCTC 
ACCTGTGACC 
ATCCCTTCCT 
ACACTGAATA 
ATCAATAAAA 



11 
I 

CGCCGTCGCG 
GCCACCATGA 
AAAGGCAGCA 
TGTGTGTCCA 
GGGCAGCACT 
CGAGGAAAGG 
GTCCTTCAGC 
CATAATTACT 
CTAAAGCCGC 
TCTCCTCCAG 
ATTATTGGGG 
AGGCACCGGG 
GTGATCAGCG 
CTGGGTCGCT 
CTCATCCTAC 
CTGAAGATCC 
TGCCTGCCCT 
GGAAAAGAGA 
CTGATTTCCC 
ATGCTATGTG 
CCCCTCGTCT 
GGATGTGCCC 
ATCCGCAGTC 
CGGGCACCAC 
GTAAGAAGAG 
GTGAACGACA 
GGCCAGGATG 
TGGACTGAAG 
GAGAGCCAGC 
TTCCCAATTA 
CAGCGGTTTG 
TGTATCAGGA 
TAAGTGTGAG 
GACTGTGATG 
TTGGGTCCCC 
AGCACTGTCT 
TTTAGCCTAG 
TTTATATTTC 
TGTGATTTTT 



21 
I 

CCCTCCTGCC 
GAGCCCTGCT 
ATGAACTTCA 
ACAAGTACTT 
GTGAAATAGA 
CCAGCACTGA 
AAACGTACCA 
GCAGGAACCC 
TTGTCCAAGA 
AAGAATTAAA 
GAGAATTCAC 
GGGGCTCTGT 
CCACACACTG 
CAAGGCTTAA 
ACAAGGACTA 
GTTCCAAGGA 
CGATGTATAA 
ATTCTACCGA 
ACCGGGAGTG 
CTGCTGACCC 
GTTCCCTCCA 
TGAAGGACAA 
ACACCAAGGA 



ACTGGGAAGA 
ATAGCTTTAC 
GAGGGGTGGT 
CCTGCAGGAG 
TCCCCCGACC 
GGAAGTGTAA 
GGGAGCAGAG 
AATATATATG 
TAAGAGCTGG 
CCACACAGAG 
CACGTGACAG 
CAGTTTCACT 
TTCATCCAAT 
ACTATTTTTA 
CTGA 




TCAAGTTCCA 
CTCCAACATT 
TAAGTCAAAA 
CACCATGGGC 
TGCCCACAGA 
AGACAACCGG 
GTGCATGGTG 
ATTTCAGTGT 
CACCATCGAG 
CACCTACGTG 
CTTCATTGAT 
CTCCAACACG 
CAGCGCTGAC 
GGGCAGGTGT 
CGATCCCCAG 
CTATCTCTAT 
TCAGCAGCCC 
CCAATGGAAA 
AGGCCGCATG 
GCCAGGCGTC 
AGAGAATGGC 
GCTGGTTGTC 
TAGGCTCTGC 
CCTCACGGAT 
CCTGACTCAA 
TTAAAAAGGG 
GGTGGGCATT 
GCAGCTGAGG 
ACACTAACGA 
TGTGTGTATG 
TGTCTGATTG 
TGGTCTTTCT 
TGCCTGGGAA 
TTCACATAGA 
CCTCACTGGG 
TTTATATTTT 



41 
I 

GAGGCCGCCG 
CTTCTCTGCG 
TCGAACTGTG 
CACTGGTGCA 
ACCTGCTATG 
CGGCCCTGCC 
TCTGATGCTC 
AGGCGACCCT 
CATGACTGOG 
GGCCAAAAGA 
AACCAGCCCT 
TGTGGAGGCA 
TACCCAAAGA 
CAAGGGGAGA 
ACGCTTGCTC 
GCGCAGCCAT 
TTTGGCACAA 
CCGGAGCAGC 
CACTACTACG 
ACAGATTCCT 
ACTTTGACTG 
TACACGAGAG 
CTGGCCCTCT 
ATTTTTGCAG 
ACAGATGGAT 
AGGCCTGGGT 
CATGTTACTG 
CAGGGCATCT 
TGTGAGGCCC 
TCTCTTGAGG 
CTTCAGGGCA 
TTTGCACACT 
TTAAGTCTAA 
GGAGAGGTTA 
TGTACTTATT 
TGTCCCTTTC 
TGGGGTGAGG 
TGTAATTTTA 



51 
I 

CCGTCTAGCG 
TCCTGGTCGT 
ACTGTCTAAA 
ACTGCCCAAA 
AGGGGAATGG 
TGCCCTGGAA 
TTCAGCTGGG 
GGTGCTATGT 
CAGATGGAAA 
CTCTGAGGCC 
GGTTTGCGGC 
GCCTCATCAG 
AGGAGGACTA 
TGAAGTTTGA 
ACCACAACGA 
CCCGGACTAT 
GCTGTGAGAT 
TGAAAATGAC 
GCTCTGAAGT 
GCCAGGGAGA 
GAATTGTGAG 
TCTCACACTT 
GAGGGTCCCC 
TAGAGTCATC 
TTGCCTGTGG 
GCTGGCTGCC 
ACCAGCAACT 
CCTGTGCATG 
ATGGTTGAGA 
GAGCTTAGCC 
GGGCTCTGAT 
TGTTGTGTGG 
ATATTTCCTT 
TAGGTCACTC 
CTGCAGCATG 
TTGGCCAGTT 
ACCACTCCTT 
AATAAAAGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240
300 
360 
420 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
Seq ID NO: 615 Protein sequence
Protein Accession #: NP_002649.1

1 11 21 31 41 51 

I I I I I I 

MRALLARLLL CVLVVSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTVVKLI SHRECQQPHY YGSEVTTKML 360

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420
SHTKEENGLA L

Seq ID NO: 616 DNA sequence 

Nucleic Acid Accession #: NM_024422.1

Coding sequence: 202..2907

1 11 21 31 41 51 

I I I I I I 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TAGAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAA AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2760

GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820

GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880 

CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2940 

TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060 

AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120 

TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

ATGACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300 

AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360 

ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT TTTCACTGTG CAAAGGGAGA 3420 
TTTCTAGCCA GGCATTGACT ATTACAATTT CATT 

Seq ID NO: 617 Protein sequence 
Protein Accession #: NP_077740.1

1 11 21 31 41 51 

I I I I I I

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120
KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ
EPRNLFYVER DTGNLYCTRP VDREQYESFE IIAFATTPDG
PIFTEETYTF TIFENCRVGT TVGQVCATDK DEPDTMHTRL
TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST
VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE
KPLNYEEKQQ MILQIGVVNE APFSREASPR SAMSTATVTV
KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI
KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK
EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND
VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI
TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT
QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR
CNQDENHKHA QDYVLTYNYE GRGSVAGSVG
R

Seq ID NO: 618 DNA sequence 
Nucleic Acid Accession #: NM_004949.1
Coding sequence: 202..2745






CGCCAAAGGA 
CTCTCCGCGC 
GCTCCGGCCG 
GACCTGCCCC 
CTCTGCCGGC 
AATGTGACAT 
CTGAAAGAGT 
TTGGAGGATG 
TTTACCATAT 
GAGCATCAAA 
AAGAGAAGAT 
CTTTTCCTTC 
AGAGGTCCTG
AACTTGTATT 
TTTGCAACAA 
GAGGATGAAA 
GAAAATTGCA 
GACACGATGC 
CTATTTTCTA 
GAGTTAATTG 
GGTCTACAGA 
ACATTTACTC 
TTACGAGTTA 
ACCATTTTAA 
GAAGGAGTTC 
CAAATTGGTG 
AGCACAGCAA 
CCAATACAGA 
AAAGCATATG 
CCAACAGGGT 
GATAGAGAGG 
CAAGGAGGGA 
AGCCCATTCA 
ATTGTTGCGG 
AGTTCTACTT 
CGTCTTTCCT
GATAGACTTG 
GAAAATGACT 
AAGTGGGCCA 
CTGGTCTGTG 
CAGCAGAACC 
AATGGCTTCA 
TCAGGAATCA 
TCGGAATCCT
ACGGAGGTGG 
CTTGGTGAAG 
GTATCTGTGT 
CTATGAAGGA 
AGATGGGCTT 
CATGAAGAGA 
AAAAAATTAC 
TCATTATTTG 
AGTACACAAA 
TCTACAGAGA 
TTTATAGTGC 
CCTAATGGAA 
GTGGTGCTTG 
ATTATTTTAT 
TTGACTATTA 



11 
I 

AAAGCCCCTT 
GCCCCACCTC 
CGGCCCTCGC 
GAGCCCTCTC 
TGCTCCTGCT 
TACATGTTCC 
GCTTTACAGC 
GTTCAGTCTA 
TACTTTCCAA 
CAAAGGTCCT 
GGGCTCCAAT 
AACAGGTTCA 
GAGTTGACCA 
GTACTCGTCC 
CTCCAGATGG 
ATGATAACTA 
GAGTGGGCAC 
ACACACGCCT 
TGCATCCAAC 
ACAAGTACCA 
CAACTTCAAC 
GTACTTCTTA 
CTGTTGAGGA 
AGGGCAATGA 
TTTGTGTAGT 
TAGTTAATGA 
CAGTTACTGT 
CTGTTCGCAT 
ACCCAGAAAC 
GGGTCACCAT 
CAGAGACCAT 
GAACATGTAC 
TACCTAAAAA 
TTGATCCTGA 
CAGAAGTACA 
ATCAGAATGA 
GCATGTCTAG 
GCACACATCG 
TCCTTGCAAT 
GGGCTTCTGG 
TAATTGTATC 
CAACCCAAAC 
AAAACGGAGG 
GCCGGGGGGC 
ACAACTGCAG 
AATCCATTAG 
AATCAAGATG 
AGAGGATCGG 
GAATTTTTGG 
TGAGTGTGTT 
AAACCAAGAA 
GATGGAATCT 
TTTTTCAATT 
AATTAAAGTC 
AATAAAATGT 
AATTGTAGAG 
GAAACTGTTG 
TCTTGTAATG 
CAATTTCATT 



21 
I 

GGATGAGAGG 
CTCCGCCTCG 
CCCGCGGAGC 
CATGGAGGCA 
GACCCTCGCG
CTCCAAACTA 
TGCAAATCTA 
TACAACAAAT 
CACTGAGAAC 
AAAGAAAAGA 
TCCTTGTTCG 
ATCTGACACG 
AGAACCTCGG 
TGTAGATCGT 
GTATACTCCA 
CCCAATTTTT 
TACTGTGGGA 
GAAGTACTCC 
TACAGGCGTG 
GTTGAAAATA 
TTGTATCATT 
TGTGACATCA 
TAAGGACTTA 
AAATGGCAAT 
TAAGCCTTTG 
AGCTCCATTT 
TAATGTAGAA 
GAAAGAAAAT 
AAGAAGTAGC 
TGATGAAAAT 
CAAAAATGGC 
GGGGACACTG 
GACAGTGATC 
TGAGCCTATC 
GAGAATGTGG 
TCCTCCATTT 
TGTCACTTCA 
TGTAGATCCA 
ATTGTTGGGC 
GACGTCTAAA 
AAACACAGAA 
TGTGGGCGCT 
TCAGGAGACC 
TGGCCACCAT 
ATACACTTAC 
AGGACACACT 
AAAATCACAA 
TGGCTGGGTC 
ATAATTTGGA 
CTAATAAGTC 
TTTTTTAAAG 
CTTTGGTCAA 
TTTACATATT 
TGCCTTATTT 
AATTAATTCA 
ACCTTGCTTT 
TTTTCCTGAA 
TGACCTTTTC 



31 
I 

CAGGCGCTTC 
CGCTCCTCCT 
CCTCCTACCC 

GCCCGCCCCT
ATCTTAATAT
GATGCCGAGA
ATTCATTCAA

ACTATTCTAT 
CAAGAAAAGA 
CATACTAAAG 
ATGCTAGAAA 
GCCCAAAACT 
AATTTATTTT 
GAGCAGTATG 
GAACTTCCAC 
ACAGAAGAAA 
CAAGTGTGTG 
ATCATTGGGC 
ATCACCACAA 
AAAGTACAAG 
AACATTGATG 
GTGGAAGAAA 
GTGAATACTG 
TTTAAAATTG 
AATTATGAAG 
TCCAGAGAGG 
GATCAGGATG 
GCAGAAGTGG 
AGTGGCATAA 
ACAGGATCAA 
ATATATAATA 
GGCATTATAC 
ATCTGCAAAC 
CATGGCCCAC 
AGACTGAAAG 
GGCTCATATG 
TTGGATGTTA 
AGGATTGGCG 
ATAGCATTGC 
CAACCAAAAG 
GCTCCTGGAG 
TCTGCTCAGG 
ATCGAAATGG 
CACACCCTGG 
TCGGAGTGGC 
CTGATTAAAA 
GCATGCCCAA 
TGTAGGTTGT 
GCCCAAATTT 
TCTGAAAGCC 
CAGAAGATGC 
ATGCACATTT 
TTTAAATTAC 
GTTACATTTG 
AGTCCTTATT 
AACATTATCT 
CATCTAAAGT 
ACTGTGCAAA 



SDTAQNYTIY 
YTPELPLPLI
KYSIIGQVPP 
CIINIDDVND 
NGNFKIVTDA 
NVEDQDEGPE 
DENTGSIKVF
TVIICKPTMS 
PPFGSYVVPI
LLGIALLFCI 
VGASAQGVCG 
YTYSEWHSFT 
LEFLDNLEPK 



41 
I 

AGAGAAGCTA 
GAGCAGCGGG 
CGGCCCGACG 
CCGGCTCCTG 
TTGCCAGTGA 
AACTTGTTGG 
GTGATCCTGA 
TGTCCTCGGA 
AGAAAATATT 
AAAAAGTTCT 
ACTCCTTGGG 
ATACCATATA 
ATGTGGAGAG 
AATCTTTTGA 
TGCCCCTAAT 
CTTATACTTT 
CTACTGACAA 
AGGTGCCACC 
CATCATCTCA 
ACATGGATGG 
ATGTAAATGA 
ATACAGTTGA 
CTAACTGGAG
TAACAGATGC 
AAAAGCAACA 
CTAGTCCAAG 
AGGGCCCTGA 
GAACAACAAG 
GGTATAAGAA 
TCAAAGTTTT 
TTACAGTCCT 
TTCAAGACGT 
CCACCATGTC 
CCTTTGACTT 
CAATTAATGA 
TAGTACCTAT 
CACTGTGTGA 
GTGGAGGAGT 
TCTTTTGCAT 
TAATTCCTGA 
ATGACAAAGT 
GAGTTTGTGG 
TGAAAGGAGG 
ACTCCTGCAG 
ACAGTTTTAC 
ATTAAACAAT 
GACTATGTCC 
TGCAGTGAAC 
AGGACACTAG 
AGTGGCTTTA 
TATTTGTGGG 
ACAGAGAGAC 
TTATCTTCTA 
GGTATAATGA 
ATAGACTATT 
CCAGTTAATT 
GTGTAGACTG 
GGGAGATTTC 



YSIRGPGVDQ 



SPTLFSMHPT 
HLPTFTRTSY 
KTNEGVLCVV
CNPPIQTVRM 
RSLDREAETI 
SAEIVAVDPD 
TVRDRLGMSS 
LFTLVCGASG 
TVGSGIKNGG 
QPRLGEKVYL 
FRTLAEACMK 



51 
I 

AGAAAAGCAC 

CCCAGACTGC 

CTCGGCCCGC 

GAACGGAGCC 

TGCCTGCAAA 

TAGAGTTAAC

CTTCCAAATT 

GAAGAGAAGT 

TGTCTTTTTG 

AAGGCGCGCC

TCCTTTTCCA 

CTATTCCATA 

AGACACTGGA

GATAATTGCC 

AATCAAAATA 

TACAATTTTT 

AGATGAGCCT 

ATCACCCACC 

GCTAGACAGA 

TCAGTATTTT 

CCACTTGCCA 

TGTGGAAATC 

AGCTAATTAT 

CAAAACCAAT 

GATGATCTTG 

ATCAGCCATG 

GTGTAACCCT 

CAATGGATAT 

ATTAACTGAT 

CAGAAGCCTG 

TGCATCAGAC 

GAATGATAAC 

ATCTGCGGAG 

TAGTCTGGAG 

TACAGCAGCA 

AACAGTGAGA 

CTGCATTACC 

ACAACTTGGA 

CCTGTTTACG 

TGATTTAGCC 

GTATTCTGCG 

CACCGTGGGA 

ACACCAGACC 

GGGAGGACAC 

TCAGCCCCGT

GAAAGAAAGT 

TGACATATAA 

GACAAGAAGA 

CAGAAGCATG 

TGACTTTTAA 

GGTTTTTCTC 

ACTATAAACA 

TCCAAGGAGG 

CAACAGCCAA 

TGAAGCACAA 

AAGTGTTCAT 

CATTCTTGCT 

TAGCCAGGCA 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 619 Protein sequence 
Protein Accession #: NP_004940.1 

1 11 21 31 41 51

| | | | | |

MEAARPSGSW NGALCRLLLL TLAILIPASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSPTILLSN TENQEKKKIF VFLEHQTKVL 120 

KKRHTKEKVL RRAKRRNAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESFB IIAPATTPDG YTPELPLPLI IKIEDENDNY 240

PIPTEETYTP TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HLPTPTRTSY 360

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCW 420 

KPLNYEEKQQ MILQIGWNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480 

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVP RSLDREAETI 540 

KNQIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600 

EPIHGPPPDP SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYWPI TVRDRLGMSS 660

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTBVDNCR YTYSEWHSFT QPRLGEESIR 840 
GHTLIKN 






Seq ID NO: 620 DNA sequence

Nucleic Acid Accession #: NM_032545.1

Coding sequence: 46..718



1 11 21 31 41 51

| | | | | |

AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120 

CTATCAAAGA GAGAAACATA ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180 

GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 240

CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300

CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420 

CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 480 

CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 540

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600 

ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 660 

CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 720

TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 780 

TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840



Seq ID NO: 621 Protein sequence 
Protein Accession #: NP_115934.1 



1 11 21 31 41 51

| | | | | |

MTWRHHVRLL FTVSLALQII NLGNSYQREK HNGGREEVTK VATQKHRQSP LNWTSSHFGE 60
VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120
SECGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 180
LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL



Seq ID NO: 622 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..390 



1 11 21 31 41 51 

| | | | | |

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60

TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120

GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180 

CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240 

TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300 

TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360 
AGTAAATGTG AAAGAGGCCG GATATGTTAG 

Seq ID NO: 623 Protein sequence 
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

| | | | | |

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FVVKLKVQGV NSQCHSSPIS 120
SKCERGRIC 



Seq ID NO: 624 DNA sequence 
Nucleic Acid Accession #: M18728.1
Coding sequence: 51..1085



1 11 21 31 41 51

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 




GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720
ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780
CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840
ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960
CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020
TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140
GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200
CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260
ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320
GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380
AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440
TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500
GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560
AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620
AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680
TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740
GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800
GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860
CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920
CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980
CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040
GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100
ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160
TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220
GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280
ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340
TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400
TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460
CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520
TAGCTCTATA ACT

Seq ID NO: 625 Protein sequence
Protein Accession #: AAA59907.1

1 11 21 31 41 51

| | | | | |

MGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLAHNLPQ 60
NRIGYSWYKG ERVDGNSLIV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQNDTGFY 120
TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE VQNTTYLWWV 180
NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP 240
TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQELFIPNIT VNNSGSYMCQ 300
AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI

Seq ID NO: 626 DNA sequence
Nucleic Acid Accession #: M18728.1
Coding sequence: 1355..1657



1 11 21 31 41 51

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60
CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120
TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180
ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240
GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300
TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360
ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420
TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480
TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540
CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600
GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660
GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720
ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780
CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840
ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960
CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020
TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140
GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200
CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260
ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320
GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380
AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440
TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500
GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560
AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620
AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680
TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740
GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800
GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860
CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920
CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980
CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040
GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100
ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160
TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220
GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280
ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340
TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400
TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460
CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520
TAGCTCTATA ACT

Seq ID NO: 627 Protein sequence 
Protein Accession #: AAA59908.1

1 11 21 31 41 51

| | | | | |

MDSFSQDVKT RLLIMIRLLP PFNLSLLMPA SFAWQDDAVI SISQEVASEG NLTECQIYLV 60
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDF

Seq ID NO: 628 DNA sequence
Nucleic Acid Accession #: M18728.1
Coding sequence: 2370..2501

1 11 21 31 41 51

| | | | | |

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60
CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120
TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180
ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240
GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300
TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360
ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420
TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480
TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540
CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600
GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660
GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720
ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780
CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840
ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900
TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960
CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020
TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080
TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140
GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200
CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260
ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320
GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380
AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440
TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500
GGGTAACTTA ACAGAGTGTC AGATCTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560
AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620
AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680
TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740
GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800
GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860
CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920
CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980
CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040
GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100
ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160
TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220
GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280
ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340
TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400
TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460
CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520
TAGCTCTATA ACT



Seq ID NO: 629 Protein sequence 
Protein Accession #: AAA59909.1

1 11 21 31 41

| | | | |

MLTNVFISVV LFPCSNLTKP TVLVLYCPGG AITVLVEWCC FNS



Seq ID NO: 630 DNA sequence 

Nucleic Acid Accession #: NM_016639.1

Coding sequence: 40..429



1 11 21 31 41 51

| | | | | |

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTCGGGG CTCGCTGCGC 60

CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 120

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180

AAGTGCATGG ACTGCGCGTC TTGCAGGGCG CGACCGCACA GCGACTTCTG CCTGGGCTGC 240

GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300

CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 420

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTOGCCC ACTCATCATT CATTCATCCA 480 

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 540

GGGGTGGGGG GOGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG OAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 
AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 631 Protein sequence 
Protein Accession #: NP_057723.1

1 11 21 31 41 51

| | | | | |

MARGSLRRLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60

SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120
GCPAVALIQ 

Seq ID NO: 632 DNA sequence 

Nucleic Acid Accession #: NM_003816.1

Coding sequence: 79.. 2538 

1 11 21 31 41 51 

| | | | | |

CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 60 

CCTGCGGAAT CGGCCGAGAT GGGGTCTGGC GCGCGCTTTC CCTCGGGGAC CCTTCGTGTC 120 

CGGTGGTTGC TGTTGCTTGG CCTGGTGGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 180

CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240 

AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GGCTGAAGGA 300 

AAAGAGCATA TTATTCACTT GGAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 

TATACTTACA ACAAGGAAGG GACTTTAATC ACPGACCATC CCAATATACA GAATCATTGT 420 

CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCATCCA TTGCTCTTAG CGACTGTTTT 480 

GGACTCAGAG GATTGCTGCA TTTAGAGAAT GCGAGTTATG GGATTGAACC CCTGCAGAAC 540 

AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600 

TGTGGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 

CCCAGCATGA CTCAGCTACT TCGAAGAAGA AGAGCTGTCT TGCCACAGAC CCGGTATGTG 720 

GAGCTGTTCA TTGTCGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780 

GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT 840 

ATTCGAATTG TGCTAGTTGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 900 

GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTATCACA 960 

CGTGGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020 

ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGTGTTTGGA 1080 

CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 

ATGAATCACG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATTCA 1200 

GGAGCATCGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AGGACTTTGA GAAGTTAACT 1260 

TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 

CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 1380 

GAATGTGAAT TGGACCCTTG CTGCGAAGGA AGTACCTGTA AGCTTAAATC ATTTGCTGAG 1440 

TGTGCATATG GTGACTGTTG TAAAGACTGT CGGTTCCTTC CAGGAGGTAC TTTATGCCGA 1500 

GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560 

CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTGCTACAAC 1620 

GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGTCATCT TTGGCTCAAA AGCCAAGGCT 1680 

GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGATTTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATC CTTTGTGTGG AAAGCTTCAG 1800 

TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860 

AGTCGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920 

GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980 

GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TGGACATGGG 2040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCCCCC AAATTGTGAG 2100 

ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 

TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 

TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAGATCACAA 2280 

ACATATGAGT CAGATGGCAA AAATCAAGCA AACCCTTCTA GACAGCCGGG GAGTGTTCCT 2340 

CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 

GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCGCAT CAAGGCCACC TCCACCACAA 2460 

CCGAAAGTAT CATCTCAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520

TATAGTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580 

CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2640 

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAG 2700 

AGTTGTGAAA TACAAGGAAA TGCAGTAAAG CCAGGGAATT TACAATAACA TTTCCGTTTC 2760 

CATCATTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820 

TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880 

AGTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 

AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCTG TCACTCACTA 3060 

CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTCGCT 3180 

TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 

GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTCGTT CACAATAGCA CTATTTTAAA 3300 

TAAATTATAA GCTTTAAGGT ACGAAGTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360 

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 

CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480 

AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 

CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660 



AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 3720
AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGOTT 3780
TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840
AAAAAAAAAA AAAAAAAAAA AAAAA



Seq ID NO: 633 Protein sequence
Protein Accession #: NP_003807.1



MGSGARFPSG 
PYSKQVSYVI 
EGVHNSSIAL 
DIEKETAKDE 
LLANYLDSMY 
AQLVLKKGFG 
RDCSCGAKSC 
LVDAGEECDC 
DVPEYCNGSS 
IEVNSKGDRF 
WGVDFQLGSD 
NCHCENGWAP 
DQLWRSYFRK 
KQPOQPPSRP 



11 
I 

TLRVRWLLLL 
QAEGKEHIIH 
SDCFGLRGLL 
EEEPPSMTQL 
IMLNIRIVLV 
GTAGWAFVGT 
IMNSGASGSR 
GTPKBCELDP 
QFCQPDVPIQ 
GNCGPSGNEY 
VPDPGMVNEG 
PNCETKGYGG 
KRSQTYESDG 
PPPQPKVSSQ 



21 
I 

GLVGPVLGAA 
LERKKDLLPE 
HLENASYGIE 
IiRRRRAVLPQ 
GLEIWTNGNL 
VCSRSHAGGI 
NFSSCSAEDF 
CCEGSTCKLK 
NGYPCQNNKA 
KKCATGNALC 
TKCGAGKICR 
SVDSGPTYNE 
KNQANPSROP 
GNXiIPARPAP 



31 
I 

RPGFQQTSHL 
DFWYTYNKE 
PLQNSSKFEH 
TRYVELFIW 
1NIVGGAGDV 
NVFGQITVET 



SFAECAYGDC 
YCYNGMCQYY 
GKLQCENVQE 
NFQCVDASVL 
MNTALRDGLL 
GSVPRHVSPV 
APPLYSSLT 



Seq ID NO: 634 DNA sequence 

Nucleic Acid Accession #: NM_002091.1 

Coding sequence: 56..503



AGTCTCTGCT 
CGGCAGTGAG 
AGCGGTCCCG 
CCACTGGGCG 
TGAGAGAGGG 
GAATTTGCTG 
GGCCTTGGGC 
AGGTTCAAAA 
CCCCCAGCTG 
TAAGAGACTG 
AAATATTTGA 
CTTCTGGTTT 
TTTTTATATC 
TAAAAGCTTA 



11 
1 

CTTCCCAGCC 
CTCCCGCTGG 
CTGCCTGCGG 
GTGGGGCACT 
AGCCTGAAGC 
GGTCTCATAG 
AATCAGCAGC 
GGCAAAGTTG 
AACCAGCAAT 
AGTTCTGCAA 
CTATTCTGTA 
AAACTTGTTT 
TAGGCTACCT 
AACACAT 



21 

I 

TCTCCGGCGC 
TCCTGCTGGC 
GCGGAGGGAC 
TAATGGGGAA 
AGCAGCTGAG 
AAGCAAAGGA 
CTTCGTGGGA 
GTAGACTCTC 
GATAATGATG 
GCATCAGTTC 
TCTTTCATCC 
GCTGTGAACA 
GTTGGTTAGA 



31 
I 

GCTCCAAGGG 
GCTGGTCCTC 
CGTGCTGACC 
AAAGAGCACA 
AGAGTACATC 
GAACAGAAAC 
TTCAGAGGAT 
TGCTCCAGGT 
GCCTCTCTCA 
TACGGATCAT 
TTGACTAAAT 
ATTGTCGAAA 
TTCAAGGCCC 



41 
I 

SSYEIITPWR 
GTLITDHPNI 
I IYRMDDVYK 
DKERYDMMGR 
LGNFVQWRER 
FASIVAHELG 
CLLNIPKPDE 
CKDCRFLPGG 
DAQCQVIFGS 
IPVFGIVPAI 
NYDCDVQKKC 
VFFFLIVPLI 
TPPREVPIYA 



41 

I 

CTTCCCGTCG 
TGCCTAGCGC 
AAGATGTACC 
GGGGAGTCTT 
AGGTGGGAAG 
CACCAGCCAC 
AGCAGCAACT 
TCTCAACGTG 
AAAGAGAAAA 
CAACAAGATT 
TCGTGATTTT 
AGAGTCTTCC 
CGAGCTGTTA 



51 
I 

LTRERREAPR 
QNHCHYRGYV 
EPLKCGVSNK 
NQTAVREEMI 
FLITRRRHDS 



AYSAPSCGNK 
TLCRGKTSEC 
KAKAAPKDCF 
IQTPSRGTKC 



VCAIFIFIKR 
NRFAVPTYAA 



51 
I 

GGACCATGCG 
CCCGGGGGCG 
CGCGCGGCAA 
CTTCTGTTTC 
AAGCTGCAAG 
CTCAACCCAA 
TCAAAGATGT 
AAGGAAGGAA 
ACAAAACCCC 
TCCTTGTGCA 
CAAGCAGCAT 
AATTAATGCT 
CCATTCACAA 



Seq ID NO: 635 Protein sequence 
Protein Accession #: NP_002082.1

1 11 21 31 41 51

| | | | | |

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 60
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 120
DVGSKGKVGR LSAPGSQREG RNPQLNQQ

Seq ID NO: 636 DNA sequence 

Nucleic Acid Accession #: NM_016522.1

Coding sequence: 265..1299



GCGGAAGCAG 
CTGG CAAAAQ 
TTTTCTCCTC 
CCGCACCCCA 
TCGGGGAAGT 
TGCCTCGTGG 
AGCGGAGATG 
GCCACCCTCA 
ACCATCCTCT 
AACACCCAAA 
TACACCTGCT 
CAAGTATCTC 
ATTAGCCTCA 
TCTCCCAAAG 
CGGGAACAG7 
CGGAGAGTAA 
GTCCCCGTGG 
TTCCAGTGGT 
AACAGACCTT 
TACACTTGCG 
CCAGGCGCCG 
CTGCCTCTTC 
CGGGAAAGGC 
CCAATCAGAT 
GGGAGGGGAA 
CCTTGCAGAT 



11 
I 

CGAGGAGGGA 
CCGAGGCTGG 
CCCGCGCCTC 
CCCACTTCCT 
TGTCGCTGTC 
TCGTGTCTCT 
CCACCTTCCC 
GGTGCACTAT 
ATGCTGGGAA 
CGCAGTACAG 
CGGTGCAGAC 
CCAAAATTGT 
CCTGCATAGC 
CGGTTGGCTT 
CAGGGGACTA 
AGGTCACCGT 
GACAAAAGGG 
ACAAGGATGA 
TCCTCTCAAA 
TGGCCTCCAA 
TCAGCGAGGT 
TGGTCTTGCA 
TGCCGCCACC 
ATATACAAAT 
CAAAGAATAC 
ATTTAGGTAC 



21 
I 

GCCCCCTTTG 
ATTTGGGGGA 
CCGGTCGCCG 
GTGCTCGCCC 
GAGAATGGGG 
CAGGCTGCTG 
CAAAGCTATG 
TGACAACCGG 
TGACAAGTGG 
CATCGAGATC 
AGACAACCAC 
AGAGATTTCT 
AACTGGTAGA 
TGTGAGTGAA 
CGAGTGCAGT 
GAACTATCCA 
GACACTGCAG 
CAAAAGACTG 
ACTCATCTTC 
CAAGCTGGGC 
GAGCAACGGC 
CCTGCTTCTC 
ACCACCACCA 
GAAATTAGAA 
TTTGGGGGGA 
AATGGAGTTT 



31 
1 

GCCGTCCTCC 
GGAATATTAG 
CGGGTTCACC 
GGGGGGCGTG 

TTCCTTGTAC 
GACAACGTGA 
GTCACCCGGG 
TGCCTGGATC 
CAGAACGTGG 
CCAAAGACCT 
TCAGATATCT 
CCAGAGCCTA 
GACGAATACT 
GCCTCCAATG 
CCATACATTT 
TGTGAAGCCT 
ATTGAAGGAA 
TTCAATGTCT 
CACACCAATG 
ACGTCGAGGA 
AAATTTTGAT 
ACACAACAGC 
GAAACACAGC 
AAAGAGTTTT 
TCTTTTCCCA 



41 

I 

GTGGAACCGG 
ACTCGGAGGA 
GCTCAGTCCC 
TGCCGTGCGG 
ACCTGTTCCT 
CCACAGGAGT 
CGGTCCGGCA 
TGGCCTGGCT 
CTCGCGTGGT 
ATGTGTATGA 
CTAGGGTCCA 
CCATTAATGA 
CGGTTACTTG 
TGGAAATTCA 
ACGTGGCCGC 
CAGAAGCCAA 
CAGCAGTCCC 
AGAAAGGGGT 
CTGAACATGA 
CCAGCATCAT 
GGGCAGGCTG 
GTGAGTGCCA 
AATGGCAACA 
CTCATGGGAC 
AAAAAAGAAA 
AACGGGAAGA 



51 
I 

TTTTCCGAGG 
GTCTGCGCGC 
CGCGCTCGCT 
CTGCCGGAGT 
GCCCTGGAAG 
GCCCGTGCGC 
GGGGGAGAGC 
AAACCGCAGC 
CCTTCTGAGC 
CGAGGGCCCT 
CCTCATTGTG 
AGGGAACAAT 
GAGACACATC 
GGGCATCACC 
GCCCGTGGTA 
GGGTACAGGT 
CTCAGCAGAA 
GAAAGTGGAA 
CTATGGGAAC 
GCTATTTGGT 
CGTCTGGCTG 
CTTCCCCACC 
CCGACAGCAA 
AGAAATTTGA 
TTGAAAATTG 
ACACAGCACA 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 




CCOGGCTTGG ACCCACTGCA AGCTGCATCG TGCAACCTCT TTGGTGCCAG TGTGGGCAAG 
GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CACGTGGAAC ATTCTGGAGC TGGCCATCCC 
AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGCGCTT 
CCGGCCCAAG CGTQGCGCTQ COGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA AAAAAAAAA 

Seq ID NO: 637 Protein sequence
Protein Accession #: NP_057606.1

1 11 21 31 41 51

| | | | | |

MGVCGYLFLP WKCLVVVSLR LLFLVPTGVP VRSGDATFPK AMDNVTVRQG ESATLRCTID 60
NRVTRVAWLN RSTILYAGND KWCLDPRVVL LSNTQTQYSI EIQNVDVYDE GPYTCSVQTD 120
NHPKTSRVHL IVQVSPKIVE ISSDISINEG NNISLTCIAT GRPEPTVTWR HISPKAVGFV 180
SEDEYLEIQG ITREQSGDYE CSASNDVAAP VVRRVKVTVN YPPYISEAKG TGVPVGQKGT 240
LQCEASAVPS AEFQWYKDDK RLIEGKKGVK VENRPFLSKL IFFNVSEHDY GNYTCVASNK 300
LGHTNASIML FGPGAVSEVS NGTSRRAGCV WLLPLLVLHL LLKF



1620 
1680 
1740 
1800 



Seq ID NO: 638 DNA sequence 

Nucleic Acid Accession #: NM_012261.1

Coding sequence: 203..1045



1 11 21 31 41 51

| | | | | |

GATTTGCTCT GCCAGCAGCT GTCGGTGCCG CGCTCGACAC CGAGTCCTAG CTAGGCGCTC 60
ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCCCC GCCTCTCGCT CACCCCGGCC 120
CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 180
CCTCATTCGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 240
ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 300
GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 360
TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420
GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 480
TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 540
CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 600
GGCGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTACGAC TCCTCGGAGA AAACGCACTT 660
CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CACCACCTCT CTGCCTTGGT 720
CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 780
TGATCCGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 840
TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 900
GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 960
CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 1020
ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 1080
CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 1140
CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGGCTTGC TTGGCTTGTG TCCATGCTTA 1200
AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 1260
ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 1320
TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 1380
TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA T3GGGTGCTT 1440
TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500
TCATGCTCCC TGCAGCAAGA CCCCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 1560
GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCCGCA TCCAGCGACT GCAGCACCAG 1620
AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 1680
GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 1740
TTCTCTGGC



Seq ID NO: 639 Protein sequence 
Protein Accession #: NP_036393.1

1 11 21 31 41 51

| | | | | |

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIFVV RENGTTCLMA 60
EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 120
FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 180
ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 240
LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG

Seq ID NO: 640 DNA sequence 

Nucleic Acid Accession #: NM_002993.1

Coding sequence: 64..408



1 11 21 31 41 51

| | | | | |

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60
ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120
GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180
GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAAC 240
CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 300
GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 360
AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 420
ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 480
CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 540
GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 600
CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG AACCCTTTGG 660
CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG AATATTGAAT 720
TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTACCGAA AAGGCTGTGG 780
ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840
ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900
TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960
TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020
CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080
CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140
ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 1200
TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260
AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320
CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 1380
TTTAAAGGTT TTGACCATTT TOTTATGAGG AATTATACAT GTATCACATT CACTATATTA 1440
AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 1500
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA

Seq ID NO: 641 Protein sequence
Protein Accession #: NP_002984.1

1 11 21 31 41 51

| | | | | |

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 60
KTIGKLQVFP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN

Seq ID NO: 642 DNA sequence 

Nucleic Acid Accession #: NM_013271.1 

Coding sequence: 27..809 



1 11 21 31 41 51 

| | | | | | 

TCCGGAGCCA GGCTCGCTGG GGCAGCATGG CGGGGTCGCC GCTGCTCTGG GGGCCGCGGG 60 
CCGGGGGCGT CGGCCTTTTG GTGCTGCTGC TGCTCGGCCT GTTTCGGCCG CCCCCCGCGC 120 
TCTGCGCGCG GCCGGTAAAG GAACCCCGCG GCCTAAGCGC AGCGTCTCCG CCCTTGGCTG 180 
AGACTGGCGC TCCTCGCCGC TTCCGGCGGT CAGTGCCCCG AGGTGAGGCG GCGGGGGCGG 240 
TGCAGGAGCT GGCGCGGGCG CTGGCGCATC TGCTGGAGGC GGAACGTCAG GAGCGGGCGC 300 
GGGCCGAGGC GCAGGAGGCT GAGGATCAGC AGGCGCGCGT CCTGGCGCAG CTGCTGCGCG 360 
TCTGGGGCGC CCCCCGCAAC TCTGATCCGG CTCTGGGCCT GGACGACGAC CCCGACGCGC 420 
CTGCAGCGCA GCTCGCTCGC GCTCTGCTCC GOGCCCGCCT TGACCCTGCC GCCCTAGCAG 480 
CCCAGCTTGT CCCCGCGCCC GTCCCCGCCG CGGCGCTCCG ACCCCGGCCC CCGGTCTACG 540 
ACGACGGCCC CGCGGGCCCG GATGCTGAGG AGGCAGGCGA CGAGACACCC GACGTGGACC 600 
CCGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTGCGGG AAGCGCGGAC TCCGAGGGGG 660 
TGGCAGCCCC GCGCCGCCTC CGCCGTGCCG CCGACCACGA TGTGGGCTCT GAGCTGCCCC 720 
CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT AGAGACCCCG GCGCCCCAGG 780 
TGCCTGCACG CCGCCTCTTG CCACCCTGAG CACTGCCCGG ATCCCGTGCA CCCTGGGACC 840 
CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTCCCC GCCAGCAGGT CCAGAGCAAC 900 
TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC [missing] ACAATAACAT 960 
GATCTGAGC 

Seq ID NO: 643 Protein sequence 
Protein Accession #: NP_037403.1 

1 11 21 31 41 51 

| | | | | | 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 60 
RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 120 
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 180 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 240 
RVKRLETPAP QVPARRLLPP 

Seq ID NO: 644 DNA sequence 
Nucleic Acid Accession #: NM_002214 
Coding sequence: 681..2990 

1 11 21 31 41 51 

| | | | | | 

CCCAGAGCCG CCTCCCCCTG TTGCTGGCAT CCCGAGCTTC CTCCCTTGCC AGCCAGGACG 60 
CTGCCGACTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT GCAAAGCTGC AACTAATGGT 120 
GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCTGAT TGATGCGCCA CAGACTTTTT 180 
TCCCCTCGAC [missing] TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 240 
TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTCG 300 
CTAAGCTGAT TTATGCAGCA GAAGCCCCAC CGGCTGGAGA GAAACAAAAG CTCTTTTCTT 360 
TGTCCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC GCCGGGCCCT 420 
TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC COGCOGTGCC GAGCCGGGAG 480 
GGCCGTAGGG GCCCTGAGAT GCCGAGCGGT GCCCGGGCCC GCTTACCTGC ACCGCTTGCT 540 
CCGAGCCGCG GGGTCCGCCT GCTAGGCCTG CGGAAAACGT CCTAGCGACA CTCGCCCGCG 600 
GGCCCCGAGG TOGCCCGGGA GGCCGAGCCC GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660 
[missing] GTTTTGCATT ATGTGCGGCT CGGCCCTGGC TTTTTTTACC GCTGCATTTG 720 
TCTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 
CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 
CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900 
TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 
GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 
TTAATACOCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCGAGGA GCCGAAGCTA 1080 
ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 
ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200 
CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 
AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 
ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 
TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 
AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 
AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 

GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCAGGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 

Seq ID NO: 645 Protein sequence 
Protein Accession #: NP_002205 

1 11 21 31 41 51 

| | | | | | 

MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 

LGPECGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEINTQVTP 120 

GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180 

SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKA 240 

VHRQKISGNI DTPEGGPDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300 

VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 360 

TIAGEIESKA ANLNNLVVEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 420 

NVTSNDEVLF NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480 

CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540 

VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600 

CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 

CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ WNSNKIKSSS 720 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 

Seq ID NO: 646 DNA sequence 

Nucleic Acid Accession #: NM_003318.1 

Coding sequence: 1..2574 

1 11 21 31 41 51 

| | | | | | 

ATGGAATCCG AGGATTTAAG TGGCAGAGAA TTGACAATTG ATTCCATAAT GAACAAAGTG 60 

AGAGACATTA AAAATAAGTT TAAAAATGAA GACCTTACTG ATGAACTAAG CTTGAATAAA 120 

ATTTCTGCTG ATACTACAGA TAACTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 180 

AACCCAGAGG ACTGGTTGAG TTTGTTGCTC AAACTAGAGA AAAACAGTGT TCCGCTAAGT 240 

GATGCTCTTT TAAATAAATT GATTGGTCGT TACAGTCAAG CAATTGAAGC GCTTCCCCCA 300 

GATAAATATG GCCAAAATGA GAGTTTTGCT AGAATTCAAG TGAGATTTGC TGAATTAAAA 360 

GCTATTCAAG AGCCAGATGA TGCACGTGAC TACTTTCAAA TGGCCAGAGC AAACTGCAAG 420 

AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCAAA 480 

AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 540 

GAAATTGCCC TGCGGAATTT AAACCTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 600 

AAGAATTTAT CAGCATCTAC GGTATTAACT GCCCAAGAAT CATTTTCCGG TTCACTTGGG 660 

CATTTACAGA ATAGGAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA AGCCAGGTTT 720 

TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAG GTTACCGGAA TTCATTGAGA 780 

CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA CCTTCTAAAT 840 

AGCCCAGATT GTGATGTGAA GACAGATGAT TCAGTTGTAC CTTGTTTTAT GAAAAGACAA 900 

ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 960 

TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGTC ATTTCAAGGA ACCTCTGGTG 1020 

TCAGATGAAA AGAGTTCTGA ACTTATTATT ACTGATTCAA TAACCCTGAA GAATAAAACG 1080 

GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140 

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 1200 

GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC GAAAAGTTAA TACAGAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 1320 

ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380 




GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1500 

ACTCCACTTC AAAATTTACA GCTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 
AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 

Seq ID NO: 647 Protein sequence 
Protein Accession #: NP_003309.1 

1 11 21 31 41 51 

| | | | | | 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120 

AIQEPDDARD YFQMARANCK KFAFVHISFA QFELSQGNVK KSKQLLQKAV ERGAVPLEML 180 

EIALRNLNLQ KKQLLSEEEK KNLSASTVLT AQESFSGSLG HLQNRNNSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SVVPCFMKRQ 300 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDSITLKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 420 

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCFRTPV VKNDFPPACQ 480 

LSTPYGQPAC FQQQQHQILA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNEKKQIY AIKYVNLEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSVVK DSQVGTVNYM PPEAIKDMSS SRENGKSKSK ISPKSDVWSL 720 

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIEFPDIP EKDLQDVLKC CLKRDPKQRI 780 

SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF EKKRGKK 

Seq ID NO: 648 DNA sequence 
Nucleic Acid Accession #: NM_015507 
Coding sequence: 241.. 1902 

1 11 21 31 41 51 

I I I I I I 

COGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCOCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GOGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTGGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AW3TCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTCGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 




TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



Seq ID NO: 649 Protein sequence 
Protein Accession #: NP_056322 

1 11 21 31 41 51 

I I I I I I 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 

CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 

LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 

GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 

PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 

SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 

FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLS VDD 

Seq ID NO: 650 DNA sequence 

Nucleic Acid Accession #: NM_003506.1 

Coding sequence: 259..2379 

1 11 21 31 41 51 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 

TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 

CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 

CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 

CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTOGGATGT 1560 

TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040 

ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 

TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGOGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 

TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGTGCTTACT CAAAGAGTGT CCACTATTGA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120 

ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 

AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300 
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 

Seq ID NO: 651 Protein sequence 
Protein Accession #: NP_003497.1 

1 11 21 31 41 51 

| | | | | | 

MEMFTFLLTC IFLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 60 
FLPLANLECS PNIETFLCKA FVPTCIEQIH VVPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120 
LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ 180 
CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240 
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTVVLGSQNK ACTVLFMLLY FFTMAGTVWW 300 
VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 360 
LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI [missing] KKFMIRIGVF 420 
SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480 
LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540 
KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600 
KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG [missing] RISPKSDITD 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 

Seq ID NO: 652 DNA sequence 

Nucleic Acid Accession #: NM_014791.1 

Coding sequence: 171..2126 



1 11 21 31 41 51 

| | | | | | 

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA [missing] CGCCGTACCA 60 
GCCGCGTCTC TCAGGACAGC AGGCCCCTGT CCTTCTGTCG [missing] AGCCGTGCCC 120 
TCOGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG [missing] ATGAAAGATT 180 
ATGATGAACT TCTCAAATAT TATGAATTAC ATGAAACTAT [missing] GGCTTTGCAA 240 
AGGTCAAACT TGCCTGCCAT ATCCTTACTG GAGAGATGGT [missing] ATCATGGATA 300 
AAAACACACT AGGGAGTGAT TTGCCCCGGA TCAAAACGGA [missing] TTGAAGAACC 360 
TGAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA [missing] AAAATATTCA 420 
TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA [missing] CAGGATCGCC 480 
TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT [missing] GCTTATGTGC 540 
ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA [missing] GATGAATATC 600 
ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTGCAAAACC CAAGGGTAAC AAGGATTACC 660 
ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 720 
CATATCTTGG ATCAGAGGCA GATGTTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATGT 780 
GTGGATTTCT ACCATTTGAT GATGATAATG TAATGGCTTT ATACAAGAAG ATTATGAGAG 840 
GAAAATATGA TGTTCCCAAG TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT CAACAAATGC 900 
TGCAGGTGGA CCCAAAGAAA CGGATTTCTA TGAAAAATCT ATTGAACCAT CCCTGGATCA 960 
TGCAAGATTA CAACTATCCT GTTGAGTGGC AAAGCAAGAA TCCTTTTATT CACCTCGATG 1020 
ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA ATGGAGGATT 1080 
TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTACCTA TCTTCTGCTT CTAGCCAAGA 1140 
AGGCTCGGGG AAAACCAGTT CGTTTAAGGC TTTCTTCTTT CTCCTGTGGA CAAGCCAGTG 1200 
CTACCCCATT CACAGACATC AAGTCAAATA ATTGGAGTCT GGAAGATGTG ACCGCAAGTG 1260 
ATAAAAATTA TGTGGCGGGA TTAATAGACT ATGATTGGTG TGAAGATGAT TTATCAACAG 1320 
GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAGAATCA AATGGGGTGG 1380 
AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 
AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGAAGA GTACTTTATG TTTCCTGAGC 1500 
CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACG CCAAATCGTT 1560 
ACACTACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAG 1620 
TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTGAG AGGCGGTGCC 1680 
GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA GACTCCAAAA AGAAAGGGAG 1740 
CCAAAGTGTT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG CTCACCAGGA 1800 
GCAAAAGGAA GGGTTCTGCC AGAGACGGGC CCAGAAGACT AAAGCTTCAC TATAATGTGA 1860 
CTACAACTAG ATTAGTGAAT CCAGATCAAC TGTTGAATGA AATAATGTCT ATTCTTCCAA 1920 
AGAAGCATGT TGACTTTGTA CAAAAGGGTT ATACACTGAA GTGTCAAACA CAGTCAGATT 1980 
TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTGTGCCA GCTTCAAAAA CCCGATGTGG 2040 
TGGGTATCAG GAGGCAGCGG CTTAAGGGCG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG 2100 
ACATCCTATC TAGCTGCAAG GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160 
GGTGTGATAC AGCCTACATA AAGACTGTTA TGATCGCTTT GATTTTAAAG TTCATTGGAA 2220 
CTACCAACTT GTTTCTAAAG AGCTATCTTA AGACCAATAT CTCTTTGTTT TTAAACAAAA 2280 
GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340 
TAATCATGTG GTTTTGTATA TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGTG 2400 
AATGTAAGCT CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460 
TTGTGAATAT 

Seq ID NO: 653 Protein sequence 
Protein Accession #: NP_055606.1 

1 11 21 31 41 51 

| | | | | | 

MKDYDELLKY YELHETIGTG GFAKVKLACH ILTGEMVAIK IMDKNTLGSD LPRIKTEIEA 60 
LKNLRHQHIC QLYHVLETAN KIFMVLEYCP GGELFDYIIS QDRLSEEETR VVFRQIVSAV 120 
AYVHSQGYAH RDLKPENLLF DEYHKLKLID FGLCAKPKGN KDYHLQTCCG SLAYAAPELI 180 
QGKSYLGSEA DVWSMGILLY VLMCGFLPFD DDNVMALYKK IMRGKYDVPK WLSPSSILLL 240 
QQMLQVDPKK RISMKNLLNH PWIMQDYNYP VEWQSKNPFI HLDDDCVTEL SVHHRNNRQT 300 
MEDLISLWQY DHLTATYLLL LAKKARGKPV RLRLSSFSCG QASATPFTDI KSNNWSLEDV 360 
TASDKNYVAG LIDYDWCEDD LSTGAATPRT SQFTKYWTES NGVESKSLTP ALCRTPANKL 420 
KNKENVYTPK SAVKNEEYFM FPEPKTPVNK NQHKREILTT PNRYTTPSKA RNQCLKETPI 480 
KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV 540 
LTRSKRKGSA RDGPRRLKLH YNVTTTRLVN PDQLLNEIMS ILPKKHVDFV QKGYTLKCQT 600 
QSDFGKVTMQ FELEVCQLQK PDVVGIRRQR LKGDAWVYKR LVEDILSSCK V 

Seq ID NO: 654 DNA sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence: 88..990 



1 11 21 31 41 51 

| | | | | | 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 120 
GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 
CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 
CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 300 
GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 
AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 
TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 
TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 
GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 
GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 
CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720 
GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 
TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 
CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 
GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 
TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 
CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 
ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200 
CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 
TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 
ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 
TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA 



Seq ID NO: 655 Protein sequence 
Protein Accession #: NP_000573 

1 11 21 31 41 51 

| | | | | | 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 
DFPTDLPATE VFTPVVPTVD TYDGRGDSVV YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180 
SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LVVDPKSKEE DKHLKFRISH ELDSASSEVN 

Seq ID NO: 656 DNA sequence 

Nucleic Acid Accession #: NM_003108.1 

Coding sequence: 76..1401 



1 11 21 31 41 51 

| | | | | | 

GGGGTGGGAG GGGGAGGGGG [missing] GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 60 
GCCCTGCAAC GGATCATGGT [missing] GAGAGCTTGG AAGCGGAGAG CAACCTGCCC 120 
CGGGAGGCGC TGGACACGGA [missing] TTCATGGCTT GCAGCCCGGT GGCCCTGGAC 180 
GAGAGCGACC CAGACTGGTG [missing] TCGGGCCACA TCAAGCGGCC GATGAACGCG 240 
TTCATGGTAT GGTCCAAGAT [missing] AAGATCATGG AGCAGTCTCC GGACATGCAC 300 
AACGCCGAGA TCTCCAAGAG [missing] CGCTGGAAAA TGCTGAAGGA CAGCGAGAAG 360 
ATCCCGTTCA TCCGGGAGGC [missing] CGGCTCAAGC ACATGGCCGA CTACCCCGAC 420 
TACAAGTACC GGCCCCGGAA [missing] ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 480 
CAGAGCCCAG AGAAGAGCGC [missing] GGCGGCGGGA GCGCGGGCGG AGGCGCGGGC 540 
GGTGCCAAGA CCTCCAAGGG [missing] AAATGCGGCA AGCTCAAGGC CCCCGCGGCC 600 
GCGGGCGCCA AGGCGGGCGC [missing] GCCCAGTCCG GGGACTACGG GGGCGGGGGC 660 
GACGACTACG TGCTGGGCAG [missing] AGGGGCTCGG GCGGCGGCGG CGCGGGCAAG 720 
ACGGTCAAGT GCGTGTTTCT [missing] GACGACGACG ACGACGACGA CGACGAGCTG 780 
CAGCTGCAGA TCAAACAGGA [missing] GAGGACGAGG AACCACCGCA CCAGCAGCTC 840 
CTGCAGCCGC CGGGGCAGCA [missing] CTGCTGAGAC GCTACAACGT CGCCAAAGTG 900 
CCCGCCAGCC CTACGCTGAG [missing] GAGTCCCCCG AGGGAGCGAG CCTCTACGAC 960 
GAGGTGCGGG COGGCGCGAC [missing] GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 1020 
AAGAACATCA CCAAGCAGCA [missing] CTCGCGCAGC CCGCGCTGTC GCCCGCGTCC 1080 
TCGCGCTCGG TGTCCACCTC [missing] AGCAGCGGCA GCAGCAGCGG CAGCAGCGGC 1140 
GAGGACGCCG ACGACCTGAT [missing] AGCTTGAATT TCTCTCAAAG CGCGCACAGC 1200 
GCCAGCGAGC AGCAGCTGGG [missing] GCGGCCGGGA ACCTGTCCCT GTCGCTGGTG 1260 
GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCOGGAC 1320 
TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 1380 
GACCTGGTGT TCACATATTG AAAGGCGCCC GCTGCTCGCT CTTTCTCTCG GAGGGTGCAG 1440 
AGCTGGGTTC CTTGGGAGGA AGTTGTAGTG GTGATGATGA TGATGATGAT AATGATGATG 1500 
ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 1560 
ATATTGATAA GATGTCGTGA CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 1620 
TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCCCT 1680 
TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 1740 
AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 1800 
GAGGGGGCGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 1860 
GTCGGTCTTT GAAGTCTGGA AGACGTCTGC AGAGGACCCT TTTGGCAGCA CAACTGTTAC 1920 
TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 1980 
TTTTTAACAA AAAAAGGG 

Seq ID NO: 657 Protein sequence 
Protein Accession #: NP_003099.1 

1 11 21 31 41 51 

| | | | | | 

MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK RPMNAFMVWS 60 
KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 120 
RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL KAPAAAGAKA 180 
GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 240 
QEPDEEDEEP PHQQLLQPPG QQPSQLLRRY NVAKVPASPT [missing] [missing] 300 
ATSGAGGGSR LYYSFKNITK QHPPPLAQPA LSPASSRSVS [missing] [missing] 360 
LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD [missing] [missing] 420 
LSEMIAGDWL EANFSDLVFT Y 

Seq ID NO: 658 DNA sequence
Nucleic Acid Accession #: NM_001719
Coding sequence: 123..1418






GGGCGCAGCG 
CTGCCACCTG 
CGATGCACGT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTTTGGG 
CCATGTTCAT 
GCCAGGGCTT 
GCCTGCAAGA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCGGGA 
AGCACTTGGG 
AGGAGGGCT6 
GGCACAACCT 
AGTTGGOGGG 
TCTTCAAGGC 
GCCAGAACCG 
AGAACAGCAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACGCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 
CTGCCTTTTG 
AAACATGAGC 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCGC 
GGGCACATTG 
CAATAAAACG 



11 

I. 

GGGCCCGTCT 
GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCC 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGQ 
ACGCTTCGAC 
CAGGGAATCG 
QCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGACG 
CAGCGACCAG 
CTGGCAGGAC 
TGCCTTCCCT 
CCACTTCATC 
CATCTCCGTC 
GGTGGTCCGG 
GTTTTTCTGG 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 
I 

GCAGCAAGTG 
GGCCCGGAGC 
CGAGCTGCGG 
GCCCTGGCCG 
CGCAGCCAGG 
CGCCCGCGCC 
TACAACGCCA 
TACAAGGCCG 
CTCACCGACG 
TTCCACCCAC 
GAAGCTGTCA 
AATGAGACGT 
GATCTCTTCC 
GACATCACAG 
CTCTCGGTGG 
CGGCACGGGC 
CACTTCCGCA 
CCCAAGAACC 
AGGCAGGCCT 
TGGATCATCG 
CTGAACTCCT 
AACCCGGAAA 
CTCTACTTCG 
GCCTGTGGCT 
ATCCTCCATT 
CCCTCCCTAT 
TTTTGATCAQ 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 
GAAAGGAAAA 



31 

I 

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AGCGGCGGGA 
CGCACCTCCA 
TGGOGGTGGA 
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAQCA 
CGCCTGAAGG 
ACATGAACGC 
CGGTGCCCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTCGCCTTG 
CCCCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCCGGA 



Seq ID NO: 659 Protein sequence 
Protein Accession #: NP_001710



MHVRSLRAAA 
ILGLPKRPRP 
LQDSHFLTDA 
IRERFDNETF 
HNLGLOLSVE 
QNRSKTPKNQ 
GECAFPLNSY 
RMMWRACGC 



11 

! 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSFVNLV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATMHAIVQ 
H 



LPLLRSAliAD 
MFMLDLYNAM 



HLGRESDLFL 
LAGLIGRHGP 
NSSSDQRQAC 
TLVHFINPET 



31 
I 

FSLDNEVHSS 
AVEEGGGPGG 
YHHREFRFDL 
LDSRTLWASE 
QKKQPFMVAP 
KKHELYVSFR 
VPKPCCAPTQ 



41 

I 

GGGACGGCCG 
GGTAGCGCGT 
CTTCGTGGCG 
GGACAACGAG 
GATGCAGCGC 
GGGCAAGCAC 
QGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCGGATC 
CGTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
GCAGCCCTTC 
CACGGGGAGC 
GOGGATGGCC 
CGAGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCtGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAAOGCATA 
GGACTCGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



41 
I 

FIHRRLRSQE 
QGFSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
FKATEVHFRS 
DLGWQDWIIA 
LNAISVLYFD 



Seq ID NO: 660 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 211..1895



GGATCTGAGG 
GGGGCTTGGG 
GAGGAATTAT 
TGATTTTTTT 
GTGCTTTTTC 
CACAGGTTCC 
CTTGTGCTGA 
GAAGGTAATT 
AAAATATCGG 
TTCCGACACT 
GCCAATTATT 
TTCTTTGAAC 
GCTGTGGCTA 
CACATGCACT 
GTAGTCCATG 
CAAAATTCCA 
GTTGTGATGT 
TACCTGCATA 
ATCTTGATAG 
ACTCTGGCTG 
GCACCGATCT 
CTAGCTACCA 
AAACTGGCCA 



11 
I 

GGCGCCCAGT 
AGGCAGCCTG 
CTGATAAAAT 
CCCTCGAAAA 
TTTTCTCTTC 
TTGAACAGCT 
AAGCGAAAGT 
GTTTCCCTGA 
CTGTTCCATG 
GTAACCCCAA 
CAGACTGCCT 
GCCTCTATGT 
TTCTCATCAT 
TATTTGTGTC 
CTCACATAGG 
TTGAGGCAAC 
TTATTTACTT 
ATCTCATCTT 
GCTGGGGGTT 
ATGCGAGGTG 
TAGCAGCTAT 
AAATCTGGGA 
AATCGACACT 



21 

I 

CACTTCCTCC 

ctctccagtc 
tcctgggtta 
tgaccttttt 

TTTTTCTACG 
GGATTCTGAT 
ACAATGTGAA 
ATGGGATGGA 
CCCTCCTTAT 
TGGAACATGG 
TCGCTTTCTG 
AATGTATACC 
TOGTTACTTC 
TTTCATGCTG 
AGTAAAGGAG 
TTCTGTGGAC 
CCTGGCTACA 
TGTGGCTTTC 
TCCAGCAGCA 
CTGGGAACTT 
TGGGCTGAAT 
GACCAATGCA 
GGTCCTGGTC 



31 
I 

ACGTTCTCGT 
CCTATCCACC 
ATATTTTTAA 
ATGCTTCGAA 
ATAAATGAAA 
GGCACCATTA 
CTCAACATCA 
CTCATTTGTT 
ATTTATGACT 
GATTTTATGC 
CAGCCAGATA 
GTTGGCTACT 
AGACGATTGC 
AGAGCTACAA 
CTGGAGTCCC 
AAATCACAAT 
AATTATTATT 
TTTTCGGACA 
TTTGTTGCAG 
AGTGCTGGAG 
TTTATTCTGT 
GTTGGGCATG 
CTAGTCTTTG 



41 
I 

GCTGGGCGGG 
CACAGGTTTT 
AAACGGAGAG 
GCAGTTTGTC 
GCATTTCTTC 
CTATAGAGGA 
CAGCTCAACT 
GGCCCAGAGG 
TCAACCATAA 
ACAGCTTAAA 
TCAGCATAGG 
CCATCTCTTT 
ATTGCACTAG 
GCATCTTTGT 
TAATAATGCA 
ATATCGGGTG 
GGATCCTGGT 
CCAAATACCT 
CATGGGCTGT 
ACATCAAGTG 
TTCTGAATAC 
ACACAAGGAA 
GAGTGCATTA 



ASLYDEVRAG 
SGSSGEDADD 
FEFPDYCTPE 



51 
I 

CCTGCCCCCT 
AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACTCGGCAC 
GGGCCCGQCG 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
TACTACTGTG 
GCCATCGTGC 
GCGCCCACGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CCAGAGGTAA 
GGCAAGGGGT 
ATAAATGTCA 



51 
I 

RREMQREILS 
FSTQGPPLAS 
AAEFRIYKDY 
TSNHWWNPR 
IRSTGSKQRS 
PEGYAAYYCE 
DSSNVILKKY 



51 
I 

AGGAGCGGAT 
TTGGGTOGGA 
TTTTTAAAAA 
AACCAGCATA 
AAGAAAAAGG 
GCAGATTGTC 
CCAGGAGGGA 
AACAGTGGGG 
AGGAGTTGCT 
TAAAACATGG 
AAAGCAAGAA 
TGGTTCCTTG 
GAACTATATC 
CAAAGACAGA 
GGATGACCCA 
CAAGATTGCT 
GGAAGGTCTC 
GTGGGGCTTC 
GGCACGAGCA 
GATTTATCAA 
GGTTAGAGTT 
GCAATACAGG 
CATCGTGTTC 



300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 




GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA 
TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT
GTTCAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC 
ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA 
AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC 
AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC 
TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA 
CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 

Seq ID NO: 661 Protein sequence 
Protein Accession #: Eos sequence






i 
I 

MLRSSLSTSI 
LNITAQLQEG 
DPMHSLNKTW 
RRLHCTRNYI 
KSQYIGCKIA 
FVAAWAVARA 
VGHDTRKQYR 
SIIYCYCNGE 
RMVLISGKAA 
SRPMESNPDT 



H 
I 

VLFLFSSFST 
EGNCFPEWDG 
ANYSDCLRFL 
HMHLFVSFML 
WMFIYPLAT 
TLADARCVJEL 
KLAKSTLVLV 
VQAEVKKMWS 
KIASRQPDSH 
EGCQGETEDV



21 
I 

INES1SSRKR 
LICWPRGTVG 
QPDISIGKQE 
RATSIFVKDR 
NYYWILVEGL 
SAGDIKWIYQ 
LVFGVHYIVF 
RWNLSVDWKR 
ITLPGYVWSN 
L 



31 

! 

HRFLEQLDSD 
KISAVPCPPY 
FFERLYVMYT 
WHAHIGVKE 
YLHNLIFVAF 
APILAAIGLN 
VCLPHSFTGL 
TPPCGSRRCG 
SEQDCLPHSF 



TCCGCATGCA 


CTGTGAGCTC 


1440 


ACTGCTACTG 


CAATGGAGAG 


1500 


TCTCCGTGGA 


CTGGAAAAGG 


1560 


CCACCGTGAC 


GCACAGCACC 


1620 


TTATCTCTGG 


CAAAGCTGCC 


1680 


CTGGCTATGT 


CTGGAGTAAC 


1740 


CCAAGGAAGA 


TAGTGGGAGG 


1800 


TGGAATCTAA 


CCCAGACACT 


1860 


41 
■ 


51 
i 




1 

GTITIEEQIV 


1 

LVLKAKVQCE 


60 


1 X Dr NilKJGVA 


FRHCNPNGTW 


120 


VGYSISFGSL 


AVAILIIGYF 


180 


LESLIMQDDP 


QNSIEATSVD 


240 


FSDTKYLWGF 


ILIGWGFPAA 


300 


FILFLNTVRV 


LATKIWETNA 


360 


GWEIRMHCEL 


FPNSFQGFFV 


420 


SVLTTVTHST 


SSQSQVAAST 


480 


HEETKEDSGR 


QGDDILKEKP 


540 



Seq ID NO: 662 DNA sequence
Nucleic Acid Accession #: NM_005048
Coding sequence: 143..1795



GGCCGGTGGC 
TGGGCCAGCC 
TCTTCCTACA 
GCTAATGCTC 
TATAGAGGAG 
AGCTCAACTC 
GCCCAGAGGA 
CAACCATAAA 
CAGCTTAAAT 
CAGCATAGGA 
CATCTCTTTT 
TTGCACTAGG 
CATCTTTGTC 
AATAATGCAG 
TATCGGGTGC 
GATCCTGGTG 
CAAATACCTG 
ATGGGCTGTG 
CATCAAGTGG 
TCTGAATACG 
CACAAGGAAG 
AGTGCATTAC 
CCGCATGCAC 
CTGCTACTGC 
CTCCGTGGAC 
CACCGTGACG 
TATCTCTGGC 
TGGCTATGTC 
CAAGGAAGAT 
GGAATCTAAC 
CATTTGTGGC 
ATACTCCTAT 
TTTTAGGCTC 
GGAGTAGTTT 
GCTCTGTGAT 
GCTGTAGCTT 
ATTTTCCTTT 
ATTTATTtfTG 
GATCTAAGAA 
TTATAACAAT 
ACATCCCTTC 
TTCTTTGTAA 
TTGATTTTGT 
TTTGAGCTGT 
T 



11 
I 

CCGGGCCCGA 
AAGTTGGCAA 
GCCGTTCCGG 
GGCAGCTGCC 
CAGATTGTCC 
CAGGAGGGAG 
ACAGTGGGGA 
GGAGTTGCTT 
AAAACATGGG 
AAGCAAGAAT 
GGTTCCTTGG 
AACTATATCC 
AAAGACAGAG 
GATGACCCAC 
AAGATTGCTG 
GAAGGTCTCT 
TGGGGCTTCA 
GCACGAGCAA 
ATTTATCAAG 
GTTAGAGTTC 
CAATACAGGA 
ATCGTGTTCG 
TGTGAGCTCT 
AATGGAGAGG 
TGGAAAAGGA 
CACAGCACCA 
AAAGCTGCCA 
TGGAGTAACT 
AGTGGGAGGC 
CCAGACACTG 
TGACTTTCAT 
GCTTGAGTTC 
CATGAATTGG 
ATTACCTTCT 
TGTTCATTTT 
TCTCTCATAT 
TAGAAACTAG 
CCTGTGCATA 
CAAGTACTTG 
TACATGTGTT 
TTTTGAATGG 
ACCATGTCAT 
TTGTAATGTA 
TACTACATTG 



21 

1 

CCACCCCAGC 
CTTGGAAGCT 
GCATGGCCGG 
TCCTGGCCAG 
TTGTGCTGAA 
AAGGTAATTG 
AAATATCGGC 
TCCGACACTG 
CCAATTATTC 
TCTTTGAACG 
CTGTGGCTAT 
ACATGCACTT 
TAGTCCATGC 
AAAATTCCAT 
TTGTGATGTT 
ACCTGCATAA 
TCTTGATAGG 
CTCTGGCTGA 
CACCGATCTT 
TAGCTACCAA 
AACTGGCCAA 
TATGCCTGCC 
TCTTCAACTC 
TTCAGGCAGA 
CACCGCCATG 
GCAGCCAGTC 
AGATCGCCAG 
CAGAGCAGGA 
AGGGAGATGA 
AAGGATGCCA 
GGGCTGGTCC 
AAAGGCTGAA 
CTCCTGTAAA 
ATTGGCATCA 
TTTCTGCTAC 
ATATCACCCT 
TATTCTCTTA 
GGAGCAATTA 
CTGGAAAATT 
TTTGGGAACA 
CCTCTTTGTG 
GTGGAAAGAT 
TTTTGATAGC 
TACATGGCAT 



31 
I 

TGCGCGTCGT 
TCTCCCGGGC 
GCTGGGGGCG 
AGCCCAGCTG 
AGCGAAAGTA
TTTCCCTGAA 
TGTTCCATGC 
TAACCCCAAT 
AGACTGCCTT
CCTCTATGTA 
TCTCATCATT 
ATTTGTGTCT 
TCACATAGGA 
TGAGGCAACT 
TATTTACTTC 
TCTCATCTTT 
CTGGGGGTTT 
TGCGAGGTGC 
AGCAGCTATT 
AATCTGGGAG 
ATCGACACTG
TCACTCCTTC 
CTTTCAGGGT 
GGTGAAGAAG 
TGGCAGCCGC 
ACAGGTGGCG 
CAGACAGCCT 
CTGCCTGCCA 
TATTCTAATG 
AGGAGAAACT 
AATGGCTGGT 
AATTCAGTTA 
TACTAACGAC 
AGTTTTCCTC 
TTTTGGGTAG 
AAATATAATG 
TTTCTTACTT 
GGATCTAAAA 
AGTTGGCTGG 
AGGAAAATTT 
ACCAGCCAGA 
TTCCTCAGTT 
AAATCATGCT 
GTGGGATCAA 



41 
I 

TACTGGCCAC 
TCTGGAGGAG 
TCGCTCCACG 
GATTCTGATG 
CAATGTGAAC 
TGGGATGGAC 
CCTCCTTATA 
GGAACATGGG 
CGCTTTCTGC 
ATGTATACCG 
GGTTACTTCA 
TTCATGCTGA 
GTAAAGGAGC 
TCTGTGGACA 
CTGGCTACAA 
GTGGCTTTCT 
CCAGCAGCAT 
TGGGAACTTA 
GGGCTGAATT 
ACCAATGCAG 
GTCCTGGTCC 
ACTGGGCTCG 
TTCTTTGTGT 
ATGTGGAGTC 
AGATGCGGCT 
GCCAGCACAC 
GACAGCCACA 
CACTCTTTCC 
GAGAAGCCTT 
GAGGATGTTC 
TGTGTGAGAG 
AGGTGTTACT 
ATGAAAATGC 
TAAATTAATG 
AAAAAAGATT 
AAGATCTTTT 
TAATGTACTT 
AAATATATGG 
ACATTGATAA 
CTCAAAAAAG 
CCTCAGGTCT 
AGTGAGCTTG 
GCATCTATAT 
TTAAAAATTT 



51 
I 

AAGTTTGCTC 
GGTCCCTGCT 
TCTGGGGTTG 
GCACCATTAC 
TCAACATCAC 
TCATTTGTTG 
TTTATGACTT 
ATTTTATGCA 
AGCCAGATAT 
TTGGCTACTC 
GACGATTGCA 
GAGCTACAAG 
TGGAGTCCCT 
AATCACAATA 
ATTATTATTG 
TTTCGGACAC 
TTGTTGCAGC 
GTGCTGGAGA 
TTATTCTGTT 
TTGGGCATGA 
TAGTCTTTGG
GGTGGGAGAT 
CTATCATCTA 
GGTGGAATCT 
CAGTGCTCAC 
GCATGGTGCT 
TCACTTTACC 
ACGAGGAGAC 
CCAGGCCTAT 
TCTGAATGGA 
GGCTTGGCTG 
TAATAATAGT 
AAGTGTCAAT 
TATGGTATTT 
CAATTGCTTG 
AGTGTGTATC 
CTATCACTGC 
GAAGATAAAA 
AATAATGCAT 
AATATTTCAC 
TCACTCTTTC 
TGTCTGCAAA 
CTTTTTCTTG 
GTTTTAAAAA 



60 
120 
180 
240 
300 
360 
420 
480
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



Seq ID NO: 663 Protein sequence
Protein Accession #: NP_005039

1 11 21 31 41 51

I I I I I I

MAGLGASLHV WGWLMLGSCL LARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 60
GNCFPEWDGL ICWPRGTVGK ISAVPCPPYI YDFNHKGVAF RHCNPNGTWD FMHSLNKTWA 120
NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISFGSLA VAILIIGYFR RLHCTRNYIH 180
MHLFVSFMLR ATSIFVKDRV VHAHIGVKEL ESLIMQDDPQ NSIEATSVDK SQYIGCKIAV 240
VMFIYFLATN YYWILVEGLY LHNLIFVAFF SDTKYLWGFI LIGWGFPAAF VAAWAVARAT 300
LADARCWELS AGDIKWIYQA PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 360
LAKSTLVLVL VFGVHYIVFV CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 420
QAEVKKMWSR WNLSVDWKRT PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 480
IASRQPDSHI TLPGYVWSNS EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE 540
GCQGETEDVL



Seq ID NO: 664 DNA sequence
Nucleic Acid Accession #: NM_012152
Coding sequence: 43..1104



1 
I 

CTTCTTTAAA 
GACAAGCACA 
GGAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 
GGGCTTCTGG 
AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT 
TCCATCAGCC 
GCGTTTGTGG 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTOCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



11 
1 

TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT 
ACAGTAGCTT 
CAATCATGAG 
TGCTTGTCTG 
TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



21 
I 

GGATGTTCAC 
TTATAATAGG 
TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAAG 
CCCGGGCCTG 
TGTGAAAAGG 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



31 
I 

TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAAAACAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 

ATGGCCTTCC 
ACCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTOCTGC 
GACGAGGACA 
GAGAGGCGTC 
TACATAGAGG 
GATGCCTCTC 



41 
I 

CAATGAATGA 
ATACTGTCGA 
TCTGCCTGTT 
AATTTCATTT 
GAATTGCCTA 
ACCGCTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT 
TGTCTCCGCA 
CGGTGATGAC 
TCCTCGACGG 
TGCTGGCGCT. 
TGTATGGCAC 
CCTCTCGCAT 
ATAGTATTAG 
GGCCCACCCA 



51 
I 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTOCTG 
TCTCCGTCAG 
CGCCGTGGAG 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC 
TACAAGTGGG 
TGrCTTAGGG 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



60 
120 
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 665 Protein sequence 
Protein Accession #: NP_036284 

1 11 21 31 41 51 

I I I I 1 I 

MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 
FHFPFYYLIiA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 
LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL CNISACSSIA 
PIYSRSYLVF WTVSNLMAFL IMWVYLRIY VYVKRKTOVL SPHTSGSISR RRTPMKLMKT 
VMTVLGAFW CWTPGLWLL LDGLNCRQCG VQHVKRWFLL LALLNSWNP I IYSYKDEDM 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 

Seq ID NO: 666 DNA sequence 
Nucleic Acid Accession #: NM_002821
Coding sequence: 150..3362



1 

I 

AACTCCCGCC 
GCGCTCCGGT 
CCTCAGCTCC 
CCCGCCGGTT 
CCATTGTCTT 
TTCGCTGTGA 
CTGTCCAGGA 
ACCGGCTGCA 
AAGCCCGCAG 
TGAAGCATCC 
ACATTGATGG 
ATGGTCAGAG 
GTCCTGAGCA 
GCAGCCAGAA 
CCCAGGACGT 
AGCCACCCCC 
GCCCCCCACA 
TCCGGCCACG 
TCATCCTGGA 
GGGTGTTTAC 
AGCCCAGCGT 
AGAAGGGCCA 
GCCACGOGGC 
TGCCCTCCTG 
TGGATTGCCT 
TCATCTCAGA 
TGGAGGTGTA 
AGGCGCAAGC 
AGCAGTGCAT 
AGCCCACTAT 
ACGCTGGGAC 
TTGCCTCCAA 
TTATCACCTT 
TGCAGTGCGA 
TCCTGGACCC 



11 
I 

TCGGGACGCC 
GOGTCCGCCT 
TTTTCCTGAG 
GCCTCTGCTC 
CATCAAGCAG 
GGTTGAGGCT 
CACGGAGCGG 
GGACTCTGGC 
TGCCAACGCC 
AGCCTCGGAA 
GCACCCTOGG 
CAACCACACA 
TAGTGGGCTG 
CTTCACCTTG 
GGTAGTAGCG 
GAGCCTGCAG 
CCTCCGCAGA 
CAATGCAGGG 
AGCCACACTT 
AGCTGGCAGC 
GTGGTGGGAG 
CGAGCTGGTG 
CAACCTGGCT 
GCTGAAGAAG 
GACCCAGGCC 
GGACTCACGG 
TGATGGGACA 
CCGTGTCCAA 
GGAGTTTGAC 
TAAGTGGGAA 
CCTGCATTTT 
CGGGCCGCAG 
CAAAGTGGAA 
GGCCCAGGGG 
CACCAAGCTG 



21 
t 

TCGGGGTCOG 
CCTGTGCCCG 
CCCGCCGCGA 
AGCGTCCTGC 
CCGTCCTCCC 
CCGGGCCCGG 
CGTTTCGCCC 
ACCTTCCAGT 
TCCTTCAACA 
GCTGAGATCC 
CCCACCTACC 
GTCAGCAGCA 
TATTCCTGCT 
AGCATTGCTG 
AGGTATGAGG 
TGGCTCTTTG 
GCCACAGTGT 
ATCTACCGCT 
CACCTAGCAG 
GAGGAGCGTG 
CACGCGGGAG 
TTGGCCAATA 
GGTCAGCGGA 
CCCCAAGACA 
ACACCAAAAC 
TTCGAGGTCT 
TGGTACCGTT 
GTGCTGGAAA 
AAGGAGGCCA 
CGGGCAGATG 
GCCCGGGTGA 
GGCCAGATTC 
CCAGAGCG7A 
GACCCCAAGC 
GGACCCAGGA 



31 
I 

GCTCCGGCTG 
CCGOGGAGCA 
TGGGAGCTGC 
TGCTGCCGCT 
AGGATGCACT 
TACATGTGTA 
AGGGCAGCAG 
GTGTGGCTCG 
TCAAATGGAT 
AGCCACAGAC 
AATGGTTCCG 
AGGAGCGGAA 
GCGCCCACAG 
ATGAAAGCTT 
AGGCCATGTT 
AGGATGAGAC 
TTGCCAACGG 
GCATTGGCCA 
AGATTGAAGA 
TGACCTGCCT 
TCCGGCTGCC 
TTGCTGAAAG 
GACAGGATGT 
GCCAGCTGGA 
CTACAGTTGT 
TCAAGAATGG 
GTATGAGCAG 
AGCTCAAGTT 
CGGTGCCCTG 
GGAGCAGCCT 
CTCGAGATGA 
GTGCCCATGT 
CGACTGTGTA 
CGCTGATTCA 
TGCACATCTT 



41 

1 

CGGCTGCTGC 
GTCTGCGGCC 
GCGGGGATCC 
GCTGGGCGGT 
GCAGGGGCGC 
CTGGCTGCTC 
CCTGAGCTTT 
GGATGATGTC 
TGAGGCAGGT 
CCAGGTCACA 
AGATGGGACC 
CCTGACGCTC 
TGCTTTTGGC 
TGCCAGGGTG 
CCATTGCCAG 
TCCCATCACT 
GTCTCTGCTG 
GGGGCAGAGG 
CATGCCGCTA 
TCCCCCCAAG 
CACCCATGGC 
TGATGCTGGT 
CAACATCACT 
GGAGGGCAAA 
CTGGTACAGA 
GACCTTGCGC 
CACCCCAGCC 
CACACCACCA 
TTCAGCCACA 
CCCAGAGTGG 
CGCTGGGAAC 
CCAGCTCACT 
CCAGGGCCAC 
GTGGAAAGGC 
CCAGAATGGC 



51 
I 

TGCGGCGCCC 
CGCCGTGCGC 
CCGGCCAGAC 
ACCCAGACAG 
CGGGOGCTGC 
GATGGGGCCC 
GCAGCTGTGG 
ACTGGAGAAG 
CCTGTGGTCC 
CTTCGTTGCC 
CCCCTTTCTG 
CGGCCAGCTG 
CAGGCTTGCA 
GTGCTGGCAC 
TTCTCAGCCC 
AACCGCAGTC 
CTGACCCAGG 
GGCCCACCCA 
TTTGAGCCAC 
GGTCTGCCAG 
AGGGTCTACC 
GTCTACACCT 
GTGGCCACTG 
CCCGGCTACT 
AACCAGATGC 
ATCAACAGCG 
GGCAGCATCG 
CCCCAGCCAC 
GGCCGAGAGA 
GTGACAGACA 
TACACTTGCA 
GTGGCAGTTT 
ACAGCCCTAC 
AAGGACCGCA 
TCCCTGGTGA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
540
600
660
900
1440
1560
1620
1680
1740
1800
1860
1920 
1980 
2040 
2100 




TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 

AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2520

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2640 

GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2940

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 3780 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGOGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 
TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 

Seq ID NO: 667 Protein sequence
Protein Accession #: NP_002812

1 11 21 31 41 51 

I I I I I I 

MGAARGSPAR PRRLPLLSVL LLPLLGGTQT AIVFIKQPSS QDALQGRRAL LRCEVEAPGP 60

VHVYWLLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARSANASFN 120

IKWIEAGPVV LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DESFARVVLA PQDVVVARYE 240

EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300

CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 360

VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 420

SQLEEGKPGY LDCLTQATPK PTVVWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTWYR 480

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 540

GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 600

TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 660

RYTCIAGNSC NIKHTEAPLY VVDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 720

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SLGSGPAATN 780

KRHSTSDKMH FPRSSLQPIT TLGKSEFGEV FLAKAQGLEE GVAETLVLVK SLQTKDEQQQ 840

LDFRRELEMF GKLNHANVVR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960

YHFRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEMPHGGQ ADDEVLADLQ 1020
AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSKP

Seq ID NO: 668 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1389

1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGATTTAGA TGACAGAGAA 60 

ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120 

GTTGTCAACT CGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180

GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACGGA CTTTTCCCTT 240

GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300 

AAAACTTTCG GCTTTCCAGG GTATCTGCTC CTCTCTGTTC TTCAGTTTTT GTATCCTTTT 360 

ATAGCAATGA TAAGTTACAA TATAATAGCT GGAGATACTT TGAGCAAAGT TTTTCAAAGA 420 

ATCCCAGGAG TTGATCCTGA AAACGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTCC 480 

ACAGTTACCT TTACTCTGCC TTTATCCTTG TACCGAAATA TAGCAAAGCT TGGAAAGGTC 540 

TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTGGAA TTGTAATGGC AAGGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG TATTTGCAAA GCCCAATGCC 660 

ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 720 

TACAGTTCTC TAGAAGAACC CACAGTAGCT AAGTGGTCCC GCCTTATCCA TATGTCCATC 780

GTGATTTCTG TATTTATCTG TATATTCTTT GCTACATGTG GATACTTGAC ATTTACTGGC 840

TTCACCCAAG GGGACTTATT TGAAAATTAC TGCAGAAATG ATGACCTGGT AACATTTGGA 900

AGATTTTGTT ATGGTGTCAC TGTCATTTTG ACATACCCTA TGGAATGCTT TGTGACAAGA 960

GAGGTAATTG CCAATGTGTT TTTTGGTGGG AATCTTTCAT CGGTTTTCCA CATTGTTGTA 1020 

ACAGTGATGG TCATCACTGT AGCCACGCTT GTGTCATTGC TGATTGATTG CCTCGGGATA 1080 

GTTCTAGAAC TCAATGGTGT GCTCTGTGCA ACTCCCCTCA TTTTTATCAT TCCATCAGCC 1140 

TGTTATCTGA AACTGTCTGA AGAACCAAGG ACACACTCCG ATAAGATTAT GTCTTGTGTC 1200 

ATGCTTCCCA TTGGTGCTGT GGTGATGGTT TTTGGATTCG TCATGGCTAT TACAAATACT 1260 
CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 1320
AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 1380
TTTCAATGA

Seq ID NO: 669 Protein sequence
Protein Accession #: Eos sequence






i 
I 

MGYQRQEPVI 
GFPLGILLLF 
IAMISYNIIA 
SltlSTGLTTL 
YSSLEEPTVA 
RPCYGVTVrL 
VLELNGVLCA 
QDCTHGQEMF 



11 
I 

PPQRDWDRE 
WVSYVTDFSL 
GDTLSKVFQR 
ILGIVKARAI 
KWSRLIHMSI 
TYPMECFVTR 
TPLIPIIPSA 
YCFPDNFSLT 



21 

I 

TLVSEHEYKE 
VLLIKGGALS 
IPGVDPENVF 
SLGPHIPKTE 
VISVFICIFF 
EVIANVFFGG 
CYLKLSEEPR 
NTSESHVQQT 



31 
I 

KTCQSAALFN 
GTDTYQSLVN 
IGRHFIIGXiS 
DAHVFAKPNA 
ATCGYLTFTG 
NLSSVFHIW 
THSDKIMSCV 
TQLSTLNISI 



Seq ID NO: 670 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1284 



ATGGGCTACC 
AAGCAAGCTG 
TTTTCCCTTG 
TTGGTCAATA 
TATCCTTTTA 
TTTCAAAGAA 
GGACTTTCCA 
GGAAAGGTCT 
AGGGCAATTT 
CCCAATGCCA 
TTCTTAGTTT 
ATGTCCATCG 
TTTACTGGCT 
ACATTTGGAA 
GTGACAAGAG 
ATTGTTGTAA 
CTCGGGATAG 
CCATCAGCCT 
TCTTGTGTCA 
ACAAATACTC 
TCTCTCACAA 
ATTAGTATCT 



11 
I 

AGAGGCAGGA 
GGTTTCCTTT 
TTTTATTGAT 
AAACTTTCGG 
TAGCAATGAT 
TCCCAGQAGT 
CAGTTACCTT 
CCCTCATCTC 
CACTGGGTCC 
TTCAAGCGGT 
ACAGTTCTCT 
TGATTTCTGT 
TCACCCAAGG 
GATTTTGTTA 
AGGTAATTGC 
CAGTGATGGT 
TTCTAGAACT 
GTTATCTGAA 
TGCTTCCCAT 
AAGACTGCAC 
ATACCTCAGA 
TTCAACTCGA 



21 
I 

GCCTGTCATC 
GGGAATATTG 
AAAAGGAGGG 
CTTTCCAGGG 
AAGTTACAAT 
TGATCCTGAA 
TACTCTGCCT 
TACAGGTTTA 
ACACATACCA 
CGGGGTTATG 
AGAAGAACCC 
ATTTATCTGT 
GGACTTATTT 
TGGTGTCACT 
CAATGTGTTT 
CATCACTGTA 
CAATGGTGTG 
ACTGTCTGAA 
TGGTGCTGTG 
CCATGGGCAG 
GTCTCATGTT 
GTAA 



31 
I 

CCGCCGCAGA 
CTTTTATTCT 
GCCCTCTCTG 
TATCTGCTCC 
ATAATAGCTG 
AACGTGTTTA 
TTATCCTTGT 
ACAACTCTGA 
AAAACAGAAG 
TCTTTTGCAT 
ACAGTAGCTA 
ATATTCTTTG 
GAAAATTACT 
GTCATTTTGA 
TTTGGTGGGA 
GCCACGCTTG 
CTCTGTGCAA 
GAACCAAGGA 
GTGATGGTTT 
GAAATGTTCT 
CAGCAGACAA 



41 
I 

WNSIIGSGI 
KTFGFPGYLL 
TVTFTLPLSL 
IQAVGVMSFA 
FTQGDLFENY 
TVMVITVATL 
MLPIGAWMV 
FQ 



41 

1 

GAGGATTGCC 
GGGTTTCATA 
GAACAGATAC 
TCTCTGTTCT 
GAGATACTTT 
TTGGTCGCCA 
ACCGAAATAT 
TTCTTGGAAT 
ACGCTTGGGT 
TTATTTGCCA 
AGTGGTCCCG 
CTACATGTGG 
GCAGAAATGA 
CATACCCTAT 
ATCTTTCATC 
TGTCATTGCT 
CTCCCCTCAT 
CACACTCCGA 
TTGGATTCGT 
ACTGCTTTCC 
CACAACTTTC 



51 
I 

IGLPYSMKQA 
LSVLQPLYPF 
YRNIAKLGKV 
FICHHNSFLV 
CRNDDLVTFG 
VSLLIDCLGI 
FGFVMAITNT 



51 
I 

TTATTCAATG 
TGTTACAGAC 
CTACCAGTCT 
TCAGTTTTTG 
GAGCAAAGTT 
CTTCATTATT 
AGCAAAGCTT 
TGTAATGGCA 
ATTTGCAAAG 
CCATAACTCC 
CCTTATCCAT 
ATACTTGACA 
TGACCTGGTA 
GGAATGCTTT 
GGTTTTCCAC 
GATTGATTGC 
TTTTATCATT 
TAAGATTATG 
CATGGCTATT 
TGACAATTTC 
TACTTTAAAT 



Seq ID NO: 671 Protein sequence 
Protein Accession #: Eos sequence 



MGYQRQEPVI 
LVNKTFGFPG 
GLSTVTFTLP 
PNAIQAVGVM 
FTGFTQGDLF 
IWTVMVITV 
SCVMLPIGAV 
ISIFQLE 



11 
1 

PPQRGLPYSM 
YLLLSVLQPL 
LSLYRNIAKL 
SFAFICHHNS 
ENYCRNDDLV 
ATLVSLLIDC 
VMVFGFVMAI 



21 
I 

KQAGFPLGIL 
YPFIAMISYN 
GKVSLISTGL 
FLVYSSLEEP 
TFGRFCYGVT 
LGIVLELNGV 
TUTQDCTHGQ 



31 
I 

LLFWVSYVTD 
IIAGDTLSKV 
TTLILGIVMA 
TVAKWSRLIH 
VILTYPMECF 
LCATPLIFII 
EMFYCFPDNF 



Seq ID NO: 672 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1203 



1 
I 

ATGGGCTACC 
AAAGGAGGGG 
TTTCCAGGGT 
AGTTACAATA 
GATCCTGAAA 
ACTCTGCCTT 
ACAGGTTTAA 
CACATACCAA 
GGGGTTATGT 
GAAGAACCCA 
TTTATCTGTA 
GACTTATTTG 
GGTGTCACTG 
AATGTGTTTT 
ATCACTGTAG 
AATGGTGTGC 
CTGTCTGAAG 
GGTGCTGTGG 
CATGGGCAGG 
TCTCATGTTC 



11 
I 

AGAGGCAGGA 
CCCTCTCTGG 
ATCTGCTCCT 
TAATAGCTGG 
ACGTGTTTAT 
TATCCTTGTA 
CAACTCTGAT 
AAACAGAAGA 
CTTTTGCATT 
CAGTAGCTAA 
TATTCTTTGC 
AAAATTACTG 
TCATTTTGAC 
TTGGTGGGAA 
CCACGCTTGT 
TCTGTGCAAC 
AACCAAGGAC 
TGATGGTTTT 
AAATGTTCTA 
AGCAGACAAC 



21 
I 

GCCTGTCATC 
AACAGATACC 
CTCTGTTCTT 
AGATACTTTG 
TGGTCGCCAC 
CCGAAATATA 
TCTTGGAATT 
CGCTTGGGTA 
TATTTGCCAC 
GTGGTCCCGC 
TACATGTGGA 
CAGAAATGAT 
ATACCCTATG 
TCTTTCATCG 
GTCATTGCTG 
TCCCCTCATT 
ACACTCCGAT 
TGGATTCGTC 
CTGCTTTCCT 
ACAACTTTCT 



31 
I 

CCGCCGCAGT 
TACCAGTCTT 
CAGTTTTTGT 
AGCAAAGTTT 
TTCATTATTG 
GCAAAGCTTG 
GTAATGGCAA 
TTTGCAAAGC 
CATAACTCCT 
CTTATCCATA 
TACTTGACAT 
GACCTGGTAA 
GAATGCTTTG 
GTTTTCCACA 
ATTGATTGCC 
TTTATCATTC 
AAGATTATGT 
ATGGCTATTA 
GACAATTTCT 
ACTTTAAATA 



41 

I 

FSLVLLIKGG 
FQRIPGVDPE 
RAISLGPHIP 
MSIVISVFIC 
VTRBVIANVF 
PSACYLKLSE 
SLTNTSESHV 



41 
I 

TTTCCCTTGT 
TGGTCAATAA 
ATCCTTTTAT 
TTCAAAGAAT 
GACTTTCCAC 
GAAAGGTCTC 
GGGCAATTTC 
CCAATGCCAT 
TCTTAGTTTA 
TGTCCATCGT 
TTACTGGCTT 
CATTTGGAAG 
TGACAAOAGA 
TTGTTGTAAC 
TCGGGATAGT 
CATCAGCCTG 
CTTGTGTCAT 
CAAATACTCA 
CTCTCACAAA 
TTAGTATCTT 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



51 
I 

ALSGTDTYQS 
NVFIGRHFII 
KTEDAWVFAK 
IFFATOGYLT 
FGGNLSSVFH 
EPRTHSDKIM 
QQTTQLSTLN 



51 
I 

TTTATTGATA 
AACTTTCGGC 
AGCAATGATA 
CCCAGGAGTT 
AGTTACCTTT 
CCTCATCTCT 
ACTGGGTCCA 
TCAAGCGGTC 
CAGTTCTCTA 
GATTTCTGTA 
CACCCAAGGG 
ATTTTGTTAT 
GGTAATTGCC 
AGTGATGGTC 
TCTAGAACTC 
TTATCTGAAA 
GCTTCCCATT 
AGACTGCACC 
TACCTCAGAG 
TCAACTCGAG 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 




TAA 

Seq ID NO: 673 Protein sequence
Protein Accession #: Eos sequence 






MGYQRQEPVI 
SYNIIAGDTL 
TGLTTLILGI 
EEPTVAKWSR 
GVTVILTYPM 
NGVLCATPLI 
HGQEMPYCFP 



11 

I 

PPQFSLVLLI 
SKVFQRIPGV 
VMARAISLGP 
LIHMSIVISV 
ECFVTREVIA 
FIIPSACYLK 
DKFSLTNTSE 



21 
I 

KGGALSGTDT 
DPENVFIGRH 
HIPKTEDAWV 
PICIPFATCG 
NVFFGGNLSS 
LSEEPRTHSD 
SHVQQTTQLS 



31 
I 

YQSLVNRTFG 
FIIGLSTVTP 
FAKPNAIQAV 
YLTFTGPTQG 
VFHIWTVMV 
KIMSCVMLPI 
TLKISIFQLE 



41 

I 

FPGYLLIiSVL 
TLPLSLYRNI 
GVMSFAFICH 
DLFENYCRND 
ITVATLVSLL 
GAWMVFGPV 



51 
I 

QPLYPFIAMI 
AKLGKVSLIS 
HNSFLVYSSL 
DLVTFGRFCY 
IDCLGIVLEL 
MAITNTQDCT 



Seq ID NO: 674 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 1..1140 



ATGGGCTACC 
CCAGGGTATC 
TACAATATAA 
CCTGAAAACG 
CTGCCTTTAT 
GGTTTAACAA 
ATACCAAAAA 
GTTATGTCTT 
GAACCCACAG 
ATCTGTATAT 
TTATTTGAAA 
GTCACTGTCA 
GTGTTTTTTG 
ACTGTAGCCA 
GGTGTGCTCT 
TCTGAAGAAC 
GCTGTGGTGA 
GGGCAGGAAA 
CATGTTCAGC 



11 
I 

AGAGGCAGGA 
TGCTCCTCTC 
TAGCTGGAGA 
TGTTTATTGG 
CCTTGTACCG 
CTCTGATTCT 
CAGAAGACGC 
TTGCATTTAT 
TAGCTAAGTG 
TCTTTGCTAC 
ATTACTGCAG 
TTTTGACATA 
GTGGGAATCT 
CGCTTGTGTC 
GTGCAACTCC 
CAAGGACACA 
TGGTTTTTGG 
TGTTCTACTG 
AGACAACACA 



21 
I 

GCCTGTCATC 
TGTTCTTCAG 
TACTTTGAGC 
TCGCCACTTC 
AAATATAGCA 
TGGAATTGTA 
TTGGGTATTT 
TTGCCACCAT 
GTCCCGCCTT 
ATGTGGATAC 
AAATGATGAC 
CCCTATGGAA 
TTCATCGGTT 
ATTGCTGATT 
CCTCATTTTT 
CTCCGATAAG 
ATTCGTCATG 
CTTTCCTGAC 
ACTTTCTACT 



31 
I 

CCGCCGCAGG 
TTTTTGTATC 
AAAGTTTTTC 
ATTATTGGAC 
AAGCTTGGAA 
ATGGCAAGGG 
GCAAAGCCCA 
AACTCCTTCT 
ATCCATATGT 
TTGACATTTA 
CTGGTAACAT 
TGCTTTGTGA 
TTCCACATTG 
GATTGCCTCG 
ATCATTCCAT 
ATTATGTCTT 
GCTATTACAA 
AATTTCTCTC 
TTAAATATTA 



41 

I 

TCAATAAAAC 
CTTTTATAGC 
AAAGAATCCC 
TTTCCACAGT 
AGGTCTCCCT 
CAATTTCACT 
ATGCCATTCA 
TAGTTTACAG 
CCATCGTGAT 
CTGGCTTCAC 
TTGGAAGATT 
CAAGAGAGGT 
TTGTAACAGT 
GGATAGTTCT 
CAGCCTGTTA 
GTGTCATGCT 
ATACTCAAGA 
TCACAAATAC 
GTATCTTTCA 



Seq ID NO: 675 Protein sequence
Protein Accession #: Eos sequence



I 

MGYQRQEPVI 
PENVFIGRHF 
IPKTEDAWVF 
ICIFFATCGY 
VFFGGNLSSV 



HVQQTTQLST 



11 
I 

PPQVNKTFGF 
IIGLSTVTFT 
AKPNAIQAVG 
LTFTGFTQGD 
FHIWTVMVI 
IMSCVMLPIG 
LNISIFQLE 



21 
I 

PGYLLLSVLQ 
LPLSLYRNIA 
VMSFAFICHH 
LFENYCRNDD 
TVATLVSLLI 
AWMVFGFVM 



31 

! 

FLYPFIAMIS 
KLGKVSLIST 
NSFLVYSSLE 
LVTFGRFCYG 
DCLGIVLELN 
AITNTQDCTH 



41 

YNIIAGDTLS 
GLTTLILGIV 
EPTVAKWSRL 
VTVILTYPME 
GVLCATPLIF 
GQEMPYCFPD 



51 
I 

TTTCGGCTTT 
AATGATAAGT 
AGGAGTTGAT 
TACCTTTACT 
CATCTCTACA 
GGGTCCACAC 
AGCGGTCGGG 
TTCTCTAGAA 
TTCTGTATTT 
CCAAGGGGAC 
TTGTTATGGT 
AATTGCCAAT 
GATGGTCATC 
AGAACTCAAT 
TCTGAAACTG 
TCCCATTGGT 
CTGCACCCAT 
CTCAGAGTCT 
ACTCGAGTAA 



51 
I 

KVFQRIPGVD 
MARAISLGPH 
IHMSIVISVF 
CFVTREVIAN 
IIPSACYLKL 
NFSLTNTSES 



Seq ID NO: 676 DNA sequence 

Nucleic Acid Accession #: NM_006853.1 

Coding sequence: 26..874



AGGAATCTGC 
ATCGGGCAGA 
CATGAGGATT 
CAGGATCATC 
CGAGAAGACG 
AGCCCACTGC 
GGAGGGCTGT 
CAGCCTCCCC 
CTCCATCACC 
CAGCTGCCTC 
CTTGCGATGC
CAACATCACA 
GGGTGACTCC 
CCAGGATCCG 
GGACTGGATC 
AOCCTCCATT 
CAAGACCCTC 
AATCAACCTG 
GACTCTGGGA 
TCCTGGCCAT 



11 

I 

GCTCGGGTTC 
GGTCTCACAG 
CTGCAGTTAA 
AAGGGGTTCG 
CGGCTACTCT
CTCAAGCCCC 
GAGCAGACCC 
AACAAAGACC 
TGGGCTGTGC 
ATTTCCGGCT 
GCCAACATCA 
GACACCATGG 
GGGGGCCCTC 
TGTGCGATCA 
CAGGAGACGA 
TCCACTTGGT 
TACGAACATT 
GGGTTCGAAA 
ATGACAACAC 
ATATCAAGGT 



21 
I 

CGCAGATGCA 
CAGCCAAGGA 
TCCTGCTTGC 
AGTGCAAGCC 
GTGGGGCGAC 
GCTACATAGT 
GGACAGCCAC 
ACCGCAATGA 
GACCCCTCAC 
GGGGCAGCAC 
CCATCATTGA 
TGTGTGCCAG 
TGGTCTGTAA 
CCCGAAAGCC 
TGAAGAACAA 
GTTTGGTTCC 
CTTTGGGCCT 
TCAGTGAGAC 
CTGGTTTGTT 
TTCAATAAAT 



31 
I 

GAGGTTGAGG 
ACCTGGGGCC 
TCTGGCAACA 
TCACTCCCAG 
GCTCATCGCC 
TCACCTGGGG 
TGAGTCCTTC 
CATCATGCTG 
CCTCTCCTCA 
GTCCAGCCCC 
GCACCAGAAG 
CGTGCAGGAA 
CCAGTCTCTT 
TGGTGTCTAC 
TTAGACTGGA 
TGTTCACTCT 
CCTGGACTAC 
CTGGATTCAA 
CTCTGTTGTA 
ATTTGCTAAA 



41 
I 

TGGCTGCGGG 
CGCTCCTCCC 
GGGCTTGTAG 
CCCTGGCAGG 
CCCAGATGGC 
CAGCACAACC 
CCCCACCCCG 
GTGAAGATGG 
CGCTGTGTCA 
CAGTTACGCC 
TGTGAGAACG 
GGGGGCAAGG 
CAAGGCATTA 
ACGAAAGTCT 
CCCACCCACC 
GTTAATAAGA 
AGGAGATGCT 
ATTCTGCCTT 
TCCCCAGCCC 
TGAGTG 



51 

! 

ACTGGAAGTC 
CCCTCCAGGC 
GGGGAGAGAC 
CAGCCCTGTT 
TCCTGACAGC 
TCCAGAAGGA 
GCTTCAACAA 
CATCGCCAGT 
CTGCTGGCAC 
TGCCTCACAC 
CCTACCCCGG 
ACTCCTGCCA 
TCTCCTGGGG 
GCAAATATGT 
ACAGCCCATC 
AACCCTAAGC 
GTCACTTAAT 
GAAATATTGT 
CAAAGACAGC 



Seq ID NO: 677 Protein sequence
Protein Accession #: NP_006844.1

1 11 21 31 41 51
| | | | | |
MRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT LIAPRWLLTA 60

WO 02/086443 

AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 120
SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 180
NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 240
DWIQETMKNN

Seq ID NO: 678 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..933 



PCT/US02/12476



1 11 21 31 41 51
| | | | | |
ATGTGCAGCA ATGGACGGTG CATCCCGGGC GCCTGGCAGT GTGACGGGCT GCCTGACTGC 60
TTCGACAAGA GTGATGAGAA GGAGTGCCCC AAGGCTAAGT CGAAATGTGG CCCGACCTTC 120
TTCCCCTGTG CCAGCGGCAT CCATTGCATC ATTGGTCGCT TCCGGTGCAA TGGGTTTGAG 180
GACTGTCCCG ATGGCAGCGA TGAAGAGAAC TGCACAGCAA ACCCTCTGCT TTGCTCCACC 240
GCCCGCTACC ACTGCAAGAA CGGCCTCTGT ATTGACAAGA GCTTCATCTG CGATGGACAG 300
AATAACTGTC AAGACAACAG TGATGAGGAA AGCTGTGAAA GTTCTCAAGA ACCCGGCAGT 360
GGGCAGGTGT TTGTGACTTC AGAGAACCAA CTTGTGTATT ACCCCAGCAT CACCTATGCC 420
ATCATCGGCA GCTCCGTCAT TTTTGTGCTG GTGGTGGCCC TGCTGGCACT GGTCTTGCAC 480
CACCAGCGGA AGCGGAACAA CCTCATGACG CTGCCCGTGC ACCGGCTGCA GCACCCTGTG 540
CTGCTGTCCC GCCTGGTGGT CCTGGACCAC CCCCACCACT GCAACGTCAC CTACAACGTC 600
AATAATGGCA TCCAGTATGT GGCCAGCCAG GCGGAGCAGA ATGCGTCGGA AGTAGGCTCC 660
CCACCCTCCT ACTCCGAGGC CTTGCTGGAC CAGAGGCCTG CGTGGTATGA CCTTCCTCCA 720
CCGCCCTACT CTTCTGACAC GGAATCTCTG AACCAAGCCG ACCTGCCCCC CTACCGCTCC 780
CGGTCCGGGA GTGCCAACAG TGCCAGCTCC CAGGCAGCCA GCAGCCTCCT GAGCGTGGAA 840
GACACCAGCC ACAGCCCGGG GCAGCCTGGC CCCCAGGAGG GCACTGCTGA GCCCAGGGAC 900
TCTGAGCCGA GCCAGGGCAC TGAAGAAGTA TAA



Seq ID NO: 679 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
MCSNGRCIPG AWQCDGLPDC FDKSDEKECP KAKSKCGPTF FPCASGIHCI IGRFRCNGFE
DCPDGSDEEN CTANPLLCST ARYHCKNGLC IDKSFICDGQ NNCQDNSDEE SCESSQEPGS
GQVFVTSENQ LVYYPSITYA IIGSSVIFVL VVALLALVLH HQRKRNNLMT LPVHRLQHPV
LLSRLVVLDH PHHCNVTYNV NNGIQYVASQ AEQNASEVGS PPSYSEALLD QRPAWYDLPP
PPYSSDTESL NQADLPPYRS RSGSANSASS QAASSLLSVE DTSHSPGQPG PQEGTAEPRD
SEPSQGTEEV
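
The relationship between a DNA listing, its "Coding sequence" coordinates, and the protein listing that follows it can be illustrated with a minimal Python sketch. It assumes the coordinates are 1-based and inclusive and that the standard genetic code applies; the function name `translate_cds` and the use of `X` for unreadable codons are illustrative choices, not part of the specification. The input shown is the first 60 bases of SEQ ID NO: 678, read across the six columns of the listing.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive interval start..end of a DNA listing.

    Whitespace between the 10-base groups is ignored. Translation stops at
    the first stop codon. Codons containing characters other than A, C, G, T
    (for example OCR damage) are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    cds = seq[start - 1:end]
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of SEQ ID NO: 678 (coding sequence begins at position 1).
first_row = "ATGTGCAGCA ATGGACGGTG CATCCCGGGC GCCTGGCAGT GTGACGGGCT GCCTGACTGC"
print(translate_cds(first_row, 1, 60))  # MCSNGRCIPGAWQCDGLPDC
```

The printed residues match the first two 10-residue groups of SEQ ID NO: 679, which is one way to cross-check a DNA listing against its protein listing when individual characters are in doubt.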

Seq ID NO: 680 DNA sequence 
Nucleic Acid Accession #: S78203.1 
Coding sequence: 1..2190 



1 11 21 31 41 51
| | | | | |
ATGAATCCTT TCCAGAAAAA TGAGTCCAAG GAAACTCTTT TTTCACCTGT CTCCATTGAA 60
GAGGTACCAC CTCGACCACC TAGCCCTCCA AAGAAGCCAT CTCCGACAAT CTGTGGCTCC 120
AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG CTTTTCCTAT 180
TATGGAATGA AAGCTGTGCT GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240
ACCTCCACAT CTATATACCA TGCCTTCAGC AGCCTCTGTT ATTTTACTCC CATCCTGGGA 300
GCAGCCATTG CTGACTCGTG GTTGGGAAAA TTCAAGACAA TCATCTATCT CTCCTTGGTG 360
TATGTGCTTG GCCATGTGAT CAAGTCCTTG GGTGCCTTAC CAATACTGGG AGGACAAGTG 420
GTACACACAG TCCTATCATT GATCGGCCTG AGTCTAATAG CTTTGGGGAC AGGAGGCATC 480
AAACCCTGTG TGGCAGCTTT TGGTGGAGAC CAGTTTGAAG AAAAACATGC AGAGGAACGG 540
ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG GGAGCTTGAT TTCTACATTT 600
ATCACACCCA TGCTGAGAGG AGATGTGCAA TGTTTTGGAG AAGACTGCTA TGCATTGGCT 660
TTTGGAGTTC CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 720
ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC AAGTTTTCAA ATGTATCTGG 780
TTTGCTATTT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 840
CTAGACTGGG CAGCTGAGAA ATATCCAAAG CAGCTCATTA TGGATGTAAA GGCACTGACC 900
AGGGTACTAT TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 960
TCACGATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT TGGGGTTTTT TGTGCTTCAG 1020
CCGGACCAGA TGCAGGTTCT AAATCCCTTT CTGGTTCTTA TCTTCATCCC GTTGTTTGAC 1080
TTTGTCATTT ATCGTCTGGT CTCCAAGTGT GGAATTAACT TCTCATCACT TAGGAAAATG 1140
GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTG CGGCAGCTGT AGAGATAAAA 1200
ATAAATGAAA TGGCCCCAGC CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 1260
CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATG AAAACAATTC TCTGTTGATA 1320
GAGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 1380
AGCCAGGATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 1440
GTGCAGGAGA AGAACTGGTA CAGTCTTGTC ATTCGTGAAG ATGGGAACAG TATCTCCAGC 1500
ATGATGGTAA AGGATACAGA AAGCAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 1560
AACACTTTGC ATAAAGATGT CAACATCTCC CTGAGTACAG ATACCTCTCT CAATGTTGGT 1620
GAAGACTATG GTGTGTCTGC TTATAGAACT GTGCAAAGAG GAGAATACCC TGCAGTGCAC 1680
TGTAGAACAG AAGATAAGAA CTTTTCTCTG AATTTGGGTC TTCTAGACTT TGGTGCAGCA 1740
TATCTGTTTG TTATTACTAA TAACACCAAT CAGGGTCTTC AGGCCTGGAA GATTGAAGAC 1800
ATTCCAGCCA ACAAAATGTC CATTGCGTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 1860
GGGGAGGTCA TGTTCTCTGT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 1920
ATGAAATCTG TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGGAA TATCATCGTG 1980
CTTGTTGTGG CACAGTTCAG TGGCCTGGTA CAGTGGGCCG AATTCATTTT GTTTTCCTGC 2040
CTCCTGCTGG TGATCTGCCT GATCTTCTCC ATCATGGGCT ACTACTATGT TCCTGTAAAG 2100
ACAGAGGATA TGCGGGGTCC AGCAGATAAG CACATTCCTC ACATCCAGGG GAACATGATC 2160
AAACTAGAGA CCAAGAAGAC AAAACTCTGA



Seq ID NO: 681 Protein sequence 
Protein Accession #: AAB34388.1



1
I 

MNPFQKNESK 
YGMKAVLILY 
YVLGHVIKSL 
TRYFSVFYLS 
IYNKPPPEGN 
RVLFLYIPLP 
FVIYRLVSKC 
LADDEVKVTV 
VQEKNWYSLV 
EDYGVSAYRT 
IPANKMSIAW 
LWAQFSGLV 
KLETKKTKL 



11 
I 

ETLPSPVSIE 
FLYFLHWNED 
GALPILGGQV 
INAGSLISTF 
IVAQVPKCIW 
MPWALLDQQG 
GIKFSSLRKM 
VGNENNSLLI 
IREDGNSISS 
VQRGEYPAVH 
QLPQYALVTA 
QWAEFILFSC 



21 
I 

EVPPRPPSPP 
TSTSIYHAPS 
VHTVLSLIGL 
ITPMLRGDVQ 
FAISNRFKNR 
SRWTLQAIRM 
AVGMILACLA 
ESIKSFQKTP 
MMVKDTESKT 
CRTEDKNFSL 
GEVMFSVTGL 
LLLVI CLIPS 



31 
I 

KKPSPTICGS 
SLCYFTPILG 
SLIALGTGGI 
CFGEDCYALA 
SGDIPKRQKW 
NRNLGPPVLQ 
FAVAAAVEIK 
HYSKLHUCTK 
TNGMTTVRFV 
NLGLLDPGAA 
EFSYSQAPSS 
IMGYYYVPVK 



Seq ID NO: 682 DNA sequence

Nucleic Acid Accession #: NM_016077.1 

Coding sequence: 128..667



1 
I 

CGCGATAGAA 
ACTGTAGATG 
CTTGGCTGTT 
GATGCTCCCC 
CTTGGGAGAC 
AAAAGGGAAA 
AAGAAGAAAT 
CAAAGCTCCT 
GACTGTAAGT 
CCTAGGGATT 
TTACTAGGTG 
GATTCTAACA 
AAACCTATTC 



11 
I 

ATTCTTGATC 
ACGTGTTCGC 
CCCTCCAAAT 
GGAGTTGCTT 
AAAAGCAAGA 
AGCGGGGAGT 
GTGGCTGCCC 
CCTGAAATGC 
GATGAAGAAA 
TTAATTCAAG 
GGGCCAGGAC 
GACTTTGATA 
ACAAAAGCTG 
CCATGTTCTA 



21 
I 

CGGAACTTTO 
TTGCCCAGAA 
CCTTGGTTAT 
GTGGCATGTG 
CGAGCAAGAC 
ACAAGATGAT 
AGTGCTCTCA 
TCAAACAATG 
CCCTGATTGC 
ATGCTGGACG 
CAGCAGACCT 
TGACAACAAC 
AATTTCTTCA 



31 
I 

TCACCCAGGA 
GAAGGGAAGG 
GGAATATTTG 
CCTGGGCTGG 
ACACACAGAT 
TCTTGTGGTT 
TGCTGCTGTT 
GGAATACTGT 
ATTATTGGCC 
TACTCAGATT 
AATTGACAAA 
CCCTCCATCA 
CCCAACTTAA 



41 
I 

NYPLSIAFIV 
AAIADSWLGK 
KPCVAAFGGO 
FGVPGLLMVI 
LDHAAEKYPK 
PDQMQVLNPF 
INEMAPAQSG 
SQDFHFKLKY 
NTLHKDVNIS 
YLFVITNNTN 
MKSVLQAAWL 
TEDMRGPADX 



41 
I 

ACCCCGGAAG 
CGCGAGTGAG 
GCTCATCCCA 
AGCCTTCGAG 
ACTGAAAGTG 
CGAAATGACT 
TCAGCCTACA 
GGCCAGCCCA 
CATGCAAAAA 
GCACCAGGCT 
GTCACTGGTC 
CAAGTGTTTG 
ATGTTCTTGA 



51 

I 

VNEPCERPSY 
FKTIIYLSLV 



ALWFAMGSK 
QLIMDVKALT 
LVLIFIPLFD 
PQEVFLQVLN 
HNLSLYTEHS 
LSTDTSLNVG 
QGLQAHK1ED 
LTIAVGNIIV 
HIPHIQGNMI 



51 
1 

AGGTAGCTCA 
GAAAGGAGGT 
GTACACTCGG 
TATGCTTTGG 
AAGCAAGCAT 
TAAAGATGGG 
AGCAGATTCA 
AGGTGGTGGT 
TGCTGGGACT 
CTCAAACTGT 
ACCTAAAACT 
AAGCCTGTCA 
GATGAAAATA. 



Seq ID NO: 683 Protein sequence 
Protein Accession #: NP_057161.1

1 11 21 31 41 51 

I I I I I I 

MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTESEASILG
DSGEYKMILV VRNDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVVVKA
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY

Seq ID NO: 684 DNA sequence 

Nucleic Acid Accession #: NM_004864.1 

Coding sequence: 26..952



CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGCG 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGTGG 
GCCCGCGCTG 
ATCTTCGTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAGGTG 
CATGCACGCG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 
AAAA 
GCAACCTGCA
CTGGTGTTGC 
AGCCGCGCAA 
rPGCGGAAAC 
TCGAACACCG 
GGATCCGGCG 
GAGGCCTCCC 
GACGTGACAC 
CACCTGCGAC 
GCACGGCCCC 
CGTGCGCGCA 
GTCCGCGOGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 




GTTTCCCGGG 
GCTACGAGGA 
ACCTCGTCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGOG 
TGTCGCCGCC 
AGCTGGAGTT 
ACGGGGACGA 
CGCTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 
CGGGCAAGAA
GTGGCTGCCG 
ACCCTCAGAG 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCGTATC 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTGGGCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAAGC 
CTCAGGACGG
CATGGGGGCG 
TTGCACTCCG 
AGGCTGCGGG 
GTCCGGATAC 
TCTCGGGCCG 
CGGCTGTCCC 
AGCCTTGCAA 
TCGGACCAAC 
CCGCAAGCCG 
GGGCCCGGGC 
GCQGATTGGG 
AGCCAGTTCC 
CCCGACACGG 
CAAAAGACCG 
CACTGCATAT 
GTTGTCCTGC 
GCTGTATTTA 
TCGGGGGCTG 
TGTCTGAACT 
TGAATGGCTC
CCCTGTCTCT 
AAGACTCCAG 
CCAACCAGAG 
TCACGCCAGA 
CCCTTCCCGA 
CGACGGCGTC 
GACCCCAAGC 
TGCTGGCAGA 
CCAGGGGGCG 
GTTGCTGCCG 
TGCTGTCGCC 
GGGCGGCAAA 
AGCCAGCGCC 
ACACCGGGGT 
GAGCAGTCCT 
CCTGTGGAAT 
TATAAGTCTG 
GTCTGATGGA 
GTTAAAAAAA 



Seq ID NO: 685 Protein sequence 
Protein Accession #: NP_004855.1

MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 60
EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LKLRISRAAL PEGLPEASRL 120
HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALKLRLS PPPSQSDQLL AESSSARPQL 180
ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240
IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300
LAKDCHCI



Seq ID NO: 686 DNA sequence 
Nucleic Acid Accession #: NM_002423.2
Coding sequence: 48..851

1 11 21 31 41 51 
ACCAAATCAA CCATAGGTCC AAGAACAATT GTCTCTGGAC GGCAGCTATG CGACTCACCG 60

TGCTGTGTGC TGTGTGCCTG CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT CAGGAGGCGG 120

GAGGCATGAG TGAGCTACAG TGGGAACAGG CTCAGGACTA TCTCAAGAGA TTTTATCTCT 180 

ATGACTCAGA AACAAAAAAT GCCAACAGTT TAGAAGCGAA ACTCAAGGAG ATGCAAAAAT 240 

TCTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC 300

CCAGATGTGG AGTGCCAGAT GTTGCAGAAT ACTCACTATT TCCAAATAGC CCAAAATGGA 360 

CTTCCAAAGT GGTCACCTAC AGGATCGTAT CATATACTCG AGACTTACCG CATATTACAG 420 

TGGATCGATT AGTGTCAAAG GCTTTAAACA TGTGGGGCAA AGAGATCCCC CTGCATTTCA 480 

GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 540 

ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 600

GTCTCGGAGG AGATGCTCAC TTCGATGAGG ATGAACGCTG GACGGATGGT AGCAGTCTAG 660

GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 720 

CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 780 

AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA AGTAATTCAA 840 

GAAAGAAATA GAAACT7XAG GCAGAACATC CATTCATTCA TTCATTGGAT TGTATATCAT 900

TGTTGCACAA TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 960 

CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 1020 

ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGOGCGTAG ATGTCAATAA 1080 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 
Seq ID NO: 687 Protein sequence
Protein Accession #: NP_002414.1



1 11 21 31 41 51
MRLTVLCAVC LLPGSLALPL PQEAGGMSEL QWEQAQDYLK RFYLYDSETK NANSLEAKLK 60

EMQKFFGLPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKVVT YRIVSYTRDL 120

PHITVDRLVS KALNMWGKEI PLHFRKVVWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 180

APGTGLGGDA HFDEDERWTD GSSLGINFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 240
PQNFKLSQDD IKGIQKLYGK RSNSRKK 



Seq ID NO: 688 DNA sequence 

Nucleic Acid Accession #: NM_005221.3

Coding sequence: 1..870
ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60

TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 120 

TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 240 

AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACTATAGCTA CGCTAGCTCC 300 

TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 360 

GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 



Seq ID NO: 689 Protein sequence 
Protein Accession #: NP_005212.1



1 11 21 31 41 51 
MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60

PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120

EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180

VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY

It is understood that the examples described above in no way serve to limit the true
scope of this invention, but rather are presented for illustrative purposes. All publications,
sequences of accession numbers, and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference.

WHAT IS CLAIMED IS:

1. A method of detecting a lung cancer-associated transcript in a cell
from a patient, the method comprising contacting a biological sample from the patient with a
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence
as shown in Tables 1A-16.

2. The method of claim 1, wherein the polynucleotide selectively
hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1A-16.

3. The method of claim 1, wherein the biological sample is a tissue
sample.

4. The method of claim 1, wherein the biological sample comprises
isolated nucleic acids.

5. The method of claim 4, wherein the nucleic acids are mRNA.

6. The method of claim 4, further comprising the step of amplifying
nucleic acids before the step of contacting the biological sample with the polynucleotide.

7. The method of claim 1, wherein the polynucleotide comprises a
sequence as shown in Tables 1A-16.

8. The method of claim 1, wherein the polynucleotide is labeled.

9. The method of claim 8, wherein the label is a fluorescent label.

10. The method of claim 1, wherein the polynucleotide is immobilized on
a solid surface.

11. The method of claim 1, wherein the patient is undergoing a therapeutic
regimen to treat lung cancer.

12. The method of claim 1, wherein the patient is suspected of having lung
cancer.

13. A method of monitoring the efficacy of a therapeutic treatment of lung
cancer, the method comprising the steps of:
(i) providing a biological sample from a patient undergoing the therapeutic
treatment; and
(ii) determining the level of a lung cancer-associated transcript in the
biological sample by contacting the biological sample with a polynucleotide that selectively
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16,
thereby monitoring the efficacy of the therapy.

14. The method of claim 13, further comprising the step of: (iii) comparing
the level of the lung cancer-associated transcript to a level of the lung cancer-associated
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

15. The method of claim 13, wherein the patient is a human.

16. A method of monitoring the efficacy of a therapeutic treatment of lung
cancer, the method comprising the steps of:
(i) providing a biological sample from a patient undergoing the therapeutic
treatment; and
(ii) determining the level of a lung cancer-associated antibody in the biological
sample by contacting the biological sample with a polypeptide encoded by a polynucleotide
that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in
Tables 1A-16, wherein the polypeptide specifically binds to the lung cancer-associated
antibody, thereby monitoring the efficacy of the therapy.

17. The method of claim 16, further comprising the step of: (iii) comparing
the level of the lung cancer-associated antibody to a level of the lung cancer-associated
antibody in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

18. The method of claim 16, wherein the patient is a human.

19. A method of monitoring the efficacy of a therapeutic treatment of lung
cancer, the method comprising the steps of:
(i) providing a biological sample from a patient undergoing the therapeutic
treatment; and
(ii) determining the level of a lung cancer-associated polypeptide in the
biological sample by contacting the biological sample with an antibody, wherein the antibody
specifically binds to a polypeptide encoded by a polynucleotide that selectively hybridizes to
a sequence at least 80% identical to a sequence as shown in Tables 1A-16, thereby
monitoring the efficacy of the therapy.

20. The method of claim 19, further comprising the step of: (iii) comparing
the level of the lung cancer-associated polypeptide to a level of the lung cancer-associated
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

21. The method of claim 19, wherein the patient is a human. 

22. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1A-16.

23. The nucleic acid molecule of claim 22, which is labeled. 

24. The nucleic acid of claim 23, wherein the label is a fluorescent label.

25. An expression vector comprising the nucleic acid of claim 22. 

26. A host cell comprising the expression vector of claim 25. 

27. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1A-16. 

28. An antibody that specifically binds a polypeptide of claim 27. 

29. The antibody of claim 28, further conjugated to an effector component. 

30. The antibody of claim 29, wherein the effector component is a 
fluorescent label. 

31. The antibody of claim 29, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

32. The antibody of claim 29, which is an antibody fragment.

33. The antibody of claim 29, which is a humanized antibody.

34. A method of detecting a lung cancer cell in a biological sample from a
patient, the method comprising contacting the biological sample with an antibody of claim
28.

35. The method of claim 34, wherein the antibody is further conjugated to
an effector component.

36. The method of claim 35, wherein the effector component is a
fluorescent label.

37. A method of detecting antibodies specific to lung cancer in a patient,
the method comprising contacting a biological sample from the patient with a polypeptide
encoded by a nucleic acid that comprises a sequence from Tables 1A-16.

38. A method for identifying a compound that modulates a lung cancer-associated
polypeptide, the method comprising the steps of:
(i) contacting the compound with a lung cancer-associated polypeptide, the
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least
80% identical to a sequence as shown in Tables 1A-16; and
(ii) determining the functional effect of the compound upon the polypeptide.

39. The method of claim 38, wherein the functional effect is a physical
effect.

40. The method of claim 38, wherein the functional effect is a chemical
effect.

41. The method of claim 38, wherein the polypeptide is expressed in a
eukaryotic host cell or cell membrane.

42. The method of claim 38, wherein the functional effect is determined by
measuring ligand binding to the polypeptide.

43. The method of claim 38, wherein the polypeptide is recombinant.

44. A method of inhibiting proliferation of a lung cancer-associated cell to
treat lung cancer in a patient, the method comprising the step of administering to the subject a
therapeutically effective amount of a compound identified using the method of claim 38.

45. The method of claim 44, wherein the compound is an antibody.

46. The method of claim 45, wherein the patient is a human.

47. A drug screening assay comprising the steps of:
(i) administering a test compound to a mammal having lung cancer or a cell
isolated therefrom;
(ii) comparing the level of gene expression of a polynucleotide that selectively
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16 in a
treated cell or mammal with the level of gene expression of the polynucleotide in a control
cell or mammal, wherein a test compound that modulates the level of expression of the
polynucleotide is a candidate for the treatment of lung cancer.

48. The assay of claim 47, wherein the control is a mammal with lung
cancer or a cell therefrom that has not been treated with the test compound.

49. The assay of claim 47, wherein the control is a normal cell or mammal.

50. A method for treating a mammal having lung cancer comprising
administering a compound identified by the assay of claim 47.

51. A pharmaceutical composition for treating a mammal having lung
cancer, the composition comprising a compound identified by the assay of claim 47 and a
physiologically acceptable excipient.
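
Several claims recite a sequence "at least 80% identical" (or 95% identical) to a listed sequence. The following Python sketch shows one simple way such a percentage could be computed for two sequences that have already been aligned to the same length. It is an illustration only: the function names, the treatment of gap characters as mismatches, and the choice of the alignment length as the denominator are assumptions made here, and the sketch does not reproduce whatever alignment algorithm and parameters the specification defines for determining identity.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of positions that match between two pre-aligned sequences.

    Both inputs must have the same non-zero length; '-' marks a gap and a
    position containing a gap is never counted as a match (an assumption
    of this sketch).
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True when the percent identity is at least the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 10-base example: the candidate differs at one position.
reference = "ATGTGCAGCA"
candidate = "ATGTGCAGCT"
print(percent_identity(reference, candidate))                 # 90.0
print(meets_identity_threshold(reference, candidate, 95.0))   # False
```

In practice the two sequences would first be aligned by a dedicated tool, and the reported identity depends on the alignment parameters, so a figure from this sketch should be read as illustrative.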